Title: A smart eavesdropping system : recognizing keywords by human subjects
Authors: Zhang, Chun Meng
Keywords: DRNTU::Engineering::Electrical and electronic engineering
Issue Date: 2015
Abstract: Speech recognition (SR) has gained popularity as a research area with the advance of modern technologies. It translates speech into text with the aid of computers and speech recognition applications. In this final year project, Pocketsphinx, an open-source speech recognition engine from Carnegie Mellon University (CMU), is integrated into our existing Eavesdropping System to perform speech recognition or keyword spotting on the system's output audio files. This report covers the development of the complete speech recognition framework. Pocketsphinx is compiled and installed on a remote Linux server and communicates with the client Matlab programs over network sockets. System parameters are carefully tuned to ensure good performance. Experiments on different combinations of acoustic models (AM) and language models (LM) are also conducted and evaluated. Acoustic model adaptation, which adapts the speech recognizer to a specific acoustic environment or speaker to enhance recognition performance, is also presented. Furthermore, a commercially available speech recognition application, Dragon NaturallySpeaking (DNS) 12, is tested and compared with the Pocketsphinx setup used in the system.
Rights: Nanyang Technological University
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections: EEE Student Reports (FYP/IA/PA/PI)

Files in This Item:
Restricted Access, 2.25 MB, Adobe PDF

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.