Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/41749
Title: Psychoacoustic model for robust speech recognition
Authors: Luo, Xue Wen
Keywords: DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
Issue Date: 2008
Source: Luo, X. W. (2008). Psychoacoustic model for robust speech recognition. Master’s thesis, Nanyang Technological University, Singapore.
Abstract: This thesis presents a detailed study of psychoacoustic modeling for feature extraction in robust speech recognition. In an automatic speech recognition (ASR) system, feature extraction is critical to the recognizer's performance. The most popular feature vectors for ASR are Mel Frequency Cepstral Coefficients (MFCC). However, it is well known that their performance drops dramatically under noisy conditions. One of the objectives of this thesis is to improve the robustness of a recognizer. Compared to an ASR system, humans are good at tolerating background noise; hence psychoacoustic modeling of the human hearing system is investigated and integrated into the feature extraction process of a speech recognizer to increase its robustness.
URI: https://hdl.handle.net/10356/41749
DOI: 10.32657/10356/41749
Schools: School of Electrical and Electronic Engineering
Fulltext Permission: open
Fulltext Availability: With Fulltext
Appears in Collections: EEE Theses
Files in This Item:
File | Description | Size | Format
---|---|---|---
LuoXuewen08.pdf | | 3.14 MB | Adobe PDF
Page view(s): 461 (updated on Mar 28, 2024)
Download(s): 281 (updated on Mar 28, 2024)
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.