Title: An improved model of masking effects for robust speech recognition system
Authors: Dai, Peng; Soon, Ing Yann
Keywords: DRNTU::Engineering::Electrical and electronic engineering
Issue Date: 2013
Source: Dai, P., & Soon, I. Y. (2013). An improved model of masking effects for robust speech recognition system. Speech Communication, 55(3), 387-396.
Series/Report no.: Speech Communication
Abstract: The performance of an automatic speech recognition system drops dramatically in the presence of background noise, whereas the human auditory system is far more adept at recognizing noisy speech. This paper proposes a novel auditory modeling algorithm that is integrated into the feature extraction front-end of a Hidden Markov Model (HMM) based system. The proposed algorithm, named LTFC, simulates properties of the human auditory system and applies them to the speech recognition system to enhance its robustness. It integrates simultaneous masking, temporal masking, and cepstral mean and variance normalization into the ordinary mel-frequency cepstral coefficient (MFCC) feature extraction algorithm. The proposed method sharpens the power spectrum of the signal in both the frequency domain and the time domain. Evaluation tests are carried out on the AURORA2 database. Experimental results show that the proposed feature extraction method increases the word recognition rate.
ISSN: 0167-6393
DOI: 10.1016/j.specom.2012.12.005
Fulltext Permission: none
Fulltext Availability: No Fulltext
Appears in Collections: EEE Journal Articles
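
Illustrative note: the abstract names cepstral mean and variance normalization (CMVN) as one stage of the proposed front-end. The sketch below shows only the generic, textbook per-utterance form of CMVN, in which each cepstral coefficient is shifted to zero mean and scaled to unit variance across the frames of one utterance. It is not taken from the paper and may differ from the authors' implementation; the function name `cmvn`, the `eps` guard, and the toy feature values are assumptions made for illustration. The masking stages are not sketched because this record does not describe them in enough detail.

```python
import math


def cmvn(frames, eps=1e-10):
    """Per-utterance cepstral mean and variance normalization (generic form).

    frames: list of equal-length lists, one cepstral feature vector per frame.
    eps:    small constant (an assumption) that avoids division by zero
            when a coefficient is constant over the utterance.
    """
    if not frames:
        return []
    n_frames = len(frames)
    n_coeffs = len(frames[0])

    # Mean of each coefficient across all frames of the utterance.
    means = [sum(f[k] for f in frames) / n_frames for k in range(n_coeffs)]

    # Population standard deviation of each coefficient across frames.
    stds = [
        math.sqrt(sum((f[k] - means[k]) ** 2 for f in frames) / n_frames)
        for k in range(n_coeffs)
    ]

    # Subtract the mean and divide by the standard deviation, per coefficient.
    return [
        [(f[k] - means[k]) / (stds[k] + eps) for k in range(n_coeffs)]
        for f in frames
    ]


# Toy input: three frames with two coefficients each (made-up values).
toy = [[1.0, 10.0], [2.0, 20.0], [3.0, 30.0]]
normalized = cmvn(toy)
```

After normalization, each coefficient track has approximately zero mean and unit variance over the utterance, which reduces the mismatch between clean and noisy feature statistics.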

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.