Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/105600
Title: | Discriminative feature extraction for speech recognition using continuous output codes | Authors: | Dehzangi, Omid Ma, Bin Chng, Eng Siong Li, Haizhou |
Keywords: | DRNTU::Engineering::Computer science and engineering | Issue Date: | 2012 | Source: | Dehzangi, O., Ma, B., Chng, E. S., & Li, H. (2012). Discriminative feature extraction for speech recognition using continuous output codes. Pattern Recognition Letters, 33(13), 1703-1709. | Series/Report no.: | Pattern recognition letters | Abstract: | Feature transformation techniques have been widely investigated to reduce feature redundancy and to introduce additional discriminative information with the aim to improve the performance of automatic speech recognition (ASR). In this paper, we propose a novel method to obtain discriminative feature transformation based on output coding technique for speech recognition. The output coding transformation projects the speech features from their original space to a new one where each dimension of the features captures information to distinguish different phones. Using polynomial expansion, the short-time spectral features are first expanded to a high-dimensional space where the generalized linear discriminant sequence kernel is applied on the sequences of input feature vectors. Then, the output coding transformation formulated via a set of linear SVMs projects the sequences of high dimensional vectors into a tractable low-dimensional feature space where the resultant features are well-separated continuous output codes for the subsequent multi-class classification problem. Our experimental results on the TIMIT corpus show that the proposed features achieve 10.5% ASR error rate reduction over the conventional spectral features. | URI: | https://hdl.handle.net/10356/105600 http://hdl.handle.net/10220/17340 |
ISSN: | 0167-8655 | DOI: | 10.1016/j.patrec.2012.05.012 | Schools: | School of Computer Engineering | Fulltext Permission: | none | Fulltext Availability: | No Fulltext |
Appears in Collections: | SCSE Journal Articles |
SCOPUSTM
Citations
20
14
Updated on Dec 1, 2023
Web of ScienceTM
Citations
20
9
Updated on Oct 27, 2023
Page view(s) 50
582
Updated on Dec 7, 2023
Google ScholarTM
Check
Altmetric
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.