Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/102781
Title: | Experiments on cross-language attribute detection and phone recognition with minimal target-specific training data | Authors: | Siniscalchi, Sabato Marco. Lyu, Dau-Cheng. Svendsen, Torbjørn. Lee, Chin-Hui. |
Keywords: | DRNTU::Engineering::Computer science and engineering | Issue Date: | 2011 | Source: | Siniscalchi, S. M., Lyu, D. C., Svendsen, T., & Lee, C. H. (2011). Experiments on cross-language attribute detection and phone recognition with minimal target-specific training data. IEEE transactions on audio, speech, and language processing, 20(3), 875-887. | Series/Report no.: | IEEE transactions on audio, speech, and language processing | Abstract: | A state-of-the-art automatic speech recognition (ASR) system can often achieve high accuracy for most spoken languages of interest if a large amount of speech material can be collected and used to train a set of language-specific acoustic phone models. However, designing good ASR systems with little or no language-specific speech data for resource-limited languages is still a challenging research topic. As a consequence, there has been an increasing interest in exploring knowledge sharing among a large number of languages so that a universal set of acoustic phone units can be defined to work for multiple or even for all languages. This work aims at demonstrating that a recently proposed automatic speech attribute transcription framework can play a key role in designing language-universal acoustic models by sharing speech units among all target languages at the acoustic phonetic attribute level. The language-universal acoustic models are evaluated through phone recognition. It will be shown that good cross-language attribute detection and continuous phone recognition performance can be accomplished for “unseen” languages using minimal training data from the target languages to be recognized. Furthermore, a phone-based background model (PBM) approach will be presented to improve attribute detection accuracies. | URI: | https://hdl.handle.net/10356/102781 http://hdl.handle.net/10220/16448 |
DOI: | 10.1109/TASL.2011.2167610 | Schools: | School of Computer Engineering | Fulltext Permission: | none | Fulltext Availability: | No Fulltext |
Appears in Collections: | SCSE Journal Articles |
SCOPUSTM
Citations
5
73
Updated on Mar 6, 2025
Web of ScienceTM
Citations
5
67
Updated on Oct 29, 2023
Page view(s) 50
532
Updated on Mar 15, 2025
Google ScholarTM
Check
Altmetric
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.