Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/79851
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Xiao, Xiong | en |
dc.contributor.author | Nwe, Tin Lay | en |
dc.contributor.author | Chng, Eng Siong | en |
dc.contributor.author | Ma, Bin | en |
dc.contributor.author | Li, Haizhou | en |
dc.contributor.author | Wang, Lei | en |
dc.contributor.author | Ni, Chongjia | en |
dc.contributor.author | Leung, Cheung-Chi | en |
dc.contributor.author | You, Changhuai | en |
dc.contributor.author | Xie, Lei | en |
dc.contributor.author | Xu, Haihua | en |
dc.date.accessioned | 2019-05-22T05:09:30Z | en |
dc.date.accessioned | 2019-12-06T13:35:20Z | - |
dc.date.available | 2019-05-22T05:09:30Z | en |
dc.date.available | 2019-12-06T13:35:20Z | - |
dc.date.issued | 2016 | en |
dc.identifier.citation | Wang, L., Ni, C., Leung, C. -C., You, C., Xie, L., Xu, H., . . . Li, H. (2016). The NNi Vietnamese speech recognition system for mediaeval 2016. Multimedia Benchmark Workshop, 1739. | en |
dc.identifier.uri | https://hdl.handle.net/10356/79851 | - |
dc.description.abstract | This paper provides an overall description of the Vietnamese speech recognition system developed by the joint team for MediaEval 2016. The submitted system consisted of 3 subsystems, and adopted different deep neural network-based techniques such as fMLLR transformed bottleneck features, sequence training, etc. Besides the acoustic modeling techniques, speech data augmentation was also examined to develop a more robust acoustic model. The I2R team collected a number of text resources from the Internet and made them available to other participants in the task. The web text crawled from the Internet was used to train a 5-gram language model. The submitted system obtained the token error rate (TER) of 15.1, 23.0 and 50.5 on Devel local set, Devel set and Test set, respectively. | en |
dc.format.extent | 3 p. | en |
dc.language.iso | en | en |
dc.rights | © 2016 The Author(s). | en |
dc.subject | Vietnamese | en |
dc.subject | Recognition | en |
dc.subject | DRNTU::Engineering::Computer science and engineering | en |
dc.title | The NNi Vietnamese speech recognition system for mediaeval 2016 | en |
dc.type | Conference Paper | en |
dc.contributor.school | School of Computer Science and Engineering | en |
dc.contributor.conference | Multimedia Benchmark Workshop | en |
dc.description.version | Published version | en |
dc.identifier.url | http://ceur-ws.org/Vol-1739/MediaEval_2016_paper_52.pdf | en |
item.fulltext | With Fulltext | - |
item.grantfulltext | open | - |
Appears in Collections: | SCSE Conference Papers |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
The NNI Vietnamese Speech Recognition System.pdf | 182.16 kB | Adobe PDF | ![]() View/Open |
Page view(s)
114
Updated on Apr 17, 2021
Download(s) 50
17
Updated on Apr 17, 2021
Google ScholarTM
Check
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.