Recognition of visual speech elements using adaptively boosted hidden Markov models

DSpace/Manakin Repository


Search DR-NTU

Advanced Search Subject Search


My Account

Recognition of visual speech elements using adaptively boosted hidden Markov models

Show simple item record

dc.contributor.author Foo, Say Wei
dc.contributor.author Lian, Yong
dc.contributor.author Dong, Liang
dc.date.accessioned 2009-04-27T01:59:29Z
dc.date.available 2009-04-27T01:59:29Z
dc.date.copyright 2004
dc.date.issued 2009-04-27T01:59:29Z
dc.identifier.citation Foo, S. W., Lian, Y., & Dong, L. (2004). Recognition of visual speech elements using adaptively boosted hidden Markov models. IEEE Transactions on Circuits and Systems for Video Technology, 14(5), 693-705.
dc.identifier.issn 1051-8215
dc.identifier.uri http://hdl.handle.net/10220/4584
dc.description.abstract The performance of automatic speech recognition (ASR) system can be significantly enhanced with additional information from visual speech elements such as the movement of lips, tongue, and teeth, especially under noisy environment. In this paper, a novel approach for recognition of visual speech elements is presented. The approach makes use of adaptive boosting (AdaBoost) and hidden Markov models (HMMs) to build an AdaBoost-HMM classifier. The composite HMMs of the AdaBoost-HMM classifier are trained to cover different groups of training samples using the AdaBoost technique and the biased Baum–Welch training method. By combining the decisions of the component classifiers of the composite HMMs according to a novel probability synthesis rule, a more complex decision boundary is formulated than using the single HMM classifier. The method is applied to the recognition of the basic visual speech elements. Experimental results show that the AdaBoost-HMM classifier outperforms the traditional HMM classifier in accuracy, especially for visemes extracted from contexts.
dc.format.extent 13 p.
dc.language.iso en
dc.relation.ispartofseries IEEE transactions on circuits and systems for video technology
dc.rights © 2006 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.
dc.title Recognition of visual speech elements using adaptively boosted hidden Markov models
dc.type Journal Article
dc.identifier.openurl http://sfxna09.hosted.exlibrisgroup.com:3410/ntu/sfxlcl3?sid=metalib:EVII&id=doi:10.1109/TCSVT.2004.826773&genre=&isbn=&issn=10518215&date=2004&volume=14&issue=5&spage=693&epage=705&aulast=Foo&aufirst=%20Say%20Wei&auinit=&title=IEEE%20Transactions%20on%20Circuits%20and%20Systems%20for%20Video%20Technology&atitle=Recognition%20of%20visual%20speech%20elements%20using%20adaptively%20boosted%20hidden%20markov%20models
dc.identifier.doi http://dx.doi.org/10.1109/TCSVT.2004.826773
dc.description.version Published version

Files in this item

Files Size Format View Description
J13-IEEECASVideoTech2004DL.pdf 533.0Kb PDF View/Open Published version

This item appears in the following Collection(s)

Show simple item record


Total views

All Items Views
Recognition of visual speech elements using adaptively boosted hidden Markov models 318

Total downloads

All Bitstreams Views
J13-IEEECASVideoTech2004DL.pdf 267

Top country downloads

Country Code Views
United States of America 97
China 92
Russian Federation 12
Singapore 9
Australia 7

Top city downloads

city Views
Mountain View 69
Beijing 62
Singapore 9
Perth 7
Mohammedia 4

Downloads / month

  2015-01 2015-02 2015-03 total
J13-IEEECASVideoTech2004DL.pdf 0 0 8 8