Title: Cross-speaker viseme mapping using hidden Markov models
Authors: Dong, Liang; Foo, Say Wei; Yong, Lian
Keywords: DRNTU::Engineering::Electrical and electronic engineering::Electronic systems
Issue Date: 2003
Source: Dong, L., Foo, S. W., & Yong, L. (2003). Cross-speaker viseme mapping using hidden Markov models. Proceedings of the 4th International Conference on Information, Communications and Signal Processing and the 4th IEEE Pacific-Rim Conference on Multimedia (pp. 1384-1388). Vol. 3. Singapore: IEEE.
Abstract: In this paper, a method of mapping visual speech between different speakers is proposed. The approach adopts the hidden Markov model (HMM) to model the basic element of visual speech, the viseme. Some mapping terms are applied to associate the state chains decoded for the visemes produced by different speakers. The HMMs configured in this way are trained using Baum-Welch estimation and are used to generate new visemes. Experiments are conducted to map the visemes produced by several speakers to a destination speaker. The experimental results show that the proposed approach provides good accuracy and continuity for mapping the visemes.
DOI: 10.1109/ICICS.2003.1292692
Rights: International Conference on Information, Communications and Signal Processing and the IEEE Pacific-Rim Conference on Multimedia © 2003 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.
Fulltext Permission: open
Fulltext Availability: With Fulltext
Appears in Collections: EEE Conference Papers

Files in This Item:
File Description    Size         Format
Accepted            308.78 kB    Adobe PDF

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.