Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/68997
Full metadata record
DC FieldValueLanguage
dc.contributor.authorIrwan Widjojo-
dc.contributor.authorLee, Kean Hin-
dc.date.accessioned2016-08-23T04:47:08Z-
dc.date.available2016-08-23T04:47:08Z-
dc.date.issued2016-
dc.identifier.urihttp://hdl.handle.net/10356/68997-
dc.description.abstractOne of the most challenging tasks in automatic visual speech recognition is the extraction of feature parameters from image sequences of lips. There are primarily two approaches to extract visual speech information from image sequences, i.e. model-based approach and pixel-based approach. The advantage of mode1-based approach is that the parameters of the contour model of the lip are less influenced by the variability of lighting condition, lip location and rotation but the construction of an efficient and yet robust lip contour that is capable of tracking the lip has made this task difficult. The pixel-based approach on the other hand must take the variability of lighting condition, lip rotation and location into account. Despite many researches undertaken, lip tracking remains a challenging task due to the diverse variation of face images. The pixel based approach was adopted in this project. Raw data for visual speech recognition were obtained using digital camcorder. These video recordings were converted to image sequences and the lip of the speaker on each frame was extracted. The lip boundaries were obtained after the lip on each frame was located. The contour of the lip was drawn based on the lip boundaries using least square polynomial. Ten important visual speech features for all frames were extracted and then quantized. These vector sequences were ready to be used for training of HMMs. The trained models were used for recognition of unknown vector sequences.en_US
dc.format.extent93 p.en_US
dc.language.isoenen_US
dc.rightsNanyang Technological University-
dc.subjectDRNTU::Engineering::Electrical and electronic engineeringen_US
dc.titleAutomatic visual speech recognitionen_US
dc.typeFinal Year Project (FYP)en_US
dc.contributor.supervisorFoo Say Weien_US
dc.contributor.schoolSchool of Electrical and Electronic Engineeringen_US
dc.description.degreeBachelor of Engineeringen_US
item.fulltextWith Fulltext-
item.grantfulltextrestricted-
Appears in Collections:EEE Student Reports (FYP/IA/PA/PI)
Files in This Item:
File Description SizeFormat 
IrwanWidjojo_LeeKeanHin2003.pdf
  Restricted Access
16.56 MBAdobe PDFView/Open

Page view(s)

134
Updated on Jun 20, 2021

Download(s)

10
Updated on Jun 20, 2021

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.