Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/181618
Full metadata record
DC FieldValueLanguage
dc.contributor.authorYeo, Clementen_US
dc.date.accessioned2024-12-11T06:19:13Z-
dc.date.available2024-12-11T06:19:13Z-
dc.date.issued2024-
dc.identifier.citationYeo, C. (2024). Image processing using artificial intelligence. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/181618en_US
dc.identifier.urihttps://hdl.handle.net/10356/181618-
dc.description.abstractHuman pose estimation is an important part of computer vision that determines the positions and orientations of a human body in 2D or 3D images and videos. This project explores the application of Artificial Intelligence (AI) techniques for 3D HPE, specifically leveraging the MixSTE: Seq2seq Mixed Spatio-Temporal Encoder, used to estimate poses from video sequences. MixSTE combines spatial and temporal feature extraction to accurately predict human poses by modeling complex body dynamics over time. The main goal of this work is to develop and assess MixSTE for human pose estimation in videos, focusing on enhancing the accuracy and reliability of pose predictions, even in challenging conditions involving occlusions and diverse body movements. The proposed system uses a sequence-to-sequence (seq2seq) architecture to effectively encode and decode spatial and temporal information, providing a significant advancement over existing methods that often struggle with temporal inconsistencies. The experiments were performed on benchmark datasets like Human3.6M, and the results indicate that the proposed approach achieves high accuracy in 3D pose estimation, outperforming several state-of-the-art methods in terms of Mean Per Joint Position Error (MPJPE). This work demonstrates the potential of MixSTE for real-world applications, including activity recognition, human-computer interaction, and animation, contributing to the broader field of AI-driven human motion analysis.en_US
dc.language.isoenen_US
dc.publisherNanyang Technological Universityen_US
dc.subjectEngineeringen_US
dc.titleImage processing using artificial intelligenceen_US
dc.typeFinal Year Project (FYP)en_US
dc.contributor.supervisorYap Kim Huien_US
dc.contributor.schoolSchool of Electrical and Electronic Engineeringen_US
dc.description.degreeBachelor's degreeen_US
dc.contributor.supervisoremailEKHYap@ntu.edu.sgen_US
item.grantfulltextrestricted-
item.fulltextWith Fulltext-
Appears in Collections:EEE Student Reports (FYP/IA/PA/PI)
Files in This Item:
File Description SizeFormat 
FYP Final Report.pdf
  Restricted Access
1.18 MBAdobe PDFView/Open

Page view(s)

73
Updated on Apr 26, 2025

Download(s)

11
Updated on Apr 26, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.