Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/167270
Title: Recovering 6D object pose and semantic information from RGBD image
Authors: Lee, Wen Jie
Keywords: Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
Issue Date: 2023
Publisher: Nanyang Technological University
Source: Lee, W. J. (2023). Recovering 6D object pose and semantic information from RGBD image. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/167270
Project: B058 
Abstract: The applications of 6-Dimentional (6D) pose estimation plays an important role in today’s technology. The increasing prominence of robotics, automation, and augmented reality necessitates the need for precise and effective 6D pose information to ensure its successful execution. For industrial applications, easily replicable datasets and robust pose estimations are crucial in determining the possible applications for the 6D pose method. The use of color (Red, Green, Blue) and depth information, also known as RGBD images have emerged as a popular source of data for 6D pose estimation. This project explores two popular RGBD pointcloud processing methods for 6D pose estimation: Normalized Object Coordinate Space (NOCS) and Deep Point-wise 3D Keypoints Voting Network (PVN3D). A review of the methodology, results, and dataset requirements regarding the two methods were discussed. Upon further inspecting and comparison of the methods, the replication of the NOCS dataset was found difficult to replicate with the limited knowledge and resources in hand, while PVN3D was found to meet the requirements. The LineMOD dataset which was utilized by PVN3D was shown to be replaceable using the in-house equipment provided. Furthermore, the results obtained from the training and testing of NOCS did not perform up to expectations, while the PVN3D results were acceptable. Overall, this project concludes that PVN3D is a viable 6D pose estimation method for industrial applications.
URI: https://hdl.handle.net/10356/167270
Schools: School of Mechanical and Aerospace Engineering 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:MAE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
B058_Recovering 6D Object Pose And Semantic Information From RGBD Image_LEEWENJIE.pdf
  Restricted Access
Undergraduate project report4 MBAdobe PDFView/Open

Page view(s)

199
Updated on May 7, 2025

Download(s)

7
Updated on May 7, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.