Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/184513
Title: | 3D object detection: integrating sparse encoders with transformer-based decoders | Authors: | Shao, Yuming | Keywords: | Computer and Information Science | Issue Date: | 2025 | Publisher: | Nanyang Technological University | Source: | Shao, Y. (2025). 3D object detection: integrating sparse encoders with transformer-based decoders. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/184513 | Project: | ISM-DISS-04596 | Abstract: | Accurate 3D object detection is essential for precise robot manipulation and as sembly tasks in industrial environments, where reliable object classification and localization significantly influence the efficiency and robustness of automated systems. Although existing point cloud encoders such as SECOND, PointPillars, and PV-RCNN have demonstrated strong feature extraction capabilities for sparse and dense 3D point clouds, they generally lack effective mechanisms to capture comprehensive global spatial relationships between objects. To address this limitation, this dissertation proposes the integration of specialized transformer based decoders with encoders, combining their strengths in local geometric en coding and global feature context aggregation. Experiments conducted on three widely used 3D open source datasets, KITTI and nuScenes, showcasing the effectiveness of our Voxel-DETR3D architecture. The results show promising results in evaluation metrics in all datasets when using task-specific decoders, achieving notable performance gains. These findings demonstrate that the proposed integration of encoder and decoder substantially enhances object detection accuracy, facilitating improvements in the reliability and effectiveness of robotic assembly tasks. Keywords: Transformer Decoder, 3D Object Detection, Point Cloud, Voxeliza tion, Industrial Assembly. | URI: | https://hdl.handle.net/10356/184513 | Schools: | School of Electrical and Electronic Engineering | Fulltext Permission: | restricted | Fulltext Availability: | With Fulltext |
Appears in Collections: | EEE Theses |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
NTU_Dissertation_3D_Object_Detection_Shao_Yuming.pdf Restricted Access | 4.32 MB | Adobe PDF | View/Open |
Google ScholarTM
Check
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.