Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/182930
Title: Multi-modality object detection in car cabin monitoring
Authors: Han, Jiaxuan
Keywords: Engineering
Issue Date: 2025
Publisher: Nanyang Technological University
Source: Han, J. (2025). Multi-modality object detection in car cabin monitoring. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/182930
Abstract: With the rapid advancement of autonomous driving technology, ensuring cabin safety and providing a smarter, more personalized driving experience have become critical objectives. Computer vision-based object detection has emerged as a key research topic in intelligent cabin systems. However, the constantly changing conditions within cabin environments present significant challenges to maintaining detection accuracy and reliability. This dissertation explores an effective multi-modality object detection model tailored for complex cabin scenarios. We adopted a Cross-Modality Fusion Transformer (CFT) model, built on a dual-stream YOLOv5 feature extraction backbone, and apply it to the multimodal TICaM dataset. Comprehensive preprocessing steps, including format adjustments and data augmentation, are employed to optimize feature fusion between RGB and infrared modalities. Experimental results demonstrate that the CFT model significantly enhances accuracy and robustness compared to single-modality models. To improve inference speed and enable real-time applications in vehicle monitoring systems, we also modified the original Cross Stage Partial Network module in YOLOv5. This attempt lays foundations for future efforts in balancing detection accuracy and inference efficiency, advancing the development of real-time intelligent vehicle monitoring systems.
URI: https://hdl.handle.net/10356/182930
Schools: School of Electrical and Electronic Engineering 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:EEE Theses

Files in This Item:
File Description SizeFormat 
Multi-Modality Object Detection in Car Cabin Monitoring.pdf
  Restricted Access
9.44 MBAdobe PDFView/Open

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.