Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/184545
Title: Multi-view fusion for action recognition in car cabin environment
Authors: Wang, Xiaojian
Keywords: Engineering
Issue Date: 2025
Publisher: Nanyang Technological University
Source: Wang, X. (2025). Multi-view fusion for action recognition in car cabin environment. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/184545
Abstract: Action recognition plays a critical role in autonomous driving systems for interpreting driver behavior. Conventional single view methods that depend on manually designed features and Convolutional Neural Networks (CNNs) frequently face challenges in handling occlusions, illumination variations, and viewpoint alterations, resulting in limited performance. This study addresses these limitations through the integration of data from multiple synchronized camera perspectives. This research adopts UniFormerV2 as the baseline architecture and investigates three distinct fusion methodologies: early fusion, late fusion, and a META (Motion and Multi-View Excitation and Temporal Aggregation) block. Early fusion merges multi-view data at the preprocessing phase, whereas late fusion aggregates processed features while preserving view-specific characteristics. The META-based framework employs motion-aware mechanisms and cross-view attention to dynamically model spatiotemporal dependencies, thereby improving inter-view information exchange and action comprehension. Benchmark evaluations conducted on the Drive&Act dataset demonstrate that multi-view models outperform single view models, particularly in complex scenarios involving occlusions or actions across different spatial regions. The META-based method achieves the highest accuracy.
URI: https://hdl.handle.net/10356/184545
Schools: School of Electrical and Electronic Engineering 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:EEE Theses

Files in This Item:
File Description SizeFormat 
Wang Xiaojian-Dissertation.pdf
  Restricted Access
4.48 MBAdobe PDFView/Open

Page view(s)

24
Updated on May 7, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.