Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/169137
Title: | Multi-modal semantic segmentation in poor lighting conditions | Authors: | Li, Zifeng | Keywords: | Engineering::Computer science and engineering::Computer applications | Issue Date: | 2023 | Publisher: | Nanyang Technological University | Source: | Li, Z. (2023). Multi-modal semantic segmentation in poor lighting conditions. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/169137 | Abstract: | Semantic segmentation is a complicate dense prediction task that consumes significant computational resources, and the use of multi-modal RGB-T data makes its computational burden even more severe. This dissertation presents a novel and lightweight network for RGB-T semantic segmentation with a parameter-free feature fusion module that facilitates efficient fusion between modalities. The proposed method integrates both modalities by leveraging multi-scale features from both RGB and T domains in different feature extraction stages. Specifically, we employ a dual-encoder architecture to extract RGB-T features and fuse them with a parameter-free cross-modal attention mechanism, taking the advantage of the complementary information provided by the two modalities to improve segmentation accuracy. Besides, we further investigate the impact of different pretrained strategies on the performance of the model. We evaluate our approach on several benchmark datasets, including the MFNet and PST900 datasets. Experimental results show that our approach outperforms real-time state-of-the-art methods in the literature while showing comparable performance with state-of the-art methods that require up to 100 times the computational complexity. Our findings demonstrate the effectiveness of lightweight RGB-T model for semantic segmentation and highlight the potential of this approach for various real-world applications. | URI: | https://hdl.handle.net/10356/169137 | Schools: | School of Electrical and Electronic Engineering | Fulltext Permission: | restricted | Fulltext Availability: | With Fulltext |
Appears in Collections: | EEE Theses |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Dissertation.pdf Restricted Access | 8.65 MB | Adobe PDF | View/Open |
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.