Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/184055
Title: | Neuroscience-inspired approaches for visual‑audio multimodal visual search | Authors: | S Jivaganesh | Keywords: | Computer and Information Science | Issue Date: | 2025 | Publisher: | Nanyang Technological University | Source: | S Jivaganesh (2025). Neuroscience-inspired approaches for visual‑audio multimodal visual search. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/184055 | Abstract: | This project aimed to investigate the effect of auditory stimuli on visual search in humans through human eye-tracking experiments and to explore various visual-audio search models to examine how a multi-sensory approach would influence visual search efficiency and accuracy. The human experiment involved exposing the subject to 240 different trials of images and audio stimuli under seven different conditions and 20 different target categories. The eye fixations are then collected and processed to determine the effects of auditory stimuli on visual search. The results found that, while audio semantics has a slight effect in aiding visual search, the difference was not significant. However, human participants heavily relied on visual cues instead of audio cues. The model experiment aimed to explore various sound localisation models and integrate them with the human-inspired visual search model, IVSN. The results showed that the integration of the IVSN significantly boosted the weak performance of the sound localisation model in our dataset, but concluded that future work needs to reduce the hybrid model’s reliance on the IVSN representation. | URI: | https://hdl.handle.net/10356/184055 | Schools: | College of Computing and Data Science | Fulltext Permission: | restricted | Fulltext Availability: | With Fulltext |
Appears in Collections: | CCDS Student Reports (FYP/IA/PA/PI) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
S Jivaganesh_FYP_Amended_Report.pdf Restricted Access | 15.14 MB | Adobe PDF | View/Open |
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.