Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/184055
Title: Neuroscience-inspired approaches for visual‑audio multimodal visual search
Authors: S Jivaganesh
Keywords: Computer and Information Science
Issue Date: 2025
Publisher: Nanyang Technological University
Source: S Jivaganesh (2025). Neuroscience-inspired approaches for visual‑audio multimodal visual search. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/184055
Abstract: This project aimed to investigate the effect of auditory stimuli on visual search in humans through human eye-tracking experiments and to explore various visual-audio search models to examine how a multi-sensory approach would influence visual search efficiency and accuracy. The human experiment involved exposing the subject to 240 different trials of images and audio stimuli under seven different conditions and 20 different target categories. The eye fixations are then collected and processed to determine the effects of auditory stimuli on visual search. The results found that, while audio semantics has a slight effect in aiding visual search, the difference was not significant. However, human participants heavily relied on visual cues instead of audio cues. The model experiment aimed to explore various sound localisation models and integrate them with the human-inspired visual search model, IVSN. The results showed that the integration of the IVSN significantly boosted the weak performance of the sound localisation model in our dataset, but concluded that future work needs to reduce the hybrid model’s reliance on the IVSN representation.
URI: https://hdl.handle.net/10356/184055
Schools: College of Computing and Data Science 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:CCDS Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
S Jivaganesh_FYP_Amended_Report.pdf
  Restricted Access
15.14 MBAdobe PDFView/Open

Page view(s)

12
Updated on May 5, 2025

Download(s)

1
Updated on May 5, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.