Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/181381
Title: AV-FDTI: audio-visual fusion for drone threat identification
Authors: Yang, Yizhuo
Yuan, Shenghai
Yang, Jianfei
Nguyen, Thien Hoang
Cao, Muqing
Nguyen, Thien-Minh
Wang, Han
Xie, Lihua
Keywords: Engineering
Issue Date: 2024
Source: Yang, Y., Yuan, S., Yang, J., Nguyen, T. H., Cao, M., Nguyen, T., Wang, H. & Xie, L. (2024). AV-FDTI: audio-visual fusion for drone threat identification. Journal of Automation and Intelligence, 3(3), 144-151. https://dx.doi.org/10.1016/j.jai.2024.06.002
Project: I2201E0013 
WP5 
Journal: Journal of Automation and Intelligence 
Abstract: In response to the evolving challenges posed by small unmanned aerial vehicles (UAVs), which have the potential to transport harmful payloads or cause significant damage, we present AV-FDTI, an innovative Audio-Visual Fusion system designed for Drone Threat Identification. AV-FDTI leverages the fusion of audio and omnidirectional camera feature inputs, providing a comprehensive solution to enhance the precision and resilience of drone classification and 3D localization. Specifically, AV-FDTI employs a CRNN network to capture vital temporal dynamics within the audio domain and utilizes a pretrained ResNet50 model for image feature extraction. Furthermore, we adopt a visual information entropy and cross-attention-based mechanism to enhance the fusion of visual and audio data. Notably, our system is trained based on automated Leica tracking annotations, offering accurate ground truth data with millimeter-level accuracy. Comprehensive comparative evaluations demonstrate the superiority of our solution over the existing systems. In our commitment to advancing this field, we will release this work as open-source code and wearable AV-FDTI design, contributing valuable resources to the research community.
URI: https://hdl.handle.net/10356/181381
ISSN: 2949-8554
DOI: 10.1016/j.jai.2024.06.002
Schools: School of Electrical and Electronic Engineering 
Rights: © 2024 The Authors. Published by Elsevier B.V. on behalf of KeAi Communications Co. Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
Fulltext Permission: open
Fulltext Availability: With Fulltext
Appears in Collections:EEE Journal Articles

Files in This Item:
File Description SizeFormat 
1-s2.0-S2949855424000285-main.pdf1.95 MBAdobe PDFThumbnail
View/Open

SCOPUSTM   
Citations 50

4
Updated on May 5, 2025

Page view(s)

106
Updated on May 6, 2025

Download(s) 50

33
Updated on May 6, 2025

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.