Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/181381
Title: | AV-FDTI: audio-visual fusion for drone threat identification | Authors: | Yang, Yizhuo Yuan, Shenghai Yang, Jianfei Nguyen, Thien Hoang Cao, Muqing Nguyen, Thien-Minh Wang, Han Xie, Lihua |
Keywords: | Engineering | Issue Date: | 2024 | Source: | Yang, Y., Yuan, S., Yang, J., Nguyen, T. H., Cao, M., Nguyen, T., Wang, H. & Xie, L. (2024). AV-FDTI: audio-visual fusion for drone threat identification. Journal of Automation and Intelligence, 3(3), 144-151. https://dx.doi.org/10.1016/j.jai.2024.06.002 | Project: | I2201E0013 WP5 |
Journal: | Journal of Automation and Intelligence | Abstract: | In response to the evolving challenges posed by small unmanned aerial vehicles (UAVs), which have the potential to transport harmful payloads or cause significant damage, we present AV-FDTI, an innovative Audio-Visual Fusion system designed for Drone Threat Identification. AV-FDTI leverages the fusion of audio and omnidirectional camera feature inputs, providing a comprehensive solution to enhance the precision and resilience of drone classification and 3D localization. Specifically, AV-FDTI employs a CRNN network to capture vital temporal dynamics within the audio domain and utilizes a pretrained ResNet50 model for image feature extraction. Furthermore, we adopt a visual information entropy and cross-attention-based mechanism to enhance the fusion of visual and audio data. Notably, our system is trained based on automated Leica tracking annotations, offering accurate ground truth data with millimeter-level accuracy. Comprehensive comparative evaluations demonstrate the superiority of our solution over the existing systems. In our commitment to advancing this field, we will release this work as open-source code and wearable AV-FDTI design, contributing valuable resources to the research community. | URI: | https://hdl.handle.net/10356/181381 | ISSN: | 2949-8554 | DOI: | 10.1016/j.jai.2024.06.002 | Schools: | School of Electrical and Electronic Engineering | Rights: | © 2024 The Authors. Published by Elsevier B.V. on behalf of KeAi Communications Co. Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). | Fulltext Permission: | open | Fulltext Availability: | With Fulltext |
Appears in Collections: | EEE Journal Articles |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
1-s2.0-S2949855424000285-main.pdf | 1.95 MB | Adobe PDF | ![]() View/Open |
SCOPUSTM
Citations
50
4
Updated on May 5, 2025
Page view(s)
106
Updated on May 6, 2025
Download(s) 50
33
Updated on May 6, 2025
Google ScholarTM
Check
Altmetric
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.