Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/141755
Title: | An efficient dilated convolutional neural network for UAV noise reduction at low input SNR | Authors: | Tan, Zhi-Wei Nguyen, Anh Hai Trieu Khong, Andy Wai Hoong |
Keywords: | Engineering::Electrical and electronic engineering | Issue Date: | 2020 | Source: | Tan, Z.-W., Nguyen, A. H. T., & Khong, A. W. H. (2019). An efficient dilated convolutional neural network for UAV noise reduction at low input SNR. Proceedings of 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 1885-1892. doi:10.1109/APSIPAASC47483.2019.9023324 | Project: | MRP14 | Conference: | 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) | Abstract: | Acoustic applications on a multi-rotor unmanned aerial vehicle (UAV) have been hindered by its low input signal-to-noise ratio (SNR). Such low SNR condition poses prominent challenges for beamforming algorithms, statistical methods, and existing mask-based deep learning algorithms. We propose the small model on low SNR (SMoLnet), a compact convolutional neural network (CNN) to suppress UAV noise in noisy speech signals recorded off a microphone array mounted on the UAV. The proposed SMoLnet employs a large analysis window to achieve high spectral resolution since the loud UAV noise exhibits a narrow-band harmonic pattern. In the proposed SMoLnet model, exponentially-increasing dilated convolution layers were adopted to capture the global relationship across the frequency dimension. Furthermore, we performed direct spectral mapping between noisy and clean complex spectrogram to cater to the low SNR scenario. Simulation results show that the proposed SMoLnet outperforms existing dilation-based models in terms of speech quality and objective speech intelligibility metrics for UAV noise reduction. In addition, the proposed SMoLnet requires fewer parameters and achieves lower latency than the compared models. | URI: | https://hdl.handle.net/10356/141755 | ISBN: | 978-1-7281-3249-5 | ISSN: | 2640-009X | DOI: | 10.1109/APSIPAASC47483.2019.9023324 | Schools: | School of Electrical and Electronic Engineering | Organisations: | ST Engineering-NTU Corporate Lab | Rights: | © 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.1109/APSIPAASC47483.2019.9023324 | Fulltext Permission: | open | Fulltext Availability: | With Fulltext |
Appears in Collections: | EEE Conference Papers |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
APSIPA_2019_paper_443.pdf | 1.06 MB | Adobe PDF | ![]() View/Open |
SCOPUSTM
Citations
20
26
Updated on May 1, 2025
Page view(s)
390
Updated on May 2, 2025
Download(s) 20
318
Updated on May 2, 2025
Google ScholarTM
Check
Altmetric
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.