Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/152683
Title: | Audio intelligence & domain adaptation for deep learning models at the edge |
Authors: | Ng, Linus JunJia |
Keywords: | Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence |
Issue Date: | 2021 |
Publisher: | Nanyang Technological University |
Source: | Ng, L. J. (2021). Audio intelligence & domain adaptation for deep learning models at the edge. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/152683 |
Project: | MOE2017-T2-2-060 |
Abstract: | Identifying urban noises and sounds is a challenging but essential problem in the field of machine listening. It provides a realistic use case for detecting noises in residential areas - from noise complaints to sounds or unusual noises that may indicate possible emergencies. Mitigating noise issues in an estate with a machine learning approach is not easy because labeled data is scarce and its acquisition is often difficult, costly, and time-consuming. In this work, we leverage an end-to-end IoT system coupled with deep learning models to detect critical urban sound information at the edge. Wireless acoustic sensor nodes (WASN) are deployed in several residential areas to validate their feasibility in detecting noise events of interest, with real-time analytics performed at the edge. We explore methods to address the domain shift caused by novel acoustic conditions that arise from environmental influences at the different deployment locations. We evaluate the environmental sound classifiers in a WASN setup and the extent to which the domain shift affects their performance in different locations with different microphones. We have collected and annotated an audio data set in Singapore for training, validation, and testing. Our experimental results show that the proposed method is able to address the mismatch introduced by the domain shift. The proposed method and future research in this work will enhance model robustness in adapting to newly deployed environments and minimize the manpower and time required to acquire and annotate audio data. |
URI: | https://hdl.handle.net/10356/152683 |
DOI: | 10.32657/10356/152683 |
Schools: | School of Electrical and Electronic Engineering |
Research Centres: | Centre for Infocomm Technology (INFINITUS) |
Rights: | This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). |
Fulltext Permission: | open |
Fulltext Availability: | With Fulltext |
Appears in Collections: | EEE Theses |
Files in This Item:
File | Description | Size | Format |
---|---|---|---|
G1901919H-LinusNg-MEng-Final-Thesis-Submission.pdf | | 5.56 MB | Adobe PDF |
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.