Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/152683
Full metadata record
DC Field | Value | Language
dc.contributor.author | Ng, Linus JunJia | en_US
dc.date.accessioned | 2021-09-14T07:14:33Z | -
dc.date.available | 2021-09-14T07:14:33Z | -
dc.date.issued | 2021 | -
dc.identifier.citation | Ng, L. J. (2021). Audio intelligence & domain adaptation for deep learning models at the edge. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/152683 | en_US
dc.identifier.uri | https://hdl.handle.net/10356/152683 | -
dc.description.abstract | Identifying urban noises and sounds is a challenging but essential problem in the field of machine listening. It provides a realistic use case for detecting noises in residential areas, from noise complaints to sounds or unusual noises that may indicate possible emergencies. Mitigating noise issues in an estate with a machine learning approach is not easy because of data scarcity and the lack of labeled data, whose acquisition is often difficult, costly, and time-consuming. In this work, we leverage an end-to-end IoT system coupled with deep learning models to detect critical urban sound information at the edge. Wireless acoustic sensor nodes (WASN) are deployed in several residential areas to validate their feasibility in detecting noise events of interest, where real-time edge analytics is performed. We explore methods to address the domain shift caused by novel acoustic conditions introduced by environmental influences at different deployment locations. We evaluate the environmental sound classifiers in a WASN setup and the extent to which the domain shift affects their performance in different locations with different microphones. We have collected and annotated an audio data set in Singapore for training, validation, and testing. Our experimental results show that the proposed method is able to address the mismatch introduced by the domain shift. The proposed method and future research following this work will enhance model robustness in adapting to newly deployed environments and minimize the manpower and time required to acquire and annotate audio data. | en_US
dc.language.iso | en | en_US
dc.publisher | Nanyang Technological University | en_US
dc.relation | MOE2017-T2-2-060 | en_US
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). | en_US
dc.subject | Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence | en_US
dc.title | Audio intelligence & domain adaptation for deep learning models at the edge | en_US
dc.type | Thesis-Master by Research | en_US
dc.contributor.supervisor | Gan Woon Seng | en_US
dc.contributor.school | School of Electrical and Electronic Engineering | en_US
dc.description.degree | Master of Engineering | en_US
dc.contributor.research | Centre for Infocomm Technology (INFINITUS) | en_US
dc.identifier.doi | 10.32657/10356/152683 | -
dc.contributor.supervisoremail | EWSGAN@ntu.edu.sg | en_US
item.fulltext | With Fulltext | -
item.grantfulltext | open | -
Appears in Collections: EEE Theses
Files in This Item:
File | Description | Size | Format
G1901919H-LinusNg-MEng-Final-Thesis-Submission.pdf | | 5.56 MB | Adobe PDF

Page view(s): 154 (updated on May 24, 2022)
Download(s): 50 (updated on May 24, 2022)

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.