Please use this identifier to cite or link to this item:
Full metadata record
DC FieldValueLanguage
dc.contributor.authorLee, Yan Zhenen_US
dc.identifier.citationLee, Y. Z. (2021). Sound event detection with human and emergency sounds. Final Year Project (FYP), Nanyang Technological University, Singapore.
dc.description.abstractSound Event Detection (SED) is the task of recognizing the sound events and their respective onset and offset timestamps in an audio clip. This thesis explores a variety of models and techniques in order to develop an effective SED system. This includes investigating the impact of different audio feature types, data augmentation techniques, network architectures and automatic threshold optimisation on the performance of the system. Additionally, this thesis proposes frame- wise prediction pre-processing and post-processing methods, in order to address the issues with existing SED system and develop a system that is able analyse clips with long audio durations. Unlike previous works, which use standard datasets, such as those from the Detection and Classification of Acoustic Scenes and Events (DCASE) challenges, as the development dataset, a novel dataset consisting of human and emergency sounds extracted from AudioSet is used in this project. As the dataset is novel, there is no state-of-the-art baseline available for comparison. As such, the dataset of the DCASE 2017 Task 4 is used to compare the performance of our best-performing models, which is determined based on the project dataset, with the state-of-the- art performance. From our experiments, we managed to successfully develop a well-performing SED system for our novel dataset, with the system using our proposed prediction processing method consistently outperforming the ones that do not. Additionally, by using the knowledge we learnt from our experiments with our novel project dataset, we devloped a system which outperforms the previous state- of-the-art model for the DCASE 2017 Task 4 Challenge.en_US
dc.publisherNanyang Technological Universityen_US
dc.subjectEngineering::Computer science and engineering::Computing methodologies::Artificial intelligenceen_US
dc.titleSound event detection with human and emergency soundsen_US
dc.typeFinal Year Project (FYP)en_US
dc.contributor.supervisorChng Eng Siongen_US
dc.contributor.schoolSchool of Computer Science and Engineeringen_US
dc.description.degreeBachelor of Science in Data Science and Artificial Intelligenceen_US
dc.contributor.organizationDSO National Laboratoriesen_US
item.fulltextWith Fulltext-
Appears in Collections:SCSE Student Reports (FYP/IA/PA/PI)
Files in This Item:
File Description SizeFormat 
  Restricted Access
2.76 MBAdobe PDFView/Open

Page view(s)

Updated on Jul 1, 2022


Updated on Jul 1, 2022

Google ScholarTM


Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.