Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/176981
Title: Urban noise tagging and perceptually informed unsupervised clustering
Authors: Quek, Gordon
Keywords: Engineering
Issue Date: 2024
Publisher: Nanyang Technological University
Source: Quek, G. (2024). Urban noise tagging and perceptually informed unsupervised clustering. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/176981
Project: A3063-231 
Abstract: The rapid urbanisation process has led to an increase in noise pollution in urban environments, impacting the well-being and quality of life of city residents. To address this issue, researchers have focused on understanding and categorising urban noise to mitigate its effects effectively. Convolutional Neural Networks (CNNs) have emerged as a popular choice for audio classification tasks due to their ability to learn structural patterns from audio data. Despite efforts in the field, there is a lack of research on urban audio data from specific locations, such as Singapore, which has unique acoustic features influenced by local factors. This project aims to enhance the performance of CNNs for audio classification using the SINGA:PURA dataset, which consists of audio recordings from the Singapore environment. Using an initial code for machine learning audio classification adapted from Kenneth's DCASE Task1B from 2020 and modified to adapt to the SINGA:PURA dataset. Experiments are conducted to modify model parameters, data preprocessing techniques, and CNN architectures. Results show that a learning rate of 0.001, a batch size of 60, and employing 2 to 4 convolutional layers with 160 mel filters yield optimal performance for classifying Singaporean acoustic environments. However, despite achieving a training accuracy of 80% and a testing accuracy of 46%, overfitting is observed, indicating the need for further research. Recommendations for future work include addressing dataset imbalance through additional data augmentation techniques, refining data preprocessing methods, exploring a broader range of hyperparameters, and leveraging pre-trained CNN models such as VGG16, ResNet, or InceptionV3. These approaches could provide valuable insights into optimising model performance and improving classification accuracy for urban noise tagging and mitigation efforts.
URI: https://hdl.handle.net/10356/176981
DOI (Related Dataset): 10.21979/N9/Y8UQ6F
Schools: School of Electrical and Electronic Engineering 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:EEE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
FYP_Gordon_2024.pdf
  Restricted Access
1.22 MBAdobe PDFView/Open

Page view(s)

92
Updated on May 7, 2025

Download(s)

4
Updated on May 7, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.