Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/158057
Title: Machine learning / deep learning approach to soundscape analysis
Authors: Koh, Cheng Yong
Keywords: Engineering::Electrical and electronic engineering
Issue Date: 2022
Publisher: Nanyang Technological University
Source: Koh, C. Y. (2022). Machine learning / deep learning approach to soundscape analysis. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/158057
Abstract: Visual understanding of the soundscape environment is an enabling factor for a wide range of applications in studying how humans perceive sounds. Audiovisual scene decomposition allows further understanding of soundscape. This project will be focusing on the decomposition of urban soundscapes such as parks, plazas, streets, etc. As water sounds are a prominent sound source in urban landscapes, this project will add a new waterbody class to the segmentation model which do not currently exist in most multiclass urban semantic segmentation model. This project proposes the use of the DeepLabV3+ model, with a ResNet50 backbone, trained on an improved Cityscapes dataset to perform semantic segmentation for urban scene decomposition. The training dataset will include additional waterbody images on top of the original Cityscapes images.
URI: https://hdl.handle.net/10356/158057
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:EEE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
KohChengYong_U1822992G_FYPReportFinalSub.pdf
  Restricted Access
2.45 MBAdobe PDFView/Open

Page view(s)

26
Updated on Dec 1, 2022

Download(s)

4
Updated on Dec 1, 2022

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.