Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/149513
Full metadata record
DC Field | Value | Language
dc.contributor.author | Loh, Zhen Ann | en_US
dc.date.accessioned | 2021-06-02T08:28:48Z | -
dc.date.available | 2021-06-02T08:28:48Z | -
dc.date.issued | 2021 | -
dc.identifier.citation | Loh, Z. A. (2021). Classification of distressed sounds using CNN/C-RNN. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/149513 | en_US
dc.identifier.uri | https://hdl.handle.net/10356/149513 | -
dc.description.abstract | Safety is always the utmost priority in a world where dangers are all around. Incidents involving snakes, falling trees or car crashes may endanger one's life. With the improved quality of life in Singapore, emergency personnel respond swiftly once contacted by the victim or by other people. However, an incident may occur in a deserted area or in a factory where no one else is present and the victim cannot obtain help through any means of communication. This project aims to provide a solution for such cases. With a trained distressed sounds classifier, distressed sounds can be detected so that the investigation team or emergency personnel can seek out the victim. Distressed sound detection can thus be integrated into a sound-based surveillance system and deployed in places such as factories and deserted areas to extend assistance to people who are distressed, in pain or in danger [1]. Hence, this project discusses the development and use of two machine learning models, a Convolutional Neural Network (CNN) and a Convolutional-Recurrent Neural Network (CRNN), to classify distressed sounds in Singapore's soundscape. The sounds are categorized into 4 classes: non-distressed sounds, 'Crying', 'Help', and 'Screaming'. The models are inspired by VGG [2], which is widely used in image and audio classification. In general, this report shows the process of transforming audio classification into an image classification problem in which CNN and CRNN can be utilized efficiently. Finally, the performance of these networks was evaluated on several metrics, but neither achieved a result feasible for real-time implementation. The CNN and CRNN models scored F_β scores of only 0.3377 and 0.3225 respectively, with β = 2. Keywords: Audio classification, Distressed Sounds, Deep Neural Network | en_US
dc.language.iso | en | en_US
dc.publisher | Nanyang Technological University | en_US
dc.relation | A3080-201 | en_US
dc.subject | Engineering::Electrical and electronic engineering::Electronic systems::Signal processing | en_US
dc.subject | Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence | en_US
dc.title | Classification of distressed sounds using CNN/C-RNN | en_US
dc.type | Final Year Project (FYP) | en_US
dc.contributor.supervisor | Er Meng Hwa | en_US
dc.contributor.supervisor | Gan Woon Seng | en_US
dc.contributor.school | School of Electrical and Electronic Engineering | en_US
dc.description.degree | Bachelor of Engineering (Electrical and Electronic Engineering) | en_US
dc.contributor.supervisoremail | EWSGAN@ntu.edu.sg, EMHER@ntu.edu.sg | en_US
item.grantfulltext | restricted | -
item.fulltext | With Fulltext | -
Appears in Collections: EEE Student Reports (FYP/IA/PA/PI)
Files in This Item:
File | Description | Size | Format
U1821047G_Final_Year_Project_Final_Report.pdf (Restricted Access) | Classification of Distressed Sounds using CNN/C-RNN Final Year Project | 2.35 MB | Adobe PDF

Page view(s): 206 (updated on Sep 11, 2024)
Download(s): 11 (updated on Sep 11, 2024)

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.