Please use this identifier to cite or link to this item:
Title: Sound-event classification for robot hearing
Authors: Yong, Shu Ching
Keywords: Engineering::Electrical and electronic engineering
Issue Date: 2022
Publisher: Nanyang Technological University
Source: Yong, S. C. (2022). Sound-event classification for robot hearing. Final Year Project (FYP), Nanyang Technological University, Singapore.
Project: A3090-211
Abstract: Throughout the years, there have been several methods of executing the process of sound-event classification. The use of spectrograms and a time-frequency spectral analysis that illustrates the magnitude spectrum of the signal within a 2D time-frequency plane are some examples of the well known methods. Even though intensive research was done, there are still greater developments that can be achieved. For instance, for sound-based recognition, there still exists a research gap to enhance its accuracy and reliability. By using a spectrogram, audio signals can be visualised and evaluated into a time-frequency spectral analysis of a magnitude spectrum on a 2D plane. However, the magnitude spectrum is not enough to classify the audio sources. To address this issue, a method, first proposed by Jiang Xudong and Ren Jianfeng, called “Regularised 2D complex-log-Fourier transform” is introduced. The addition to this process is a phase spectrum which will also be used to do sound-event classification. On top of this, the Principal Component Analysis (PCA) is used to extract out significant information and remove unnecessary data in the audio samples. Last but not least, the calculated values using the Mahalanobis Distance will be used to identify the belonging classes of the sound events.
Schools: School of Electrical and Electronic Engineering 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:EEE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
Sound-Event Classification for Robot Hearing Final Report.pdf
  Restricted Access
4.64 MBAdobe PDFView/Open

Page view(s)

Updated on Feb 24, 2024


Updated on Feb 24, 2024

Google ScholarTM


Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.