Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/156592
Title: Speech emotion recognition using WaveNet
Authors: Nurul Sabrina Mohammed Riduwan
Keywords: Engineering::Computer science and engineering
Issue Date: 2022
Publisher: Nanyang Technological University
Source: Nurul Sabrina Mohammed Riduwan (2022). Speech emotion recognition using WaveNet. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156592
Project: SCSE21-0421
Abstract: Speech emotion recognition is known to be a challenging and complex task for machine learning models. Two challenges arise in speech emotion recognition: 1) human emotions are hard to distinguish, and 2) emotion can be detected only at specific moments in an utterance. This paper therefore proposes a Speech Emotion Recognition (SER) architecture inspired by the WaveNet architecture. The architecture relies on neither tedious pre-processing nor recurrent layers. The novelty of our approach lies in using both speech waveforms and audio features as inputs, in using causal dilated convolutions to capture temporal dependencies, and in using a self-attention mechanism. Self-attention permits inputs to interact with each other so that the model attends closely to the valuable parts of the input and learns the connections between them. We demonstrate improved SER performance with our model on the EMO-DB dataset over existing baseline models. Index Terms: speech emotion recognition, self-attention, deep learning, computational paralinguistics
URI: https://hdl.handle.net/10356/156592
Schools: School of Computer Science and Engineering 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections: SCSE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File: FYP_Report_Nurul_Sabrina.pdf (Restricted Access)
Size: 630.5 kB
Format: Adobe PDF

Page view(s): 99 (updated on Sep 30, 2023)
Download(s): 17 (updated on Sep 30, 2023)

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.