Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/174253
Title: Synthesising the Singaporean voice: enhancing Singapore English in neural speech synthesis
Authors: Teo, Clarence Kai Xuan
Keywords: Arts and Humanities
Computer and Information Science
Issue Date: 2024
Publisher: Nanyang Technological University
Source: Teo, C. K. X. (2024). Synthesising the Singaporean voice: enhancing Singapore English in neural speech synthesis. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/174253
Abstract: In contemporary Text-to-Speech (TTS) techniques, achieving naturalness in synthetic speech presents a significant challenge. Additionally, to address the specific challenge of low-resource Singapore English (with minimal resources available in existing speech corpora), this Final Year Project focused on applying and advancing these TTS techniques. The objective was to enhance the naturalness of synthetic Singapore English speech, a goal that was under-represented in contemporary TTS research. This paper delves into the latest innovations in TTS systems and introduces a novel spoken corpus in Singapore English. Following the creation of this new corpus is the training and evaluation of a TTS model based on FastSpeech2 and HiFiNet2 architectures. The evaluation focused on three key metrics: PESQ, SDR and MOS-X2. Findings indicate notable improvements in the naturalness of synthesised Singapore English speech, effectively capturing its unique nuances and characteristics. These advancements mark a significant step in TTS technology, particularly in enhancing the naturalness and authenticity of synthetic speech for under-represented languages like Singapore English, amidst a sea of native English voices.
URI: https://hdl.handle.net/10356/174253
Schools: School of Humanities 
Organisations: Home Team Science and Technology Agency 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections: SoH Student Reports (FYP/IA/PA/PI)

Files in This Item:
File: CTKX_FYP_Thesis__NTU_Singapore_LaTeX_FINAL.pdf
Description: Restricted Access
Size: 3.33 MB
Format: Adobe PDF

Page view(s): 249 (updated on May 7, 2025)

Download(s): 32 (updated on May 7, 2025)

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.