Title: Southeast Asian multi-language speech recognition engine
Authors: Zhang, Keke
Keywords: Engineering::Electrical and electronic engineering
Issue Date: 2022
Publisher: Nanyang Technological University
Source: Zhang, K. (2022). Southeast Asian multi-language speech recognition engine. Final Year Project (FYP), Nanyang Technological University, Singapore.
Project: B1080-211
Abstract: In the digital era, technological advances have made human life easier, faster, and smarter than ever before. Over the past decade, voice services have been adopted across a variety of industries as speech technology has been propelled forward. Market prospects have in turn been boosted as applications such as Automatic Speech Recognition (ASR), Text-To-Speech (TTS) and AI assistants gain increasing awareness. Amid the Covid-19 crisis, the global speech technology market remained resilient, since speech technology is one of the major enablers of contactless interaction. Moreover, driven by advances in artificial intelligence, speech technology has in recent years become accessible to a wider range of users at a lower cost. As a result, new challenges inevitably arise, and accented speech with language mixing is one of them. This project aims to develop an ASR engine that can be utilised in Singapore, capable of processing language-mixed input (English mixed with Mandarin) and producing useful output with a low error rate. The project focuses on automated text corpus collection, language model training, ASR integration and testing. The performance of the ASR engine will be evaluated by Mixed Error Rate (MER).
Schools: School of Electrical and Electronic Engineering 
Organisations: A*STAR Institute of Materials Research and Engineering
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections: EEE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File: FYP Report.pdf (Restricted Access)
Size: 1.73 MB
Format: Adobe PDF

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.