Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/149221
Title: | Applying machine learning to human speech for image interpretations |
Authors: | Woon, Yee Gin |
Keywords: | Engineering::Electrical and electronic engineering |
Issue Date: | 2021 |
Publisher: | Nanyang Technological University |
Source: | Woon, Y. G. (2021). Applying machine learning to human speech for image interpretations. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/149221 |
Abstract: | Speech recognition has become prevalent over the years because it allows users to search for information, communicate, and transcribe faster than typing on a keyboard. It was predicted that about half of all searches would use speech recognition by 2020. With the growing trend of big data, data analytics, and data science in the field of machine learning, the accuracy and precision of audio recognition have vastly improved. Technologies are available to support related applications, such as the voice assistants Google Assistant, Amazon Alexa, Apple Siri, and Microsoft Cortana. Open-source speech-to-text recognition APIs have enabled development in wider areas such as education, customer support, and even daily texting. SpeechArt is a product that adapts an existing speech recognition technology, DeepSpeech, to create artistic images from transcribed text. The acoustic and language models of an open-source release of DeepSpeech are utilised. The transcribed text generated by DeepSpeech is parsed by an NLP model. Keywords are selected from the user’s audio input and then sent to an image search model, which returns internet images with artistic effects in real time. The purpose of the project is to build an application that converts speech to images, which is useful for visual learning in education and can be extended to artistic uses, for example portraying a new design work. |
URI: | https://hdl.handle.net/10356/149221 |
Schools: | School of Electrical and Electronic Engineering |
Fulltext Permission: | restricted |
Fulltext Availability: | With Fulltext |
Appears in Collections: | EEE Student Reports (FYP/IA/PA/PI) |
Files in This Item:
File | Description | Size | Format |
---|---|---|---|
FYP_Final Report_Yee Gin.pdf (Restricted Access) | | 6.12 MB | Adobe PDF |
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.