Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/138858
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Wu, Mengkai | en_US |
dc.date.accessioned | 2020-05-13T06:47:46Z | - |
dc.date.available | 2020-05-13T06:47:46Z | - |
dc.date.issued | 2020 | - |
dc.identifier.uri | https://hdl.handle.net/10356/138858 | - |
dc.description.abstract | With the rapid growth of the Internet, the amount of video and audio data is increasing sharply. With the development of big data and artificial intelligence, audio analysis and recognition technology become more important. As the audio classification requirement increases, to classify audio and generate a description, many methods have been introduced. This project uses machine learning to achieve the classification goal through building a model with Convolutional Neural Networks or other neural networks such as Recurrent Neural Networks to categorize and generate the description for the audio. This paper includes the research I have done for generating audio descriptions using different neural network models and approaches. It starts from audio data downloading, feature extraction, image generation, and classifier training to the final audio description design and implementation. In this project, after comparison on a few types of deep neural networks, we found that deep convolutional neural networks have the overall better accuracy. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Nanyang Technological University | en_US |
dc.relation | PSCSE18-0064 | en_US |
dc.subject | Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence | en_US |
dc.title | Deep learning techniques to derive descriptions from audio signals | en_US |
dc.type | Final Year Project (FYP) | en_US |
dc.contributor.supervisor | Jagath C Rajapakse | en_US |
dc.contributor.school | School of Computer Science and Engineering | en_US |
dc.description.degree | Bachelor of Engineering (Computer Science) | en_US |
dc.contributor.supervisoremail | ASJagath@ntu.edu.sg | en_US |
item.grantfulltext | restricted | - |
item.fulltext | With Fulltext | - |
Appears in Collections: | SCSE Student Reports (FYP/IA/PA/PI) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Final Year Project Report.pdf Restricted Access | 1.4 MB | Adobe PDF | View/Open |
Page view(s)
290
Updated on Mar 25, 2023
Download(s) 50
41
Updated on Mar 25, 2023
Google ScholarTM
Check
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.