Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/136992
Title: | Towards audio-assist cognitive computing : algorithms and applications | Authors: | Liu, Ziyuan | Keywords: | Engineering::Computer science and engineering::Computer systems organization::Computer system implementation | Issue Date: | 2019 | Publisher: | Nanyang Technological University | Source: | Liu, Z. (2019). Towards audio-assist cognitive computing : algorithms and applications. Master's thesis, Nanyang Technological University, Singapore. | Abstract: | Meaningful information hidden in the acoustic signals can be utilized by cognitive computing algorithms. The algorithms use them to improve the quality of services and applications. Inspired by this idea, we develop and optimize a series of applications based on cognitive computing algorithms. Two cognitive computing algorithms are developed: Audio Tag and Audio Fingerprint algorithms. The implementation and experiment results of the algorithms suggest that the information hidden in acoustic signals, either manually implanted or innate, can be utilized by proper techniques. The experiment results demonstrate that the audio tag and audio fingerprint algorithm have high accuracy and low time cost. The audio tag algorithm achieves 100\% accuracy (recognition under 5 seconds), with loud noises existing in specific experiment environments. The audio fingerprint algorithm achieves over 95\% accuracy(recognition under 5 seconds), with proper parameter settings. Based on the two core algorithms, two android applications are developed: Hey!Shake and Parking Loud application. They utilize these algorithms in the TV watching and parking lot access control scenarios and provide services with better quality, less hardware cost, and more convenience for users. The results of this research project confirm the possibility that we can improve the quality of multimedia services by digging into the often-overlooked acoustic information. | URI: | https://hdl.handle.net/10356/136992 | DOI: | 10.32657/10356/136992 | Rights: | This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). | Fulltext Permission: | open | Fulltext Availability: | With Fulltext |
Appears in Collections: | SCSE Theses |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Towards_Audio_Assist_Cognitive_Computing_Algorithms_and_Applications_AmendedV6.pdf | 4.39 MB | Adobe PDF | View/Open |
Page view(s)
214
Updated on May 20, 2022
Download(s) 50
125
Updated on May 20, 2022
Google ScholarTM
Check
Altmetric
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.