Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/41796
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | M. Mugunth Kumar | |
dc.date.accessioned | 2010-08-12T08:11:32Z | |
dc.date.available | 2010-08-12T08:11:32Z | |
dc.date.copyright | 2008 | en_US |
dc.date.issued | 2008 | |
dc.identifier.uri | http://hdl.handle.net/10356/41796 | |
dc.description.abstract | In today's world, for speech enabling applications, a main problem faced by researchers is having to rework from the start or "re-invent the wheel" owing to various reasons like existing speech recognition engines being too inefficient or inaccurate, or that existing speech-enabled applications are simply too specific for consideration. To alleviate this problem, one approach that can be used is a platform-based approach, engaging community of programmers to write applications for this platform. This dissertation, hence, aims at developing a community-supported, open and generic platform allowing other speech-enabled windows based applications to be developed using this platform. The dissertation aims at bridging two disparate areas of computer science, namely speech recognition and automation. Speech recognition remains a research area for at least 50 years and automation, which was used primarily to automatically run scheduled maintenance, updating corporate systems on the same network, but little research was done on using it for improving the usability of a system. In the course of the dissertation, different possible ways of recognizing speech and of automating Windows applications are explored. A prototype is being developed in this research, in which a broad aspect of the platform's capabilities was demonstrated rather than a single aspect in detail. As a part of evaluation, a focus group study was conducted to brainstorm on futuristic scenarios that could benefit using this platform. Finally, the dissertation concludes with advantages and disadvantages of the developed technology. | en_US |
dc.format.extent | 91 p. | en_US |
dc.language.iso | en | en_US |
dc.rights | Nanyang Technological University | |
dc.subject | DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing | en_US |
dc.title | Generic community-powered meta speech recognition platform | en_US |
dc.type | Thesis | |
dc.contributor.supervisor | Theng Yin Leng | en_US |
dc.contributor.school | Wee Kim Wee School of Communication and Information | en_US |
dc.description.degree | Master of Science (Information Systems) | en_US |
item.fulltext | With Fulltext | - |
item.grantfulltext | restricted | - |
Appears in Collections: | WKWSCI Theses |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
MMugunthKumar08.pdf Restricted Access | 4.25 MB | Adobe PDF | View/Open |
Page view(s) 10
908
Updated on Mar 27, 2024
Download(s)
2
Updated on Mar 27, 2024
Google ScholarTM
Check
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.