Please use this identifier to cite or link to this item: http://hdl.handle.net/10356/54652
|Title:||Examining crosslingual word sense disambiguation.|
|Authors:||Liling, Tan.|
|Keywords:||DRNTU::Humanities::Linguistics::Semantics
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Document and text processing
DRNTU::Engineering::Computer science and engineering::Computer applications::Arts and humanities
DRNTU::Engineering::Computer science and engineering::Mathematics of computing::Probability and statistics|
|Issue Date:||2013|
|Abstract:||Understanding human language computationally remains a challenge at several levels: phonological, syntactic and semantic. This thesis examines the ambiguity of human language through the Word Sense Disambiguation (WSD) task. WSD is the task of determining the correct sense of a word given a context sentence. Topic models are statistical models that can discover abstract topics in a collection of documents. This thesis approaches WSD crosslingually, using topic models and a parallel corpus. It defines a topical crosslingual WSD (Topical CLWSD) task as two subtasks: (i) Match and Translate: using topic models to find a match for the query sentence in a parallel corpus, which provides the appropriate translation of the target polysemous word; and (ii) Map: mapping the word-translation pair to the corresponding concept in the Open Multilingual WordNet. The XLING WSD system was built to attempt the Topical CLWSD task. Although XLING underperforms on this task, it serves as a pilot approach to knowledge-lean crosslingual WSD. Beyond WSD, the thesis briefly reports on ongoing work to compile multilingual data for the Nanyang Technological University-Multilingual Corpus (NTU-MC). The NTU-MC project and the XLING system are related in that both aim to build crosslingual language technologies.|
|URI:||http://hdl.handle.net/10356/54652|
|Fulltext Permission:||restricted|
|Fulltext Availability:||With Fulltext|
|Appears in Collections:||HSS Theses|
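The two subtasks named in the abstract (Match and Translate, then Map) can be illustrated with a minimal sketch. This is not the XLING system: bag-of-words cosine similarity stands in for topic-model inference, and the toy parallel corpus, the sense glosses, and every function name below are hypothetical, chosen only to show the shape of the pipeline.

```python
import math
from collections import Counter

# Hypothetical toy parallel corpus: (English sentence, French translation
# of the target word "bank" in that sentence).
PARALLEL = [
    ("she deposited money at the bank", "banque"),
    ("they walked along the river bank", "rive"),
]

# Hypothetical sense inventory keyed by (word, translation); a real system
# would map to concepts in a multilingual wordnet instead of plain glosses.
SENSE_MAP = {
    ("bank", "banque"): "financial institution",
    ("bank", "rive"): "sloping land beside water",
}


def vectorize(sentence):
    """Bag-of-words counts; an assumed stand-in for a topic distribution."""
    return Counter(sentence.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[word] for word, count in a.items() if word in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def match_and_translate(query, corpus):
    """Subtask (i): find the most similar corpus sentence, return its translation."""
    query_vec = vectorize(query)
    _, translation = max(
        corpus, key=lambda pair: cosine(query_vec, vectorize(pair[0]))
    )
    return translation


def disambiguate(word, query, corpus=PARALLEL, sense_map=SENSE_MAP):
    """Subtask (ii): map the word-translation pair to a sense label."""
    translation = match_and_translate(query, corpus)
    return translation, sense_map.get((word, translation))


if __name__ == "__main__":
    print(disambiguate("bank", "he took money out of the bank"))
    print(disambiguate("bank", "the boat drifted toward the river bank"))
```

In this sketch the translation acts as the disambiguating signal: once the closest parallel sentence is found, the sense follows from a lookup on the word-translation pair, so no sense-annotated training data is needed.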
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.