Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/92271
Title: MRD-based word sense disambiguation : further extending lesk
Authors: Baldwin, Timothy
Su, Nam Kim
Bond, Francis
Fujita, Sanae
Martinez, David
Tanaka, Takaaki
Keywords: DRNTU::Humanities::Language::Japanese
DRNTU::Humanities::Linguistics::Sociolinguistics::Computational linguistics
Issue Date: 2008
Source: Baldwin, T., Su, N. K., Bond, F., Fujita, S., Martinez, D., & Tanaka, T. (2008). MRD-based word sense disambiguation : further extending lesk. Proceedings of the Third International Joint Conference on Natural Language Processing (IJCNLP 2008) pp.775-780.
Abstract: This paper reconsiders the task of MRD-based word sense disambiguation, in extending the basic Lesk algorithm to investigate the impact on WSD performance of different tokenisation schemes, scoring mechanisms, methods of gloss extension and filtering methods. In experimentation over the Lexeed Sensebank and the Japanese Senseval-2 dictionary task, we demonstrate that character bigrams with sense-sensitive gloss extension over hyponyms and hypernyms enhances WSD performance.
URI: https://hdl.handle.net/10356/92271
http://hdl.handle.net/10220/6447
Rights: © 2008 ACL This is the author created version of a work that has been peer reviewed and accepted for publication by Proceedings of the Third International Joint Conference on Natural Language Processing (IJCNLP 2008), Association for Computational Linguistics. It incorporates referee’s comments but changes resulting from the publishing process, such as copyediting, structural formatting, may not be reflected in this document. The published version is available at: [URL: http://www.aclweb.org/anthology-new/I/I08/I08-2108.pdf].
Fulltext Permission: open
Fulltext Availability: With Fulltext
Appears in Collections:HSS Conference Papers

Files in This Item:
File Description SizeFormat 
2008-ijcnlp-lesk.pdf343.82 kBAdobe PDFThumbnail
View/Open

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.