|
Title:
|
MRD-based word sense disambiguation : further extending lesk.
|
|
Author:
|
Baldwin, Timothy.; Su, Nam Kim.; Bond, Francis.; Fujita, Sanae.; Martinez, David.; Tanaka, Takaaki.
|
|
Copyright year:
|
2008 |
|
Abstract:
|
This paper reconsiders the task of MRD-based word sense disambiguation, in extending the basic Lesk algorithm to investigate the impact on WSD performance of different tokenisation schemes, scoring mechanisms, methods of gloss extension and filtering methods. In experimentation over the Lexeed
Sensebank and the Japanese Senseval-2 dictionary task, we demonstrate that character bigrams with sense-sensitive gloss extension over hyponyms and hypernyms enhances WSD performance. |
|
Subject:
|
DRNTU::Humanities::Language::Japanese. DRNTU::Humanities::Linguistics::Sociolinguistics::Computational linguistics.
|
|
Type:
|
Conference Paper |
|
Conference name:
|
Proceedings of the Third International Joint Conference on Natural Language Processing (IJCNLP 2008) |
|
School:
|
College of Humanities, Arts, and Social Sciences |
|
Rights:
|
© 2008 ACL
This is the author created version of a work that has been peer reviewed and accepted for publication by Proceedings of the Third International Joint Conference on Natural Language Processing (IJCNLP 2008), Association for Computational Linguistics. It incorporates referee’s comments but changes resulting from the publishing process, such as copyediting, structural formatting, may not be reflected in this document. The published version is available at: [URL: http://www.aclweb.org/anthology-new/I/I08/I08-2108.pdf]. |
|
Version:
|
Accepted version |