Now showing items 1-10 of 13
Integration of a lexical type database with a linguistically interpreted corpus
We have constructed a large scale and detailed database of lexical types in Japanese from a treebank that includes detailed linguistic information. The database helps treebank annotators and grammar developers to share ...
Exploiting semantic information for HPSG parse selection
In this paper we present a framework for experimentation on parse selection using syntactic and semantic features. Results are given for syntactic features, dependency relations and the use of semantic classes.
Word sense disambiguation incorporating lexical and structural semantic information
We present results that show that incorporating lexical and structural semantic information is effective for word sense disambiguation. We evaluated the method by using precise information from a large treebank and an ...
Extracting representative arguments from dictionaries for resolving zero pronouns
We propose a method to alleviate the problem of referential granularity for Japanese zero pronoun resolution. We use dictionary definition sentences to extract ‘representative’ arguments of predicative definition words; ...
High precision treebanking - blazing useful trees using POS information
In this paper we present a quantitative and qualitative analysis of annotation in the Hinoki treebank of Japanese, and investigate a ...
An implemented description of Japanese : the Lexeed dictionary and the Hinoki treebank
In this paper we describe the current state of a new Japanese lexical resource: the Hinoki treebank. The treebank is built from dictionary definition sentences, and uses an HPSG based Japanese grammar to encode both syntactic ...
The Hinoki sensebank : a large-scale word sense tagged corpus of Japanese
Semantic information is important for precise word sense disambiguation system and the kind of semantic analysis used in sophisticated natural language processing such as machine translation, question answering, etc. There ...
A reexamination of MRD-based word sense disambiguation
This paper reconsiders the task of MRD-based word sense disambiguation, in extending the basic Lesk algorithm to investigate the impact on WSD performance of different tokenisation schemes and methods of definition extension. ...
Building a cross-lingual referential knowledge database using dictionaries
Referential knowledge is vital for resolving various problems in NLP, such as anaphora resolution. For example, we have the referential knowledge that diagnose is most likely a member of the referential relation ‘doctor ...
Acquiring an ontology for a fundamental vocabulary
In this paper we describe the extraction of thesaurus information from parsed dictionary definition sentences. The main data for our experiments comes from Lexeed, a Japanese semantic dictionary, and the Hinoki treebank ...