IMI : a multilingual semantic annotation environment
Da Costa, Luis Morgado
Le, Tuan Anh
Date of Issue2015
Proceedings of ACL-IJCNLP 2015 System Demonstrations
School of Humanities and Social Sciences
Semantic annotated parallel corpora, though rare, play an increasingly important role in natural language processing. These corpora provide valuable data for computational tasks like sense-based machine translation and word sense disambiguation, but also to contrastive linguistics and translation studies. In this paper we present the ongoing development of a web-based corpus semantic annotation environment that uses the Open Multilingual Wordnet (Bond and Foster, 2013) as a sense inventory. The system includes interfaces to help coordinating the annotation project and a corpus browsing interface designed specifically to meet the needs of a semantically annotated corpus. The tool was designed to build the NTU-Multilingual Corpus (Tan and Bond, 2012). For the past six years, our tools have been tested and developed in parallel with the semantic annotation of a portion of this corpus in Chinese, English, Japanese and Indonesian. The annotation system is released under an open source license (MIT).
© 2015 ACL and AFNLP. This article is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. Permission is granted to make copies for the purposes of teaching and research.