Please use this identifier to cite or link to this item:
Title: The Company They Keep: Extracting Japanese Neologisms Using Language Patterns
Authors: Breen, James
Baldwin, Timothy
Bond, Francis
Keywords: Japanese Text
Issue Date: 2018
Source: Breen, J., Baldwin, T., & Bond, F. (2018). The Company They Keep: Extracting Japanese Neologisms Using Language Patterns. The 9th Global WordNet Conference (GWC 2018).
Abstract: We describe an investigation into the identification and extraction of unrecorded potential lexical items in Japanese text by detecting text passages containing selected language patterns typically associated with such items. We identified a set of suitable patterns, then tested them with two large collections of text drawn from the WWW and Twitter. Samples of the extracted items were evaluated, and it was demonstrated that the approach has considerable potential for identifying terms for later lexicographic analysis.
Rights: © 2018 The author(s). This is the author created version of a work that has been peer reviewed and accepted for publication by The 9th Global WordNet Conference (GWC 2018). It incorporates referee’s comments but changes resulting from the publishing process, such as copyediting, structural formatting, may not be reflected in this document. The full-text is available at: [].
Fulltext Permission: open
Fulltext Availability: With Fulltext
Appears in Collections:HSS Conference Papers

Files in This Item:
File Description SizeFormat 
GWC2018_paper_20.pdf121.69 kBAdobe PDFThumbnail

Page view(s) 50

Updated on Dec 2, 2020

Download(s) 50

Updated on Dec 2, 2020

Google ScholarTM


Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.