dc.contributor.authorSun, Aixin
dc.date.accessioned2013-07-24T03:14:56Z
dc.date.available2013-07-24T03:14:56Z
dc.date.copyright2012en_US
dc.date.issued2012
dc.identifier.citationSun, A. (2012). Short text classification using very few words. Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval - SIGIR '12.en_US
dc.identifier.urihttp://hdl.handle.net/10220/12091
dc.description.abstractWe propose a simple, scalable, and non-parametric approach for short text classification. Leveraging the well studied and scalable Information Retrieval (IR) framework, our approach mimics human labeling process for a piece of short text. It first selects the most representative and topical-indicative words from a given short text as query words, and then searches for a small set of labeled short texts best matching the query words. The predicted category label is the majority vote of the search results. Evaluated on a collection of more than 12K Web snippets, the proposed approach achieves comparable classification accuracy with the baseline Maximum Entropy classifier using as few as 3 query words and top-5 best matching search hits. Among the four query word selection schemes proposed and evaluated in our experiments, term frequency together with clarity gives the best classification accuracy.en_US
dc.language.isoenen_US
dc.rights© 2012 ACM.en_US
dc.subjectDRNTU::Engineering::Computer science and engineering
dc.titleShort text classification using very few wordsen_US
dc.typeConference Paper
dc.contributor.conferenceInternational conference on Research and development in information retrieval (35th : 2012)en_US
dc.contributor.schoolSchool of Computer Engineeringen_US
dc.identifier.doihttp://dx.doi.org/10.1145/2348283.2348511


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record