Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/97856
Title: | Short text classification using very few words | Authors: | Sun, Aixin | Keywords: | DRNTU::Engineering::Computer science and engineering | Issue Date: | 2012 | Source: | Sun, A. (2012). Short text classification using very few words. Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval - SIGIR '12. | Conference: | International conference on Research and development in information retrieval (35th : 2012) | Abstract: | We propose a simple, scalable, and non-parametric approach for short text classification. Leveraging the well studied and scalable Information Retrieval (IR) framework, our approach mimics human labeling process for a piece of short text. It first selects the most representative and topical-indicative words from a given short text as query words, and then searches for a small set of labeled short texts best matching the query words. The predicted category label is the majority vote of the search results. Evaluated on a collection of more than 12K Web snippets, the proposed approach achieves comparable classification accuracy with the baseline Maximum Entropy classifier using as few as 3 query words and top-5 best matching search hits. Among the four query word selection schemes proposed and evaluated in our experiments, term frequency together with clarity gives the best classification accuracy. | URI: | https://hdl.handle.net/10356/97856 http://hdl.handle.net/10220/12091 |
DOI: | 10.1145/2348283.2348511 | Schools: | School of Computer Engineering | Rights: | © 2012 ACM. | Fulltext Permission: | none | Fulltext Availability: | No Fulltext |
Appears in Collections: | SCSE Conference Papers |
SCOPUSTM
Citations
5
101
Updated on Apr 23, 2025
Page view(s) 10
877
Updated on May 5, 2025
Google ScholarTM
Check
Altmetric
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.