Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/97576
Title: | Category hierarchy maintenance : a data-driven approach | Authors: | Yuan, Quan Cong, Gao Sun, Aixin Lin, Chin-Yew Magnenat-Thalmann, Nadia |
Keywords: | DRNTU::Engineering::Computer science and engineering | Issue Date: | 2012 | Source: | Yuan, Q., Cong, G., Sun, A., Lin, C.-Y., & Magnenat-Thalmann, N. (2012). Category hierarchy maintenance: a data-driven approach. Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval - SIGIR '12. | Conference: | International conference on Research and development in information retrieval (35th : 2012) | Abstract: | Category hierarchies often evolve at a much slower pace than the documents reside in. With newly available documents kept adding into a hierarchy, new topics emerge and documents within the same category become less topically cohesive. In this paper, we propose a novel automatic approach to modifying a given category hierarchy by redistributing its documents into more topically cohesive categories. The modification is achieved with three operations (namely, sprout, merge, and assign) with reference to an auxiliary hierarchy for additional semantic information; the auxiliary hierarchy covers a similar set of topics as the hierarchy to be modified. Our user study shows that the modified category hierarchy is semantically meaningful. As an extrinsic evaluation, we conduct experiments on document classification using real data from Yahoo! Answers and AnswerBag hierarchies, and compare the classification accuracies obtained on the original and the modified hierarchies. Our experiments show that the proposed method achieves much larger classification accuracy improvement compared with several baseline methods for hierarchy modification. | URI: | https://hdl.handle.net/10356/97576 http://hdl.handle.net/10220/12082 |
DOI: | 10.1145/2348283.2348389 | Schools: | School of Computer Engineering | Rights: | © 2012 ACM. | Fulltext Permission: | none | Fulltext Availability: | No Fulltext |
Appears in Collections: | SCSE Conference Papers |
SCOPUSTM
Citations
20
12
Updated on Mar 13, 2025
Page view(s) 20
764
Updated on Mar 16, 2025
Google ScholarTM
Check
Altmetric
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.