Resource discovery through social tagging : a classification and content analytic approach
Author
Goh, Dion Hoe-Lian
Chua, Alton Yeow Kuan
Lee, Chei Sian
Razikin, Khasfariyati
Date of Issue
2008School
Wee Kim Wee School of Communication and Information
Version
Accepted version
Abstract
Purpose – Social tagging systems allow users to assign keywords (tags) to useful resources, facilitating their future access by the tag creator and possibly by other users. Social tagging has both proponents and critics, and this paper aims to investigate if tags are an effective means of resource discovery.
Design/methodology/approach – The paper adopts techniques from text categorisation in which webpages and their associated tags from del.icio.us and trained Support Vector Machine (SVM) classifiers are downloaded to determine if the documents could be assigned to their associated tags. Two text categorisation experiments were conducted. The first used only the terms from the documents as features while the second experiment included tags in addition to terms as part of its feature set. Performance metrics used were precision, recall, accuracy and F1 score. A content analysis was also conducted to uncover characteristics of effective and ineffective tags for resource discovery.
Findings – Results from the classifiers were mixed, and the inclusion of tags as part of the feature set did not result in a statistically significant improvement (or degradation) of the performance of the SVM classifiers. This suggests that not all tags can be used for resource discovery by public users, confirming earlier work that there are many dynamic reasons for tagging documents that may not be apparent to others.
Originality/value – The authors extend their understanding of social classification and its utility in sharing and accessing resources. Results of this work may be used to guide development in social tagging systems as well as social tagging practices.
Subject
DRNTU::Library and information science::General
Type
Journal Article
Series/Journal Title
Online information review
Rights
© 2008 Emerald Group Publishing Limited. This is the author created version of a work that has been peer reviewed and accepted for publication by Online Information Review, Emerald Group Publishing Limited. It incorporates referee’s comments but changes resulting from the publishing process, such as copyediting, structural formatting, may not be reflected in this document. The published version is available at: [http://dx.doi.org/10.1108/14684520910969961].
Collections
http://dx.doi.org/10.1108/14684520910969961
Get published version (via Digital Object Identifier)