Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/96254
Title: | Tag-based image retrieval improved by augmented features and group-based refinement |
Authors: | Chen, Lin; Xu, Dong; Tsang, Ivor Wai-Hung; Luo, Jiebo |
Keywords: | DRNTU::Engineering::Computer science and engineering |
Issue Date: | 2012 |
Source: | Chen, L., Xu, D., Tsang, I. W., & Luo, J. (2012). Tag-Based Image Retrieval Improved by Augmented Features and Group-Based Refinement. IEEE Transactions on Multimedia, 14(4), 1057-1067. |
Series/Report no.: | IEEE transactions on multimedia |
Abstract: | In this paper, we propose a new tag-based image retrieval framework to improve the retrieval performance of a group of related personal images captured by the same user within a short period of an event by leveraging millions of training web images and their associated rich textual descriptions. For any given query tag (e.g., “car”), the inverted file method is employed to automatically determine the relevant training web images that are associated with the query tag and the irrelevant training web images that are not associated with the query tag. Using these relevant and irrelevant web images as positive and negative training data respectively, we propose a new classification method called support vector machine (SVM) with augmented features (AFSVM) to learn an adapted classifier by leveraging the prelearned SVM classifiers of popular tags that are associated with a large number of relevant training web images. Treating the decision values of one group of test photos from AFSVM classifiers as the initial relevance scores, in the subsequent group-based refinement process, we propose to use the Laplacian regularized least squares method to further refine the relevance scores of test photos by utilizing the visual similarity of the images within the group. Based on the refined relevance scores, our proposed framework can be readily applied to tag-based image retrieval for a group of raw consumer photos without any textual descriptions or a group of Flickr photos with noisy tags. Moreover, we propose a new method to better calculate the relevance scores for Flickr photos. Extensive experiments on two datasets demonstrate the effectiveness of our framework. |
URI: | https://hdl.handle.net/10356/96254 http://hdl.handle.net/10220/11473 |
ISSN: | 1520-9210 |
DOI: | 10.1109/TMM.2012.2187435 |
Schools: | School of Computer Engineering |
Research Centres: | Centre for Multimedia and Network Technology |
Rights: | © 2012 IEEE. |
Fulltext Permission: | none |
Fulltext Availability: | No Fulltext |
Appears in Collections: | SCSE Journal Articles |
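The abstract's first step, using an inverted file to split training web images into those associated with a query tag and those that are not, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`build_inverted_index`, `split_by_tag`), the toy `web_images` collection, and the case-insensitive tag matching are all assumptions made for illustration.

```python
from collections import defaultdict


def build_inverted_index(image_tags):
    """Map each lowercased tag to the set of image ids that carry it."""
    index = defaultdict(set)
    for image_id, tags in image_tags.items():
        for tag in tags:
            index[tag.lower()].add(image_id)
    return index


def split_by_tag(index, all_ids, query_tag):
    """Return (relevant, irrelevant) image id sets for a query tag."""
    # Images listed under the tag are treated as relevant (positive) data.
    relevant = set(index.get(query_tag.lower(), set()))
    # Every other image is treated as irrelevant (negative) data.
    irrelevant = set(all_ids) - relevant
    return relevant, irrelevant


# Hypothetical toy collection: image id -> tags from its textual description.
web_images = {
    "img1": ["car", "street"],
    "img2": ["beach", "sunset"],
    "img3": ["Car", "race"],
    "img4": ["dog"],
}

index = build_inverted_index(web_images)
positives, negatives = split_by_tag(index, web_images.keys(), "car")
print(sorted(positives))  # ['img1', 'img3']
print(sorted(negatives))  # ['img2', 'img4']
```

In the framework the abstract describes, the two sets would then serve as positive and negative training data for the classifier; that stage is not shown here.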
Scopus citations: 47 (updated on Mar 13, 2025)
Web of Science citations: 34 (updated on Oct 29, 2023)
Page views: 998 (updated on Mar 19, 2025)
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.