dc.contributor.authorMei, Jian-Ping
dc.contributor.authorChen, Lihui
dc.date.accessioned2015-01-12T03:20:28Z
dc.date.available2015-01-12T03:20:28Z
dc.date.copyright2014en_US
dc.date.issued2014
dc.identifier.citationMei, J.-P., & Chen, L. (2014). Proximity-based k-partitions clustering with ranking for document categorization and analysis. Expert systems with applications, 41(16), 7095-7105.en_US
dc.identifier.issn0957-4174en_US
dc.identifier.urihttp://hdl.handle.net/10220/24579
dc.description.abstractAs one of the most fundamental yet important methods of data clustering, center-based partitioning approach clusters the dataset into k subsets, each of which is represented by a centroid or medoid. In this paper, we propose a new medoid-based k-partitions approach called Clustering Around Weighted Prototypes (CAWP), which works with a similarity matrix. In CAWP, each cluster is characterized by multiple objects with different representative weights. With this new cluster representation scheme, CAWP aims to simultaneously produce clusters of improved quality and a set of ranked representative objects for each cluster. An efficient algorithm is derived to alternatingly update the clusters and the representative weights of objects with respect to each cluster. An annealinglike optimization procedure is incorporated to alleviate the local optimum problem for better clustering results and at the same time to make the algorithm less sensitive to parameter setting. Experimental results on benchmark document datasets show that, CAWP achieves favourable effectiveness and efficiency in clustering, and also provides useful information for cluster-specified analysisen_US
dc.format.extent34 p.en_US
dc.language.isoenen_US
dc.relation.ispartofseriesExpert systems with applicationsen_US
dc.rights© 2014 Elsevier Ltd. This is the author created version of a work that has been peer reviewed and accepted for publication by Expert Systems with Applications, Elsevier Ltd. It incorporates referee’s comments but changes resulting from the publishing process, such as copyediting, structural formatting, may not be reflected in this document. The published version is available at: [http://dx.doi.org/10.1016/j.eswa.2014.06.016].en_US
dc.subjectDRNTU::Engineering::Electrical and electronic engineering::Computer hardware, software and systems
dc.titleProximity-based k-partitions clustering with ranking for document categorization and analysisen_US
dc.typeJournal Article
dc.contributor.schoolSchool of Electrical and Electronic Engineeringen_US
dc.identifier.doihttp://dx.doi.org/10.1016/j.eswa.2014.06.016
dc.description.versionAccepted versionen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record