Please use this identifier to cite or link to this item:
Title: Clustering techniques for web mining
Authors: Lu, Jiao.
Keywords: DRNTU::Engineering::Electrical and electronic engineering::Computer hardware, software and systems
Issue Date: 2009
Abstract: In an effort to keep up with the fast growth of World Wide Web, many Web Document Clustering techniques have been designed. These techniques can be used to increase the accuracy and efficiency of the users to find the relevant information they want from the internet. In this dissertation, a Web document clustering approach based on a phrase-based document Indexing has been implemented based on three merits. The first is the new document representation called Document index Graph (DIG), which is used to represent the document. The second is a new similarity measure between documents which is based on the matching phrases and their weights. The third concept is theincremental document clustering method. The objective of this dissertation is to design and implement the clustering system based on the concepts above. The implementation details, the experimental results and performance evaluation are reported.
Rights: Nanyang Technological University
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:EEE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
  Restricted Access
1.29 MBAdobe PDFView/Open

Page view(s) 10

Updated on Nov 23, 2020

Download(s) 10

Updated on Nov 23, 2020

Google ScholarTM


Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.