Please use this identifier to cite or link to this item:
Title: Web data mining on XML documents
Authors: Wen, Yi.
Keywords: DRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrieval
Issue Date: 2000
Abstract: The World Wide Web has become the largest and most convenient source of information. Searching and using Web data is becoming an important part in people's work and daily life. The need for collecting, extracting and analyzing information from the WWW is increasing. However, data existing on the Web is unstructured. The lack of structures in Web documents render traditional database techniques inapplicable for managing Web data. The advent of the extensible Markup Language(XML) has enabled flexible structuring of Web data. An XML document uses meaningful tags to provide semantic information for different parts of the document. An XML document is valid when it has a Document Type Definition(DTD) to describe its schema. An important issue for Web data mining is efficiently extracting data from Web documents and organize it into a proper form for data mining. Our work aims at extracting and transforming the Web data into a proper form such that they can be analyzed in a similar way as data mining in relational databases.
Schools: School of Computer Engineering 
Rights: Nanyang Technological University
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:SCSE Theses

Files in This Item:
File Description SizeFormat 
  Restricted Access
12.55 MBAdobe PDFView/Open

Page view(s) 50

Updated on Jun 14, 2024


Updated on Jun 14, 2024

Google ScholarTM


Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.