Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/13581
Title: Approximation algorithms for mining patterns from data streams
Authors: Dang, Xuan Hong
Keywords: DRNTU::Engineering::Computer science and engineering::Information systems::Database management
Issue Date: 2008
Source: Dang, X. H. (2008). Approximation algorithms for mining patterns from data streams. Doctoral thesis, Nanyang Technological University, Singapore.
Abstract: Traditional data mining techniques expect all data to be managed within some form of persistent datasets. Recently, for many emerging applications, such as stock tickers, web-click streams, and telecom call records, the concept of a \textit{data stream} is more appropriate than a stored dataset. Naturally, a data stream is generated continuously in a dynamic environment with huge volume, infinite flow, and fast changing behaviors. Furthermore, they usually arrive to a mining system in a push-based manner meanwhile system resources used in the mining process are generally restricted in advance. Consequently, there have been increasing demands for developing novel techniques that are able to discover interesting patterns from data streams while they work within system resource constraints. Moreover, the mining results returned by these techniques are often desirable to be guaranteed within some error. When such an important task is completed, it is strongly believed that the quality of making decisions can be improved significantly in streaming environments. This research aims to study and investigate various approximation algorithms in order to effectively and efficiently mine useful patterns from data streams under different system resource constraints. Two fundamental data mining tasks are explored in the streaming data context: frequent pattern discovering and cluster analysis. The contributions of this research are claimed as follows: A novel algorithm named EStream is developed to address the problem of online mining frequent patterns from data streams with precise error guarantee.
URI: https://hdl.handle.net/10356/13581
DOI: 10.32657/10356/13581
Fulltext Permission: open
Fulltext Availability: With Fulltext
Appears in Collections:SCSE Theses

Files in This Item:
File Description SizeFormat 
DangXuanHong08.pdfMain report1.65 MBAdobe PDFThumbnail
View/Open

Page view(s) 50

345
Updated on May 13, 2021

Download(s) 20

158
Updated on May 13, 2021

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.