Please use this identifier to cite or link to this item:
Title: Outlier detection
Authors: Li, Shukai
Keywords: DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
Issue Date: 2013
Publisher: Nanyang Technological University
Abstract: Outlier detection aims to capture or identify uncommon events or instances. This technique has been widely used in applications such as fraud detection, image processing and bioinformatics. Because of its diverse usage, outlier detection has emerged as a vibrant research topic in the fields of data mining, machine learning and statistics. In this thesis, we investigate four different kinds of outlier detection problems. Amongst them, unsupervised outlier detection has been the most popular, while relative outlier detection has attracted increasing attention in recent years. Thus, our research will focus on these two classes of outlier detection problems. Unsupervised outlier detection methods are used when there are no labeled patterns. For this kind of problems, we propose a Maximum Margin Criterion to segregate the unknown outliers from the normal patterns in a given set of samples. However, the corresponding learning task is formulated as a Mixed Integer Programming (MIP) problem, which is computationally hard. To address this issue, we adopt a recently developed label generating technique to efficiently solve a convex relaxation of the MIP problem for outlier detection. Specifically, we propose an effective procedure of successive approximation to find a largely violated labeling vector for identifying the outliers from the normal patterns. The convergence of such a procedure has also been established and presented. Subsequently, a set of largely violated labeling vectors are combined via multiple kernel learning methods to robustly detect the outliers. To further enhance the efficacy of our outlier detector, we also explore the use of the Maximum Volume Criterion to measure the quality of separation between the outliers and the normal patterns. This criterion can be easily incorporated into our proposed model by introducing an additional regularization term. The efforts culminate to two novel outlier detection models named Maximum Margin Outlier Detection (MMOD) and Maximum Volume Outlier Detection (MVOD) respectively.
Schools: School of Computer Engineering 
Research Centres: Centre for Computational Intelligence 
Rights: This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:SCSE Theses

Files in This Item:
File Description SizeFormat 
Li Shukai.pdf
  Restricted Access
3.29 MBAdobe PDFView/Open

Page view(s) 50

Updated on Jul 18, 2024


Updated on Jul 18, 2024

Google ScholarTM


Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.