Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/161923
Title: Persistent-homology-based machine learning: a survey and a comparative study
Authors: Pun, Chi Seng
Lee, Si Xian
Xia, Kelin
Keywords: Science::Mathematics
Issue Date: 2022
Source: Pun, C. S., Lee, S. X. & Xia, K. (2022). Persistent-homology-based machine learning: a survey and a comparative study. Artificial Intelligence Review, 55(7), 5169-5213. https://dx.doi.org/10.1007/s10462-022-10146-z
Project: M4081840
M4081842
M4082115
RG109/19
MOE2018-T2-1-033
MOE-T2EP20120-0013
Journal: Artificial Intelligence Review
Abstract: A suitable feature representation that can both preserve the data intrinsic information and reduce data complexity and dimensionality is key to the performance of machine learning models. Deeply rooted in algebraic topology, persistent homology (PH) provides a delicate balance between data simplification and intrinsic structure characterization, and has been applied to various areas successfully. However, the combination of PH and machine learning has been hindered greatly by three challenges, namely topological representation of data, PH-based distance measurements or metrics, and PH-based feature representation. With the development of topological data analysis, progresses have been made on all these three problems, but widely scattered in different literatures. In this paper, we provide a systematical review of PH and PH-based supervised and unsupervised models from a computational perspective. Our emphasizes are the recent development of mathematical models and tools, including PH software and PH-based functions, feature representations, kernels, and similarity models. Essentially, this paper can work as a roadmap for the practical application of PH-based machine learning tools. Further, we compare between two types of simplicial complexes (alpha and Vietrois-Rips complexes), two types of feature extractions (barcode statistics and binned features), and three types of machine learning models (support vector machines, tree-based models, and neural networks), and investigate their impacts on the protein secondary structure classification.
URI: https://hdl.handle.net/10356/161923
ISSN: 0269-2821
DOI: 10.1007/s10462-022-10146-z
Rights: © 2022 The Author(s), under exclusive licence to Springer Nature B.V.
Fulltext Permission: none
Fulltext Availability: No Fulltext
Appears in Collections:SPMS Journal Articles

Page view(s)

13
Updated on Dec 5, 2022

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.