Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/160380
Title: Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction
Authors: Liu, Xiang
Feng, Huitao
Wu, Jie
Xia, Kelin
Keywords: Science::Mathematics
Issue Date: 2021
Source: Liu, X., Feng, H., Wu, J. & Xia, K. (2021). Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction. Briefings in Bioinformatics, 22(5), bbab127-. https://dx.doi.org/10.1093/bib/bbab127
Project: M4081842
RG109/19
MOE2018-T2-1-033
Journal: Briefings in Bioinformatics
Abstract: Molecular descriptors are essential to not only quantitative structure activity/property relationship (QSAR/QSPR) models, but also machine learning based chemical and biological data analysis. In this paper, we propose persistent spectral hypergraph (PSH) based molecular descriptors or fingerprints for the first time. Our PSH-based molecular descriptors are used in the characterization of molecular structures and interactions, and further combined with machine learning models, in particular gradient boosting tree (GBT), for protein-ligand binding affinity prediction. Different from traditional molecular descriptors, which are usually based on molecular graph models, a hypergraph-based topological representation is proposed for protein-ligand interaction characterization. Moreover, a filtration process is introduced to generate a series of nested hypergraphs in different scales. For each of these hypergraphs, its eigen spectrum information can be obtained from the corresponding (Hodge) Laplacain matrix. PSH studies the persistence and variation of the eigen spectrum of the nested hypergraphs during the filtration process. Molecular descriptors or fingerprints can be generated from persistent attributes, which are statistical or combinatorial functions of PSH, and combined with machine learning models, in particular, GBT. We test our PSH-GBT model on three most commonly used datasets, including PDBbind-2007, PDBbind-2013 and PDBbind-2016. Our results, for all these databases, are better than all existing machine learning models with traditional molecular descriptors, as far as we know.
URI: https://hdl.handle.net/10356/160380
ISSN: 1467-5463
DOI: 10.1093/bib/bbab127
Schools: School of Physical and Mathematical Sciences 
Rights: © 2021 The authors. Published by Oxford University Press. All rights reserved.
Fulltext Permission: none
Fulltext Availability: No Fulltext
Appears in Collections:SPMS Journal Articles

SCOPUSTM   
Citations 20

14
Updated on Jun 4, 2023

Web of ScienceTM
Citations 20

12
Updated on Jun 6, 2023

Page view(s)

36
Updated on Jun 8, 2023

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.