Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/105988
Full metadata record
DC FieldValueLanguage
dc.contributor.authorAta, Sezin Kircalien
dc.contributor.authorOu-Yang, Leen
dc.contributor.authorFang, Yuanen
dc.contributor.authorKwoh, Chee-Keongen
dc.contributor.authorWu, Minen
dc.contributor.authorLi, Xiao-Lien
dc.date.accessioned2019-06-19T02:57:35Zen
dc.date.accessioned2019-12-06T22:02:19Z-
dc.date.available2019-06-19T02:57:35Zen
dc.date.available2019-12-06T22:02:19Z-
dc.date.issued2018en
dc.identifier.citationAta, S. K., Ou-Yang, L., Fang, Y., Kwoh, C.-K., Wu, M., & Li, X.-L. (2018). Integrating node embeddings and biological annotations for genes to predict disease-gene associations. BMC Systems Biology, 12(S9), 138-. doi:10.1186/s12918-018-0662-yen
dc.identifier.urihttps://hdl.handle.net/10356/105988-
dc.description.abstractBackground : Predicting disease causative genes (or simply, disease genes) has played critical roles in understanding the genetic basis of human diseases and further providing disease treatment guidelines. While various computational methods have been proposed for disease gene prediction, with the recent increasing availability of biological information for genes, it is highly motivated to leverage these valuable data sources and extract useful information for accurately predicting disease genes. Results : We present an integrative framework called N2VKO to predict disease genes. Firstly, we learn the node embeddings from protein-protein interaction (PPI) network for genes by adapting the well-known representation learning method node2vec. Secondly, we combine the learned node embeddings with various biological annotations as rich feature representation for genes, and subsequently build binary classification models for disease gene prediction. Finally, as the data for disease gene prediction is usually imbalanced (i.e. the number of the causative genes for a specific disease is much less than that of its non-causative genes), we further address this serious data imbalance issue by applying oversampling techniques for imbalance data correction to improve the prediction performance. Comprehensive experiments demonstrate that our proposed N2VKO significantly outperforms four state-of-the-art methods for disease gene prediction across seven diseases. Conclusions : In this study, we show that node embeddings learned from PPI networks work well for disease geneprediction, while integrating node embeddings with other biological annotations further improves the performanceof classification models. Moreover, oversampling techniques for imbalance correction further enhances the prediction performance. In addition, the literature search of predicted disease genes also shows the effectiveness of our proposed N2VKO framework for disease gene prediction.en
dc.description.sponsorshipMOE (Min. of Education, S’pore)en
dc.format.extent14 p.en
dc.language.isoenen
dc.relation.ispartofseriesBMC Systems Biologyen
dc.rights© 2018 The Author(s). This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stateden
dc.subjectDRNTU::Engineering::Computer science and engineeringen
dc.subjectDisease Gene Predictionen
dc.subjectNode Embeddingsen
dc.titleIntegrating node embeddings and biological annotations for genes to predict disease-gene associationsen
dc.typeJournal Articleen
dc.contributor.schoolSchool of Computer Science and Engineeringen
dc.identifier.doi10.1186/s12918-018-0662-yen
dc.description.versionPublished versionen
item.fulltextWith Fulltext-
item.grantfulltextopen-
Appears in Collections:SCSE Journal Articles

SCOPUSTM   
Citations 20

24
Updated on Mar 27, 2024

Web of ScienceTM
Citations 20

19
Updated on Oct 29, 2023

Page view(s) 50

436
Updated on Mar 27, 2024

Download(s) 50

64
Updated on Mar 27, 2024

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.