Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/171862
Full metadata record
DC FieldValueLanguage
dc.contributor.authorGuan, Jia Shengen_US
dc.date.accessioned2023-11-14T06:42:31Z-
dc.date.available2023-11-14T06:42:31Z-
dc.date.issued2023-
dc.identifier.citationGuan, J. S. (2023). Enzyme catalytic residue prediction using deep learning methods. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/171862en_US
dc.identifier.urihttps://hdl.handle.net/10356/171862-
dc.description.abstractIdentification of catalytic residues in enzymes have important applications ranging from drug discovery to protein engineering. However, locating catalytic residues in laboratory is time consuming and costly. Through high throughput computational methods, potential catalytic residues could be elucidated. While many models trained to predict catalytic residues were published, there are still unexplored combinations of model features and data preparation methods. In this project, graph neural network (GNN) and multi-layer perceptron (MLP) models were constructed to predict catalytic residues. The choice of edge weight equation was discovered to have huge impact on GNN model performance. Embeddings from a large protein language model, Evolutionary Scale Modeling 2 (ESM-2), were experimented and found suitable as features for MLP and GNN models, rivaling many published models in performance. Atchley factors as features were investigated but results hinted that the information might have already been included in the ESM-2 embeddings. To address knowledge gap, structural information of entire protein complex was considered as GNN model feature but found no benefits as compared to using only monomer structures as in published models. To resolve class imbalance issue, down-sampling of non-catalytic to catalytic residues to a 10:1 ratio was tested but it did not improve models’ performances.en_US
dc.language.isoenen_US
dc.publisherNanyang Technological Universityen_US
dc.subjectScience::Biological sciencesen_US
dc.titleEnzyme catalytic residue prediction using deep learning methodsen_US
dc.typeFinal Year Project (FYP)en_US
dc.contributor.supervisorMu Yuguangen_US
dc.contributor.schoolSchool of Biological Sciencesen_US
dc.description.degreeBachelor of Science in Biological Sciencesen_US
dc.contributor.supervisoremailYGMu@ntu.edu.sgen_US
item.grantfulltextrestricted-
item.fulltextWith Fulltext-
Appears in Collections:SBS Student Reports (FYP/IA/PA/PI)
Files in This Item:
File Description SizeFormat 
FYP Report.pdf
  Restricted Access
Undergraduate project report1.44 MBAdobe PDFView/Open

Page view(s)

170
Updated on Jul 24, 2024

Download(s)

45
Updated on Jul 24, 2024

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.