Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/178506
Title: PESI: paratope-epitope set interaction for SARS-CoV-2 neutralization prediction
Authors: Wan, Zhang
Lin, Zhuoyi
Rashid, Shamima
Ng, Shaun Yue Hao
Yin, Rui
Senthilnath, J.
Kwoh, Chee Keong
Keywords: Computer and Information Science
Medicine, Health and Life Sciences
Issue Date: 2023
Source: Wan, Z., Lin, Z., Rashid, S., Ng, S. Y. H., Yin, R., Senthilnath, J. & Kwoh, C. K. (2023). PESI: paratope-epitope set interaction for SARS-CoV-2 neutralization prediction. 2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 49-56. https://dx.doi.org/10.1109/BIBM58861.2023.10386059
Project: MOE2019-T2-2-175 
Conference: 2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
Abstract: Prediction of neutralization antibodies is important for the development of effective vaccines and antibody-based therapeutics. Traditional methods rely on features based on first principles derived from the binding interface. However, they are burdened by arduous data preprocessing from a limited quantity of protein structures. In comparison, deep learning allows automatic substructure characterization and representation without hand-crafted feature engineering. In particular, large language models (LLMs) based method predicts neutralization using Fv sequences of antibody and antigen. Despite LLM's success, incorporating full-length Fv sequences suffers from: 1) inaccurate sequence-level labels in existing datasets, 2) inefficient modeling due to noisy non-contributing motifs, and 3) ignorance of non-bonded interactions that play a key role in facilitating epitope-paratope pairing. In this paper, we propose a novel approach that incorporates only the paratope and epitope for antibody-antigen neutralization prediction while adopting a novel set modeling that regards the paratope and epitope as bags of residues. Specifically, we hand-crafted a dataset containing neutralizing paratope-epitope pairs where epitopes are potentially generalizable to future unseen variants of SARS-CoV-2. Training on such a dataset enables deep learning models to predict neutralizing antibodies for prospective mutated variants of SARS-CoV-2, meanwhile addressing the problem of inaccurate sequence-level labels. A higher modeling efficiency is also achieved by disregarding non-contributing motifs. Furthermore, we also propose paratope-epitope set interaction (PESI), a set modeling model inspired by first principles that learns intra-inter non-covalent interactions through a global attention mechanism. To validate PESI, we perform a 10-fold cross-validation on our dataset. Experimental results show that PESI achieves a more balanced overall performance and a significant improvement on MCC as compared to existing architectures.
URI: https://hdl.handle.net/10356/178506
ISBN: 9798350337488
ISSN: 2156-1133
DOI: 10.1109/BIBM58861.2023.10386059
Schools: College of Computing and Data Science 
School of Computer Science and Engineering 
School of Chemical and Biomedical Engineering 
Research Centres: Biomedical Informatics Lab 
Rights: © 2023 IEEE. All rights reserved. This article may be downloaded for personal use only. Any other use requires prior permission of the copyright holder. The Version of Record is available online at http://doi.org/10.1109/BIBM58861.2023.10386059.
Fulltext Permission: open
Fulltext Availability: With Fulltext
Appears in Collections:CCDS Conference Papers

Files in This Item:
File Description SizeFormat 
PESI_BIBM_2023_Aug13_Submission.pdfMain Manuscript678.05 kBAdobe PDFView/Open
PESI_SupplementaryMaterial.pdfSupplementary File60.41 kBAdobe PDFView/Open

Page view(s)

35
Updated on Sep 8, 2024

Download(s) 50

25
Updated on Sep 8, 2024

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.