Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/175082
Title: Exploring the similarities of XAI approaches in finding the influential factors in lung cancer survival rate
Authors: Tan, Elise Zining
Keywords: Computer and Information Science
Issue Date: 2024
Publisher: Nanyang Technological University
Source: Tan, E. Z. (2024). Exploring the similarities of XAI approaches in finding the influential factors in lung cancer survival rate. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/175082
Project: SCSE23-0510 
Abstract: Lung cancer is the most common cancer and the leading cause of cancer-related deaths globally. While Artificial Intelligence (AI) offers promise in early detection of high-risk patients and providing treatment decision support, the black-box nature of AI models raises trust concerns. Therefore, eXplainable Artificial Intelligence (XAI) is important to explain models and identify critical factors affecting lung cancer survival. In addition, real datasets frequently contain missing values due to diverse reasons. In this work, the Simulacrum dataset was used, and different statistical and machine learning imputation methods like Mean-Mode, K-Nearest Neighbors (KNN), MissForest were used to impute the missing data. An innovative neural network imputation method was proposed to evaluate whether the sequence of imputing the features with missing values matters. Neural network models were then trained on the imputed datasets to predict lung cancer patient survival. The results revealed that the models trained on imputed datasets obtained from machine learning imputation methods had better performance and the sequence in which features were imputed had minimal impact on the performance. For interpretability, different XAI approaches were deployed, namely, Expected Gradients (EG), SHapley Additive exPlanations (SHAP) and Local Interpretable Model-Agnostic Explanations (LIME). The analysis revealed that important features were related to the tumor condition, age and dose administration, largely unaffected by different imputation methods. Pairwise similarity metrics such as Jaccard Similarity, Pearson Correlation Coefficient and Cosine Similarity were used to compare the XAI approaches. EG and SHAP exhibited the highest similarity to each other, though there were some notable inconsistencies between their explanations. This paper underscores the usefulness of XAI for medical professionals in understanding model decisions and emphasizes the importance of cross-validating explanations from various XAI approaches to evaluate model reliability.
URI: https://hdl.handle.net/10356/175082
Schools: School of Computer Science and Engineering 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:SCSE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
Amended Final FYP Report - Tan Zining, Elise.pdf
  Restricted Access
2.3 MBAdobe PDFView/Open

Page view(s)

117
Updated on May 7, 2025

Download(s)

10
Updated on May 7, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.