Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/150018
Title: The study of Rashomon effects on machine learning : a case study on breast cancer
Authors: Wee, Yu Hui
Keywords: Science::Biological sciences::Genetics
Issue Date: 2021
Publisher: Nanyang Technological University
Source: Wee, Y. H. (2021). The study of Rashomon effects on machine learning : a case study on breast cancer. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/150018
Abstract: The Rashomon effect is a theory that suggests the presence of multiple uncorrelated observations and explanations that can be made for a single observation. This theory has been translated into a popular machine learning method: Random Forests which uses bootstrapping (bagging) algorithms to create a set of uncorrelated decision trees that together make the decision (prediction) of the final result. In this study, we will be using 3 ER breast cancer datasets as a case study and we look at the results of the selection of each individual tree in the forest using the standard random forest algorithms and when bootstrapping of the attributes was removed. We found that most forests converged into a few highly correlate gene signatures which dominates the prediction and masks the errors of non-accurate models. Besides, because the random forest algorithm can generate highly accurate with a group of and non-predictive signatures, we need to be careful when using random forest machine models for prediction in the field of cancer biology.
URI: https://hdl.handle.net/10356/150018
Schools: School of Biological Sciences 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:SBS Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
fypreport.pdf
  Restricted Access
5.53 MBAdobe PDFView/Open

Page view(s) 50

495
Updated on May 7, 2025

Download(s)

20
Updated on May 7, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.