Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/148294
Full metadata record
DC FieldValueLanguage
dc.contributor.authorYeo, Jarrett Shan Weien_US
dc.date.accessioned2021-05-04T02:05:41Z-
dc.date.available2021-05-04T02:05:41Z-
dc.date.issued2021-
dc.identifier.citationYeo, J. S. W. (2021). Predicting stock market index with gradient boosting machine ensemble, bayesian optimization, temporal consistency analysis, market sentiment analysis, game theory and novel holdout method. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/148294en_US
dc.identifier.urihttps://hdl.handle.net/10356/148294-
dc.description.abstractThe potential of machine learning has sustained the interest of both academia and industry in stock market prediction for over the past decade. This project aims to integrate modern techniques used in the field into a resource-efficient and accurate stock index predictor. While Gradient Boosting Machines (GBMs) have been around for more than twenty years, they have recently received a revival in popularity of modern gradient-boosted decision trees such as XGBoost in 2014, and LightGBM and CatBoost in 2017. Additionally, literature in stock market prediction field has been focused on the use of macro-economic metrics, the creation of technical financial indicators, and more recently, the analysis of social media big data as well. This project serves to unify such techniques into an efficient yet effective ensemble called CalixBoost Ensemble of the GBMs using the aforementioned data. The models are tuned with Bayesian Optimization, and temporal consistency analysis is also used for invariant feature selection over random trial-and-error. Market sentiment analysis is then conducted using a simple and fast but effective rule-based model tuned specifically for understanding social media posts. Finally, the feature importance and inter-feature relationships of every model will be explained using a unified game theory approach using Shapley values to better appreciate their inner workings. All models will be evaluated using a novel holdout method, viz. on two separate test datasets whose datapoints are collected under different conditions: first, normal economic activity; and second, during a black swan / financial downturn.en_US
dc.language.isoenen_US
dc.publisherNanyang Technological Universityen_US
dc.subjectEngineering::Computer science and engineering::Information systemsen_US
dc.titlePredicting stock market index with gradient boosting machine ensemble, bayesian optimization, temporal consistency analysis, market sentiment analysis, game theory and novel holdout methoden_US
dc.typeFinal Year Project (FYP)en_US
dc.contributor.supervisorYeo Chai Kiaten_US
dc.contributor.schoolNanyang Business Schoolen_US
dc.description.degreeBachelor of Businessen_US
dc.description.degreeBachelor of Engineering (Computer Science)en_US
dc.contributor.supervisoremailASCKYEO@ntu.edu.sgen_US
item.grantfulltextrestricted-
item.fulltextWith Fulltext-
Appears in Collections:SCSE Student Reports (FYP/IA/PA/PI)
Files in This Item:
File Description SizeFormat 
Amended Final Report (Submission) - Jarrett Yeo Shan Wei.pdf
  Restricted Access
1.62 MBAdobe PDFView/Open

Page view(s) 50

500
Updated on Apr 25, 2025

Download(s) 50

46
Updated on Apr 25, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.