Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/184536
Title: Enhancing financial market volatility prediction: a machine learning approach integrating sentiment analysis and macroeconomic indicators
Authors: Liaw, Celest Jia Xuan
Lee, Han Ni
Low, Nicole Wan Ting
Keywords: Computer and Information Science
Social Sciences
Issue Date: 2025
Publisher: Nanyang Technological University
Source: Liaw, C. J. X., Lee, H. N. & Low, N. W. T. (2025). Enhancing financial market volatility prediction: a machine learning approach integrating sentiment analysis and macroeconomic indicators. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/184536
Project: HE1AY2425_21
Abstract: This report aims to explore the integration of sentiment-derived features with machine- learning (ML) algorithms – LightGBM (LGBM) and XGBoost to improve volatility forecasting. Financial market volatility forecasting continues to be a significant challenge in risk management and investment decision-making. In this study, sentiment data from Reddit posts, representing retail investor sentiment, and The Wall Street Journal, reflecting institutional perspectives alongside historical market index data from 2022 to 2024 were added into our ML models to assess their impact on volatility forecasting. Utilising these ML models, the study evaluates whether sentiment enhanced models offer additional predictive power beyond baseline models that rely solely on historical data and traditional econometric approaches, such as EGARCH. The results show that LGBM and XGBoost models consistently outperform EGARCH, demonstrating lower mean squared errors (MSE). Incorporating sentiment data from both Reddit and the Wall Street Journal with lagged historical data significantly improves predictive accuracy, achieving the lowest MSE for both Dow Jones and S&P 500 indexes. Additionally, while macroeconomic indicators contribute to model performance, they are less effective than historical market trends in short-term volatility forecasting. Diebold-Mariano test results along with a simulation study reinforces the statistical significance of machine learning models' enhanced accuracy over EGARCH. These findings strengthen the case for integrating machine learning techniques into volatility forecasting as viable alternatives to traditional econometric models in financial applications
URI: https://hdl.handle.net/10356/184536
Schools: School of Social Sciences 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:SSS Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
GP_Report.pdf
  Restricted Access
817 kBAdobe PDFView/Open

Page view(s)

113
Updated on May 7, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.