Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/184074
Title: | Spatio-temporal analysis of public transportation systems using network autoregressive models | Authors: | Zaki Bin Zainudin | Keywords: | Computer and Information Science Mathematical Sciences |
Issue Date: | 2025 | Publisher: | Nanyang Technological University | Source: | Zaki Bin Zainudin (2025). Spatio-temporal analysis of public transportation systems using network autoregressive models. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/184074 | Project: | CCDS24-0267 | Abstract: | This research study covers the literature behind statistical spatio-temporal analysis as well as the progress made thus far in exploring its applications in the domain of public transportation. The research project delves into data provided by Open Data NY covering the New York City (NYC) Subway System, making use of statistical methods to accurately forecast future changes. This analysis showcases the usage and application of the Generalised Network Autoregressive (GNAR) model as proposed by authors Knight and Leeming, to fit a model to predict the ridership of the various subway stations around NYC. Authors Nason and Wei showcased the model’s prowess in the domain of economics by quantifying and predicting the economic response to COVID-19 across various European countries. This paper showcases the GNAR model’s use case in the domain of transport, making use of the subway station’s pre-existing graph network structure to aid in predictions. As it concludes, the paper would have gone through the steps taken to extract, transform, and load the data making use of Python and the Pandas library. The raw data obtained using the MTA API will be transformed into graphical networks and vector time series’ compatible with the autoregressive model. Subsequent analyses would be conducted in R, outlining the conditions and consolations given to provide a model that would be useful in forecasting. This includes an investigation to the Bayesian Information Criterion (BIC) of each model, observations of its predictive power represented by the mean squared prediction error (MSPE) and explained variance (R-squared) of each model. The paper also covers several challenges faced during the data exploration of model fitting, offering potential solutions to future renditions of this project. Specifically, improvements to parameter selection and graph network construction have been outlined in the closing review, along with extensions to compare regression-based analyses against machine learning and large language model (LLM) based analyses. | URI: | https://hdl.handle.net/10356/184074 | Schools: | College of Computing and Data Science | Fulltext Permission: | restricted | Fulltext Availability: | With Fulltext |
Appears in Collections: | CCDS Student Reports (FYP/IA/PA/PI) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Zaki FYP Final Report_Submission.pdf Restricted Access | Zaki_FYP_Report_Final | 2.31 MB | Adobe PDF | View/Open |
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.