Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/173568
Title: A fair evaluation of the potential of machine learning in maritime transportation
Authors: Luo, Xi
Yan, Ran
Wang, Shuaian
Zhen, Lu
Keywords: Engineering
Issue Date: 2023
Source: Luo, X., Yan, R., Wang, S. & Zhen, L. (2023). A fair evaluation of the potential of machine learning in maritime transportation. Electronic Research Archive, 31(8), 4753-4772. https://dx.doi.org/10.3934/era.2023243
Journal: Electronic Research Archive 
Abstract: Machine learning (ML) techniques are extensively applied to practical maritime transportation issues. Due to the difficulty and high cost of collecting large volumes of data in the maritime industry, in many maritime studies, ML models are trained with small training datasets. The relative predictive performances of these trained ML models are then compared with each other and with the conventional model using the same test set. The ML model that performs the best out of the ML models and better than the conventional model on the test set is regarded as the most effective in terms of this prediction task. However, in scenarios with small datasets, this common process may lead to an unfair comparison between the ML and the conventional model. Therefore, we propose a novel process to fairly compare multiple ML models and the conventional model. We first select the best ML model in terms of predictive performance for the validation set. Then, we combine the training and the validation sets to retrain the best ML model and compare it with the conventional model on the same test set. Based on historical port state control (PSC) inspection data, we examine both the common process and the novel process in terms of their ability to fairly compare ML models and the conventional model. The results show that the novel process is more effective at fairly comparing the ML models with the conventional model on different test sets. Therefore, the novel process enables a fair assessment of ML models’ ability to predict key performance indicators in the context of limited data availability in the maritime industry, such as predicting the ship fuel consumption and port traffic volume, thereby enhancing their reliability for real-world applications.
URI: https://hdl.handle.net/10356/173568
ISSN: 2688-1594
DOI: 10.3934/era.2023243
Schools: School of Civil and Environmental Engineering 
Rights: © 2023 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0).
Fulltext Permission: open
Fulltext Availability: With Fulltext
Appears in Collections:CEE Journal Articles

Files in This Item:
File Description SizeFormat 
10.3934_era.2023243.pdf28.23 MBAdobe PDFThumbnail
View/Open

Page view(s)

189
Updated on May 7, 2025

Download(s) 50

99
Updated on May 7, 2025

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.