Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/158278
Title: Process control of water treatment facilities using machine learning method
Authors: Tey, Shiyang
Keywords: Engineering::Environmental engineering::Water treatment
Issue Date: 2022
Publisher: Nanyang Technological University
Source: Tey, S. (2022). Process control of water treatment facilities using machine learning method. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/158278
Project: EN-19
Abstract: The water industry in Singapore is increasingly incorporating the use of Industrial Control System (ICS) which introduces cyber-physical systems (CPS) in water treatment plants. Along with the highly efficient automated processes, the connectivity of the systems instigates new means of cyber-attacks threats. For research, a Secure Water Treatment (SWaT) testbed was jointly established by Singapore’s authorities and SUTD to provide a facility to study the security of CPS. This study aims to improve and optimize previously developed anomaly detection scripts against possible attacks on the testbed. Previous studies utilized NGBoost (NGB) which is a gradient boosting model (GBM) which outputs probabilistic predictions as the main algorithm to perform anomaly detection. Probabilistic predictions were used to estimate uncertainties to aid in the judgement of a model’s prediction. XGBoost-Distritbution (XGBD) was discovered to be a more efficient gradient boosting model compared to NGBoost (NGB) while also providing probabilistic predictions. XGBD was found to perform predictions 30 times faster and train 18 times faster than NGB. However, XGBD’s overall performance on the validation set has a 10% higher RMSE and a 25% higher MAE than NGB’s overall performance on the validation set. After comparing the significant reduction of computational time and slightly inferior accuracy, it was optimistic that XGBD is a more suitable model candidate for this project’s application. To maintain the performance of models during real-time prediction, factors affecting the degradation of performance were identified. In addition, mitigation methods were proposed to continuously improve the training data and to use the improvised training data to update and retrain models.
URI: https://hdl.handle.net/10356/158278
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:CEE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
FYP final.pdf
  Restricted Access
4.41 MBAdobe PDFView/Open

Page view(s)

37
Updated on Dec 2, 2022

Download(s)

5
Updated on Dec 2, 2022

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.