Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/162965
Title: Automatic clustering for unsupervised risk diagnosis of vehicle driving for smart road
Authors: Shi, Xiupeng
Wong, Yiik Diew
Chai, Chen
Li, Michael Zhi Feng
Chen, Tianyi
Zeng, Zeng
Keywords: Engineering::Civil engineering
Issue Date: 2022
Source: Shi, X., Wong, Y. D., Chai, C., Li, M. Z. F., Chen, T. & Zeng, Z. (2022). Automatic clustering for unsupervised risk diagnosis of vehicle driving for smart road. IEEE Transactions on Intelligent Transportation Systems, 23(10), 17451-17465. https://dx.doi.org/10.1109/TITS.2022.3166838
Journal: IEEE Transactions on Intelligent Transportation Systems
Abstract: Early risk diagnosis and driving anomaly detection from vehicle streams are of great benefit to a range of advanced solutions for Smart Road and crash prevention, although there are intrinsic challenges, especially the lack of ground truth and the definition of multiple risk exposures. This study proposes a domain-specific automatic clustering method (termed AutoCluster) that self-learns the optimal models for unsupervised risk assessment. It integrates the key steps of clustering into an auto-optimisable pipeline, including feature selection, algorithm selection, and hyperparameter auto-tuning. Firstly, based on surrogate conflict measures, a series of risk indicator features is constructed to represent temporal-spatial and kinematical risk exposures. Then, we develop an unsupervised feature selection method to identify the useful features by elimination-based model reliance importance (EMRI). Secondly, we propose the balanced Silhouette Index (bSI) to evaluate the internal quality of imbalanced clustering, and design a loss function that considers clustering performance in terms of internal quality, inter-cluster variation, and model stability. Thirdly, based on Bayesian optimisation, algorithm auto-selection and hyperparameter auto-tuning are self-learned to generate the best clustering results. NGSIM vehicle trajectory data are used for test-bedding. Findings show that AutoCluster is reliable and promising for diagnosing multiple distinct risk levels inherent to generalised driving behaviour. We also examine aspects of risk clustering such as algorithm heterogeneity, Silhouette analysis, and hierarchical clustering flows. AutoCluster also serves as a method for unsupervised data labelling and indicator threshold calibration, and is useful for tackling the challenges of imbalanced clustering without ground truth or a priori knowledge.
URI: https://hdl.handle.net/10356/162965
ISSN: 1524-9050
DOI: 10.1109/TITS.2022.3166838
Schools: School of Civil and Environmental Engineering 
Nanyang Business School 
Rights: © 2022 IEEE. All rights reserved.
Fulltext Permission: none
Fulltext Availability: No Fulltext
Appears in Collections: CEE Journal Articles
NBS Journal Articles

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.