Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/182434
Title: | Interactive prediction and decision-making for autonomous vehicles: online active learning with traffic entropy minimization | Authors: | Zhang, Yiran Lou, Shanhe Hang, Peng Huang, Wenhui Yang, Lie Lv, Chen |
Keywords: | Engineering | Issue Date: | 2024 | Source: | Zhang, Y., Lou, S., Hang, P., Huang, W., Yang, L. & Lv, C. (2024). Interactive prediction and decision-making for autonomous vehicles: online active learning with traffic entropy minimization. IEEE Transactions On Intelligent Transportation Systems, 25(11), 17718-17732. https://dx.doi.org/10.1109/TITS.2024.3419003 | Project: | M22K2c0079 NRF2021-NRF-ANR003 HM Science MOE-T2EP50222-0002 |
Journal: | IEEE Transactions on Intelligent Transportation Systems | Abstract: | Interacting with the surrounding road users is crucial for autonomous vehicles (AV). However, the inherent multimodality and uncertainties associated with traffic participants (TP) pose challenges in AVs' prediction and decision-making (PnD). A primary challenge is adapting predictors trained on static offline datasets to the dynamic, diverse data streams encountered in reality. Secondly, utilizing one single forecast trajectory with the highest probability for decision-making contains potential risks as it neglects that even a small probability represents a subset of TP behaviors. Based on the existing prediction backbone, we propose an online learning approach incorporating pseudo-labels inferred from partial feedback as compensation for conventional methodologies, considering both the commonsense and personalization facets of driving. Drawing inspiration from the second law of thermodynamics, we propose to minimize microscopic traffic entropy as an additional objective in decision-making. This objective aims to reduce the chaos of traffic scenes, thus achieving more predictable future interactions and, conversely, making future decisions easier. Through real-time human-in-the-loop experiments, we quantifiably and comparably reveal that adopting one single trajectory without online learning in PnD is risky. However, this reliability is verified to be significantly improved by our proposed techniques, and the efficacy is further analyzed in a subsequent qualitative study. A static experiment transferring the prediction algorithm trained exclusively on Argoverse 2 to datasets including NGSIM, HighD, RounD, and NuScenes is also conducted, demonstrating that the proposed correction can effectively mitigate the gap between the datasets and real-world scenarios. | URI: | https://hdl.handle.net/10356/182434 | ISSN: | 1524-9050 | DOI: | 10.1109/TITS.2024.3419003 | Schools: | School of Mechanical and Aerospace Engineering | Rights: | © 2024 IEEE. All rights reserved. | Fulltext Permission: | none | Fulltext Availability: | No Fulltext |
Appears in Collections: | MAE Journal Articles |
SCOPUSTM
Citations
50
1
Updated on Mar 23, 2025
Page view(s)
26
Updated on Mar 24, 2025
Google ScholarTM
Check
Altmetric
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.