Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/143864
Title: DCL-AIM : decentralized coordination learning of autonomous intersection management for connected and automated vehicles
Authors: Wu, Yuanyuan
Chen, Haipeng
Zhu, Feng
Keywords: Engineering::Civil engineering
Issue Date: 2019
Source: Wu, Y., Chen, H., & Zhu, F. (2019). DCL-AIM: Decentralized coordination learning of autonomous intersection management for connected and automated vehicles. Transportation Research Part C: Emerging Technologies, 103, 246–260. doi:10.1016/j.trc.2019.04.012
Journal: Transportation Research Part C: Emerging Technologies
Abstract: Conventional intersection managements, such as signalized intersections, may not necessarily be the optimal strategies when it comes to connected and automated vehicles (CAVs) environment. Autonomous intersection management (AIM) is tailored for CAVs aiming at replacing the conventional traffic control strategies. In this work, using the communication and computation technologies of CAVs, the sequential movements of vehicles through intersections are modelled as multi-agent Markov decision processes (MAMDPs) in which vehicle agents cooperate to minimize intersection delay with collision-free constraints. To handle the huge dimension scale incurred by the nature of multi-agent decision making problems, the state space of CAVs are decomposed into independent part and coordinated part by exploiting the structural properties of the AIM problem, and a decentralized coordination multi-agent learning approach (DCL-AIM) is proposed to solve the problem efficiently by exploiting both global and localized agent coordination needs in AIM. The main feature of the proposed approach is to explicitly identify and dynamically adapt agent coordination needs during the learning process so that the curse of dimensionality and environment nonstationarity problems in multi-agent learning can be alleviated. The effectiveness of the proposed method is demonstrated under a variety of traffic conditions. The comparison analysis is performed between DCL-AIM and the First-Come-First-Serve based AIM (FCFS-AIM), with Longest-Queue-First (LQF-AIM) policy and the signal control based on the Webster’s method (Signal) as benchmarks. Experimental results show that the sequential decisions from DCL-AIM outperform the other control policies.
URI: https://hdl.handle.net/10356/143864
ISSN: 0968-090X
DOI: 10.1016/j.trc.2019.04.012
Schools: School of Civil and Environmental Engineering 
Rights: © 2019 Elsevier Ltd. All rights reserved. This paper was published in Transportation Research Part C: Emerging Technologies and is made available with permission of Elsevier Ltd.
Fulltext Permission: open
Fulltext Availability: With Fulltext
Appears in Collections:CEE Journal Articles

SCOPUSTM   
Citations 5

131
Updated on Mar 26, 2025

Web of ScienceTM
Citations 5

74
Updated on Oct 28, 2023

Page view(s)

330
Updated on Mar 27, 2025

Download(s) 10

411
Updated on Mar 27, 2025

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.