Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/182740
Title: A multi-aircraft co-operative trajectory planning model under dynamic thunderstorm cells using decentralized deep reinforcement learning
Authors: Pang, Bizhao
Hu, Xinting
Zhang, Mingcheng
Alam, Sameer
Lulli, Guglielmo
Keywords: Engineering
Issue Date: 2025
Source: Pang, B., Hu, X., Zhang, M., Alam, S. & Lulli, G. (2025). A multi-aircraft co-operative trajectory planning model under dynamic thunderstorm cells using decentralized deep reinforcement learning. Advanced Engineering Informatics, 65, 103157-. https://dx.doi.org/10.1016/j.aei.2025.103157
Journal: Advanced Engineering Informatics 
Abstract: Climate change induces an increased frequency of adverse weather, particularly thunderstorms, posing significant safety and efficiency challenges in en route airspace, especially in oceanic regions with limited air traffic control services. These conditions require multi-aircraft cooperative trajectory planning to avoid both dynamic thunderstorms and other aircraft. Existing literature has typically relied on centralized approaches and single-agent principles, which lack coordination and robustness when surrounding aircraft or thunderstorms change paths, leading to scalability issues due to heavy trajectory regeneration needs. To address these gaps, this paper introduces a multi-agent cooperative method for autonomous trajectory planning. The problem is modeled as a Decentralized Markov Decision Process (DEC-MDP) and solved using an Independent Deep Deterministic Policy Gradient (IDDPG) learning framework. A shared actor-critic network is trained using combined experiences from all aircraft to optimize joint behavior. During execution, each aircraft acts independently based on its own observations, with coordination ensured through the shared policy. The model is validated through extensive simulations, including uncertainty analysis, baseline comparisons, and ablation studies. Under known thunderstorm paths, the model achieved a 2 % loss of separation rate, increasing to 4 % with random storm paths. ETA uncertainty analysis demonstrated the model's robustness, while baseline comparisons with the Fast Marching Tree and centralized DDPG highlighted its scalability and efficiency. These findings contribute to advancing autonomous aircraft operations.
URI: https://hdl.handle.net/10356/182740
ISSN: 1474-0346
DOI: 10.1016/j.aei.2025.103157
Schools: School of Mechanical and Aerospace Engineering 
Research Centres: Air Traffic Management Research Institute 
Rights: © 2025 Elsevier Ltd.. All rights reserved. This article may be downloaded for personal use only. Any other use requires prior permission of the copyright holder. The Version of Record is available online at http://doi.org/10.1016/j.aei.2025.103157.
Fulltext Permission: embargo_20270210
Fulltext Availability: With Fulltext
Appears in Collections:ATMRI Journal Articles

Files in This Item:
File Description SizeFormat 
Manuscript_AEI.pdf
  Until 2027-02-10
6.28 MBAdobe PDFUnder embargo until Feb 10, 2027

Page view(s)

32
Updated on Mar 17, 2025

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.