Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/170774
Title: | Deep reinforcement learning for UAV routing in the presence of multiple charging stations | Authors: | Fan, Mingfeng Wu, Yaoxin Liao, Tianjun Cao, Zhiguang Guo, Hongliang Sartoretti, Guillaume Wu, Guohua |
Keywords: | Engineering::Computer science and engineering | Issue Date: | 2023 | Source: | Fan, M., Wu, Y., Liao, T., Cao, Z., Guo, H., Sartoretti, G. & Wu, G. (2023). Deep reinforcement learning for UAV routing in the presence of multiple charging stations. IEEE Transactions On Vehicular Technology, 72(5), 5732-5746. https://dx.doi.org/10.1109/TVT.2022.3232607 | Journal: | IEEE Transactions on Vehicular Technology | Abstract: | Deploying Unmanned Aerial Vehicles (UAVs) for traffic monitoring has been a hotspot given their flexibility and broader view. However, a UAV is usually constrained by battery capacity due to limited payload. On the other hand, the development of wireless charging technology has allowed UAVs to replenish energy from charging stations.In this paper, we study a UAV routing problem in the presence of multiple charging stations (URPMCS) with the objective of minimizing the total distance traveled by the UAV during traffic monitoring. We present a deep reinforcement learning based method, where a multi-head heterogeneous attention mechanism is designed to facilitate learning a policy that automatically and sequentially constructs the route, while taking the energy consumption into account. In our method, two types of attentions are leveraged to learn the relations between monitoring targets and charging station nodes, adopting an encoder-decoder-like policy network. Moreover, we also employ a curriculum learning strategy to enhance generalization to different numbers of charging stations. Computational results show that our method outperforms conventional algorithms with higher solution quality (except for exact methods such as Gurobi) and shorter runtime in general, and also exhibits strong generalized performance on problem instances with different distributions and sizes. | URI: | https://hdl.handle.net/10356/170774 | ISSN: | 0018-9545 | DOI: | 10.1109/TVT.2022.3232607 | Schools: | School of Computer Science and Engineering | Rights: | © 2022 IEEE. All rights reserved. | Fulltext Permission: | none | Fulltext Availability: | No Fulltext |
Appears in Collections: | SCSE Journal Articles |
SCOPUSTM
Citations
20
19
Updated on Mar 16, 2025
Page view(s)
135
Updated on Mar 20, 2025
Google ScholarTM
Check
Altmetric
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.