Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/180258
Title: | Multi-dimensional resource management with deep deterministic policy gradient for digital twin-enabled Industrial Internet of Things in 6 generation | Authors: | Hu, Yue Cao, Ning Lu, Hao Jiang, Yunzhe Liu, Yinqiu He, Xiaoming |
Keywords: | Computer and Information Science | Issue Date: | 2024 | Source: | Hu, Y., Cao, N., Lu, H., Jiang, Y., Liu, Y. & He, X. (2024). Multi-dimensional resource management with deep deterministic policy gradient for digital twin-enabled Industrial Internet of Things in 6 generation. Transactions On Emerging Telecommunications Technologies, 35(4), e4962-. https://dx.doi.org/10.1002/ett.4962 | Journal: | Transactions on Emerging Telecommunications Technologies | Abstract: | In the era of sixth generation mobile networks (6G), industrial big data is rapidly generated due to the increasing data-driven applications in the Industrial Internet of Things (IIoT). Effectively processing such data, for example, knowledge learning, on resource-limited IIoT devices becomes a challenge. To this end, we introduce a cloud-edge-end collaboration architecture, in which computing, communication, and storage resources are flexibly coordinated to alleviate the issue of resource constraints. To achieve better performance in hyper-connected experience, real-time communication, and sustainable computing, we construct a novel architecture combining digital twin (DT)-IIoT with edge networks. In addition, considering the energy consumption and delay issues in distributed learning, we propose a deep reinforcement learning-based method called deep deterministic policy gradient with double actors and double critics (D4PG) to manage the multi-dimensional resources, that is, CPU cycles, DT models, and communication bandwidths, enhancing the exploration ability and improving the inaccurate value estimation of agents in continuous action spaces. In addition, we introduce a synchronization threshold for distributed learning framework to avoid the synchronization latency caused by stragglers. Extensive experimental results prove that the proposed architecture can efficiently conduct knowledge learning, and the intelligent scheme can also improve system efficiency by managing multi-dimensional resources. | URI: | https://hdl.handle.net/10356/180258 | ISSN: | 2161-3915 | DOI: | 10.1002/ett.4962 | Schools: | School of Computer Science and Engineering | Rights: | © 2024 John Wiley & Sons, Ltd. All rights reserved. | Fulltext Permission: | none | Fulltext Availability: | No Fulltext |
Appears in Collections: | SCSE Journal Articles |
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.