Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/180258
Title: Multi-dimensional resource management with deep deterministic policy gradient for digital twin-enabled Industrial Internet of Things in 6 generation
Authors: Hu, Yue
Cao, Ning
Lu, Hao
Jiang, Yunzhe
Liu, Yinqiu
He, Xiaoming
Keywords: Computer and Information Science
Issue Date: 2024
Source: Hu, Y., Cao, N., Lu, H., Jiang, Y., Liu, Y. & He, X. (2024). Multi-dimensional resource management with deep deterministic policy gradient for digital twin-enabled Industrial Internet of Things in 6 generation. Transactions On Emerging Telecommunications Technologies, 35(4), e4962-. https://dx.doi.org/10.1002/ett.4962
Journal: Transactions on Emerging Telecommunications Technologies
Abstract: In the era of sixth generation mobile networks (6G), industrial big data is rapidly generated due to the increasing data-driven applications in the Industrial Internet of Things (IIoT). Effectively processing such data, for example, knowledge learning, on resource-limited IIoT devices becomes a challenge. To this end, we introduce a cloud-edge-end collaboration architecture, in which computing, communication, and storage resources are flexibly coordinated to alleviate the issue of resource constraints. To achieve better performance in hyper-connected experience, real-time communication, and sustainable computing, we construct a novel architecture combining digital twin (DT)-IIoT with edge networks. In addition, considering the energy consumption and delay issues in distributed learning, we propose a deep reinforcement learning-based method called deep deterministic policy gradient with double actors and double critics (D4PG) to manage the multi-dimensional resources, that is, CPU cycles, DT models, and communication bandwidths, enhancing the exploration ability and improving the inaccurate value estimation of agents in continuous action spaces. In addition, we introduce a synchronization threshold for distributed learning framework to avoid the synchronization latency caused by stragglers. Extensive experimental results prove that the proposed architecture can efficiently conduct knowledge learning, and the intelligent scheme can also improve system efficiency by managing multi-dimensional resources.
URI: https://hdl.handle.net/10356/180258
ISSN: 2161-3915
DOI: 10.1002/ett.4962
Schools: School of Computer Science and Engineering 
Rights: © 2024 John Wiley & Sons, Ltd. All rights reserved.
Fulltext Permission: none
Fulltext Availability: No Fulltext
Appears in Collections:SCSE Journal Articles

Page view(s)

48
Updated on Jan 16, 2025

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.