Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/154224
Title: Transforming cooling optimization for green data center via deep reinforcement learning
Authors: Li, Yuanlong
Wen, Yonggang
Tao, Dacheng
Guan, Kyle
Keywords: Engineering::Computer science and engineering
Issue Date: 2020
Source: Li, Y., Wen, Y., Tao, D. & Guan, K. (2020). Transforming cooling optimization for green data center via deep reinforcement learning. IEEE Transactions on Cybernetics, 50(5), 2002-2013. https://dx.doi.org/10.1109/TCYB.2019.2927410
Journal: IEEE Transactions on Cybernetics
Abstract: Data centers (DCs) play an important role in supporting services such as e-commerce and cloud computing. The resulting energy consumption from this growing market has drawn significant attention, and notably almost half of the energy cost is used to cool the DC to a particular temperature. It is thus a critical operational challenge to curb the cooling energy cost without sacrificing the thermal safety of a DC. The existing solutions typically follow a two-step approach, in which the system is first modeled based on expert knowledge and the operational actions are then determined with heuristics and/or best practices. These approaches are often hard to generalize and might result in suboptimal performance due to intrinsic model errors for large-scale systems. In this paper, we propose optimizing the DC cooling control via the emerging deep reinforcement learning (DRL) framework. Compared to the existing approaches, our solution lends itself to an end-to-end cooling control algorithm (CCA) via an off-policy offline version of the deep deterministic policy gradient (DDPG) algorithm, in which an evaluation network is trained to predict the DC energy cost along with resulting cooling effects, and a policy network is trained to gauge optimized control settings. Moreover, we introduce a de-underestimation (DUE) validation mechanism for the critic network to reduce the potential underestimation of the risk caused by neural approximation. Our proposed algorithm is evaluated on an EnergyPlus simulation platform and on a real data trace collected from the National Super Computing Centre (NSCC) of Singapore. The numerical results show that the proposed CCA can achieve up to 11% cooling cost reduction on the simulation platform compared with a manually configured baseline control algorithm. In the trace-based study of conservative nature, the proposed algorithm can achieve about 15% cooling energy savings on the NSCC data trace. Our pioneering approach can shed new light on the application of DRL to optimize and automate DC operations and management, potentially revolutionizing digital infrastructure management with intelligence.
URI: https://hdl.handle.net/10356/154224
ISSN: 2168-2267
DOI: 10.1109/TCYB.2019.2927410
Rights: © 2019 IEEE. All rights reserved.
Fulltext Permission: none
Fulltext Availability: No Fulltext
Appears in Collections: SCSE Journal Articles

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.