Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/164664
Title: A double-deck deep reinforcement learning-based energy dispatch strategy for an integrated electricity and district heating system embedded with thermal inertial and operational flexibility
Authors: Zhang, Bin
Ghias, Amer M. Y. M.
Chen, Zhe
Keywords: Engineering::Electrical and electronic engineering
Issue Date: 2022
Source: Zhang, B., Ghias, A. M. Y. M. & Chen, Z. (2022). A double-deck deep reinforcement learning-based energy dispatch strategy for an integrated electricity and district heating system embedded with thermal inertial and operational flexibility. Energy Reports, 8, 15067-15080. https://dx.doi.org/10.1016/j.egyr.2022.11.028
Project: AcRF TIER 1 RG50/21
Journal: Energy Reports
Abstract: With the high penetration of wind power connected to the integrated electricity and district heating systems (IEDHSs), wind power curtailment still inevitably occurs in the traditional IEDHS dispatch. Focusing on the flexibilities of the IEDHS is considered to be a beneficial solution to further promote the integration of wind power. In the district heating network, the thermal inertia is utilized to improve such flexibility. Therefore, an IEDHS dispatch model considering the thermal inertia of district heating network and operational flexibility of generators is proposed in this paper. In addition, to avoid the tendency of traditional reinforcement learning (RL) to fall into local optimality when solving high-dimensional problems, a double-deck deep RL (D3RL) framework is proposed in this study. D3RL combines with a deep deterministic policy gradient (DDPG) agent in the upper level and a conventional optimization solver in the lower level to simplify the action and reward design. In the simulation, the proposed model considering the transmission time delay characteristics of the district heating network and the operational flexibility of generators is verified in four scheduling scenarios. Besides, the superiority of the proposed D3RL method is validated in a larger IEDHS. Numerical results show that the considered scheduling model can use the heat storage characteristics of heating pipelines, reduce operating costs, improve the operational flexibility and encourage wind power utilization. Compared with traditional RL, the proposed optimization method can improve its training speed and convergence performance.
URI: https://hdl.handle.net/10356/164664
ISSN: 2352-4847
DOI: 10.1016/j.egyr.2022.11.028
Schools: School of Electrical and Electronic Engineering 
Rights: © 2022 Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
Fulltext Permission: open
Fulltext Availability: With Fulltext
Appears in Collections:EEE Journal Articles

Files in This Item:
File Description SizeFormat 
1-s2.0-S2352484722024143-main.pdf3.27 MBAdobe PDFThumbnail
View/Open

SCOPUSTM   
Citations 50

7
Updated on Feb 29, 2024

Page view(s)

80
Updated on Feb 28, 2024

Download(s) 50

29
Updated on Feb 28, 2024

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.