Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/177989
Title: Reinforcement learning for blast furnace ironmaking operation with safety and partial observation considerations
Authors: Jiang, Ke
Jiang, Zhaohui
Jiang, Xudong
Xie, Yongfang
Gui, Weihua
Keywords: Engineering
Issue Date: 2024
Source: Jiang, K., Jiang, Z., Jiang, X., Xie, Y. & Gui, W. (2024). Reinforcement learning for blast furnace ironmaking operation with safety and partial observation considerations. IEEE Transactions On Neural Networks and Learning Systems, 35(3), 3077-3090. https://dx.doi.org/10.1109/TNNLS.2023.3340741
Journal: IEEE Transactions on Neural Networks and Learning Systems 
Abstract: Making proper decision online in complex environment during the blast furnace (BF) operation is a key factor in achieving long-term success and profitability in the steel manufacturing industry. Regulatory lags, ore source uncertainty, and continuous decision requirement make it a challenging task. Recently, reinforcement learning (RL) has demonstrated state-of-the-art performance in various sequential decision-making problems. However, the strict safety requirements make it impossible to explore optimal decisions through online trial and error. Therefore, this article proposes a novel offline RL approach designed to ensure safety, maximize return, and address issues of partially observed states. Specifically, it utilizes an off-policy actor-critic framework to infer the optimal decision from expert operation trajectories. The "actor" in this framework is jointly trained by the supervision and evaluation signals to make decision with low risk and high return. Furthermore, we investigate a recurrent version of the actor and critic networks to better capture the complete observations, which solves the partially observed Markov decision process (POMDP) arising from sensor limitations. Verification within the BF smelting process demonstrates the improvements of the proposed algorithm in performance, i.e., safety and return.
URI: https://hdl.handle.net/10356/177989
ISSN: 2162-237X
DOI: 10.1109/TNNLS.2023.3340741
Schools: School of Electrical and Electronic Engineering 
Rights: © 2024 IEEE. All rights reserved.
Fulltext Permission: none
Fulltext Availability: No Fulltext
Appears in Collections:EEE Journal Articles

SCOPUSTM   
Citations 50

3
Updated on Mar 11, 2025

Page view(s)

92
Updated on Mar 15, 2025

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.