Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/177989
Title: | Reinforcement learning for blast furnace ironmaking operation with safety and partial observation considerations | Authors: | Jiang, Ke Jiang, Zhaohui Jiang, Xudong Xie, Yongfang Gui, Weihua |
Keywords: | Engineering | Issue Date: | 2024 | Source: | Jiang, K., Jiang, Z., Jiang, X., Xie, Y. & Gui, W. (2024). Reinforcement learning for blast furnace ironmaking operation with safety and partial observation considerations. IEEE Transactions On Neural Networks and Learning Systems, 35(3), 3077-3090. https://dx.doi.org/10.1109/TNNLS.2023.3340741 | Journal: | IEEE Transactions on Neural Networks and Learning Systems | Abstract: | Making proper decision online in complex environment during the blast furnace (BF) operation is a key factor in achieving long-term success and profitability in the steel manufacturing industry. Regulatory lags, ore source uncertainty, and continuous decision requirement make it a challenging task. Recently, reinforcement learning (RL) has demonstrated state-of-the-art performance in various sequential decision-making problems. However, the strict safety requirements make it impossible to explore optimal decisions through online trial and error. Therefore, this article proposes a novel offline RL approach designed to ensure safety, maximize return, and address issues of partially observed states. Specifically, it utilizes an off-policy actor-critic framework to infer the optimal decision from expert operation trajectories. The "actor" in this framework is jointly trained by the supervision and evaluation signals to make decision with low risk and high return. Furthermore, we investigate a recurrent version of the actor and critic networks to better capture the complete observations, which solves the partially observed Markov decision process (POMDP) arising from sensor limitations. Verification within the BF smelting process demonstrates the improvements of the proposed algorithm in performance, i.e., safety and return. | URI: | https://hdl.handle.net/10356/177989 | ISSN: | 2162-237X | DOI: | 10.1109/TNNLS.2023.3340741 | Schools: | School of Electrical and Electronic Engineering | Rights: | © 2024 IEEE. All rights reserved. | Fulltext Permission: | none | Fulltext Availability: | No Fulltext |
Appears in Collections: | EEE Journal Articles |
SCOPUSTM
Citations
50
3
Updated on Mar 11, 2025
Page view(s)
92
Updated on Mar 15, 2025
Google ScholarTM
Check
Altmetric
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.