Coordinated Active Power-Frequency Control Based on Safe Deep Reinforcement Learning

被引:0
|
作者
Zhou Y. [1 ]
Zhou L. [1 ]
Shi D. [2 ]
Zhao X. [2 ]
Shan X. [3 ]
机构
[1] State Grid East China Branch, Shanghai
[2] AINERGY, Santa Clara
[3] NAR1 Technology Development Co., Ltd., Nanjing
关键词
agent; artificial intelligence (AI); constrained Markov decision process; coordinated power and frequency control; safe deep reinforcement learning;
D O I
10.16183/j.cnki.jsjtu.2022.358
中图分类号
学科分类号
摘要
The continuous increase in renewables penetration poses a severe challenge to the frequency control of interconnected power grid. Since the conventional automatic generation control (AGO strategy does not consider the power flow constraints of the network, the traditional approach is to make tentative generator power adjustments based on expert knowledge and experience, which is time consuming. The optimal power flow-based AGC optimization model has a long solution time and convergence issues due to its non-convexity and large size. Deep reinforcement learning has the advantage of "offline training and online end-to-end strategy formation", which yet cannot ensure the security of artificial intelligence (AD in power grid applications. A coordinated optimal control method is proposed for active power and frequency control based on safe deep reinforcement learning. First, the method models the frequency control problem as a constrained Markov decision process, and an agent is designed by considering various safety constraints. Then, the agent is trained using the example of East China Power Grid through continuous interactions with the grid. Finally, the effect of the agent and the conventional AGC strategy is compared. The results show that the proposed approach can quickly generate control strategies under various operating conditions, and can assist dispatchers to make decisions online. © 2024 Shanghai Jiaotong University. All rights reserved.
引用
收藏
页码:682 / 692
页数:10
相关论文
共 25 条
  • [1] ZHOU Yi, QIAN Xuedong, Applications analysis of dynamic ACE in East China power grid, Automation of Electric Power Systems, 34, 8, pp. 106-110, (2010)
  • [2] LI Zhaowei, WU Xuelian, ZHUANG Kanqin, Et al., Analysis and reflection on frequency characteristics of East China grid after bipolar locking of "9 • 19" Jinping-Sunan DC transmission line, Automation of Electric Power Systems, 41, 7, pp. 149-155, (2017)
  • [3] YAN Wei, WANG Cong, MAO Yanli, Et al., AGC optimal model based on OPF technology for interconnected power grid, Power System Protection & Control, 43, 23, pp. 35-40, (2015)
  • [4] LIU Yang, XIA Tian, WANG Yang, On-line generation method of active power coordinated control strategy for multiple transmission sections in regional power grid, Electric Power Automation Equipment, 40, 7, pp. 204-210, (2020)
  • [5] JIA Hongjie, MU Yunfei, YU Xiaodan, Directional control method to interface power based on DC power flow and sensitivity, Automation of Electric Power Systems, 34, 2, pp. 34-38, (2010)
  • [6] ZHAO Jianning, XU Wuxiang, YANG Qiang, Design and implementation of active power control for tie lines based on automatic generation control, Automation of Electric Power Systems, 30, 22, pp. 85-88, (2006)
  • [7] SILVER D, SCHRITTWIESER J, SIMON YAN K, Et al., Mastering the game of Go without human knowledge, Nature, 550, 7676, pp. 354-359, (2017)
  • [8] SILVER D, SINGH S, PRECUP D, Et al., Reward is enough, Artificial Intelligence, 299, (2021)
  • [9] LIU Wei, ZHANG Dongxia, WANG Xinying, Et al., A decision making strategy for generating unit tripping under emergency circumstances based on deep reinforcement learning, Proceedings of the CSEE, 38, 1, pp. 109-119, (2018)
  • [10] WANG Tianjing, TANG Yong, GUO Qiang, Et al., Automatic adjustment method of power flow calculation convergence for large-scale power grid based on knowledge experience and deep reinforcement learning, Proceedings of the CSEE, 40, 8, pp. 2396-2405, (2020)