Multi-timescale Deep Reinforcement Learning for Reactive Power Optimization of Distribution Network

被引:0
|
作者
Hu D. [1 ]
Peng Y. [1 ]
Wei W. [1 ]
Xiao T. [1 ]
Cai T. [2 ]
Xi W. [2 ]
机构
[1] Zhejiang University, Zhejiang Province, Hangzhou
[2] Digital Grid Research Institute, China Southern Power Grid, Guangdong Province, Guangzhou
关键词
data-driven; deep reinforcement learning; Markov process; multi-agent; voltage control;
D O I
10.13334/j.0258-8013.pcsee.213110
中图分类号
学科分类号
摘要
With the access of high proportion distributed generation, distributed network faces tremendous challenges in dealing with uncertainty and coordinating a variety of reactive power compensation equipment. This paper presented a multi-timescale voltage regulation strategy based on a mathematical optimization model and data-driven method. First, for the online tap changer and switching capacitor with slow-timescale regulation, Aiming to minimize active power loss, the day ahead reactive power and voltage optimization model were proposed based on mixed-integer second-order cone programming. Second, to meet the real-time requirements on the fast timescale stage, an intraday real-time scheduling method based on multi-agent reinforcement learning was proposed, transforming the real-time reactive power optimization problem into a Markov game process and adopting a centralized training and decentralized execution framework. Compared with traditional methods, this method has low communication cost, better real-time performance, and does not rely on an accurate power flow model. Finally, the effectiveness of the proposed strategy is verified by an IEEE 33-bus example. © 2022 Chin.Soc.for Elec.Eng.
引用
收藏
页码:5034 / 5044
页数:10
相关论文
共 26 条
  • [1] CHEN Hong, Reactive power compensation optimization of distribution network including distributed power supply [J], Information Technology, 44, 11, pp. 132-136, (2020)
  • [2] ZHANG Yujing, QIAO Ying, LU Zongxiang, Et al., Voltage control for partially visible distribution networks with high DG penetration[J], Power System Technology, 43, 5, pp. 1528-1535, (2019)
  • [3] LI Peishuai, WU Zaijun, ZHANG Cuo, Distributed hybrid-timescale voltage/var control in active distribution networks[J], Automation of Electric Power Systems, 45, 16, pp. 160-168, (2021)
  • [4] ZHOU Wei, HU Shubo, SUN Hui, Interval nonlinear economic dispatch in large scale wind power integrated system[J], Proceedings of the CSEE, 37, 2, pp. 557-563, (2017)
  • [5] ZHENG Neng, DING Xiaoqun, GUAN Zhicheng, Coordinated optimization of active power and reactive power in distribution network based on scenario method [J], Power System Technology, 43, 5, pp. 1640-1651, (2019)
  • [6] ZHENG Neng, DING Xiaoqun, ZHENG Chengtuo, Multi-objective coordinated optimization of active and reactive power for distribution network integrated with high proportion of photovoltaic generation[J], Automation of Electric Power Systems, 42, 6, pp. 33-39, (2018)
  • [7] YU Lin, SUN Ying, XU Ran, Improved particle swarm optimization algorithm and its application in reactive power partitioning of power grid[J], Automation of Electric Power Systems, 41, 3, pp. 89-95, (2017)
  • [8] GAO Hongjun, LIU Junyong, SHEN Xiaodong, Optimal power flow research in active distribution network and its application examples[J], Proceedings of the CSEE, 37, 6, pp. 1634-1644, (2017)
  • [9] FANG Sidun, CHENG Haozhong, XU Guodong, Reviews on stochastic optimal power flow and its application[J], Electric Power Automation Equipment, 36, 11, pp. 1-10, (2016)
  • [10] REN Jiayi, GU Wei, WANG Yong, Multi-time scale active and reactive power coordinated optimal dispatch in active distribution network based on model predictive control[J], Proceedings of the CSEE, 38, 5, pp. 1397-1407, (2018)