Reactive Power Optimization Based on Proximal Policy Optimization of Deep Reinforcement Learning

被引:0
|
作者
Zahng P. [1 ]
Zhu Z. [1 ]
Xie H. [1 ]
机构
[1] School of Electrical Engineering, Beijing Jiaotong University, Haidian District, Beijing
来源
关键词
data driving; deep reinforcement learning; new power system; proximal policy optimization; reactive power optimization;
D O I
10.13335/j.1000-3673.pst.2022.0728
中图分类号
学科分类号
摘要
The fluctuations of renewable energies and loads pose a great challenge to reactive power optimization. Considering the time-varying characteristics of new energies and loads, the reactive power optimization problem is constructed as a reinforcement learning problem. The method of constraint-target division and target presupposition is proposed to design a reward function, and the proximal policy optimization algorithm is used to solve the reinforcement learning problem, getting the reactive power optimization policy. A case study is carried out with the modified IEEE39 system, and the results show that the proposed reward function can improve the convergence speed of the agent. The reactive power optimization strategy based on reinforcement learning is supirior to the traditional deterministic optimization algorithm in decision-making effects and decision-making time. © 2023 Power System Technology Press. All rights reserved.
引用
收藏
页码:562 / 570
页数:8
相关论文
共 35 条
  • [21] NIE Huanhuan, ZHANG Jiaqi, CHEN Ying, Real-time economic dispatch of community integrated energy system based on a double-layer reinforcement learning method[J], Power System Technology, 45, 4, pp. 1330-1336, (2021)
  • [22] DIAO Ruisheng, WANG Zhiwei, SHI Di, Autonomous voltage control for grid operation using deep reinforcement learning[C], 2019 IEEE Power & Energy Society General Meeting (PESGM), pp. 1-5, (2019)
  • [23] TOUBEAU J F, ZAD B B,, HUPEZ M, Deep reinforcement learning-based voltage control to deal with model uncertainties in distribution networks[J], Energies, 13, 15, (2020)
  • [24] Qi LI, QIAO Ying, ZHANG Yujing, Continuous reactive power optimization of distribution network using deep reinforcement learning[J], Power System Technology, 44, 4, pp. 1473-1480, (2020)
  • [25] NI Shuang, CUI Chenggang, YANG Ning, Multi-time-scale online optimization for reactive power of distribution network based on deep reinforcement learning[J], Automation of Electric Power Systems, 45, 10, pp. 77-85, (2021)
  • [26] (2018)
  • [27] WANG Yanpin, Grid voltage control method based on generator reactive power regulation using reinforcement learning[C], 2020 IEEE/IAS Industrial and Commercial Power System Asia (I&CPS Asia), (2020)
  • [28] YANG Qiuling, WANG Gang, SADEGHI A, Two-timescale voltage control in distribution grids using deep reinforcement learning[J], IEEE Transactions on Smart Grid, 11, 3, pp. 2313-2323, (2020)
  • [29] WANG Shengyi, DUAN Jiajun, SHI Di, A data-driven multi-agent autonomous voltage control framework using deep reinforcement learning[J], IEEE Transactions on Power Systems, 35, 6, pp. 4644-4654, (2020)
  • [30] Linfei YIN, Shikui LUO, WANG Yaoxiong, Coordinated complex-valued encoding dragonfly algorithm and artificial emotional reinforcement learning for coordinated secondary voltage control and automatic voltage regulation in multi-generator power systems[J], IEEE Access, 8, pp. 180520-180533, (2020)