Optimal Volt/Var Control for Unbalanced Distribution Networks With Human-in-the-Loop Deep Reinforcement Learning

被引:3
|
作者
Sun, Xianzhuo [1 ,2 ]
Xu, Zhao [1 ,2 ]
Qiu, Jing [3 ]
Liu, Huichuan [3 ]
Wu, Huayi [1 ,2 ]
Tao, Yuechuan [4 ]
机构
[1] Hong Kong Polytech Univ, Dept Elect & Elect Engn, Hong Kong, Peoples R China
[2] Hong Kong Polytech Univ, Res Inst Smart Energy, Hong Kong, Peoples R China
[3] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW 2006, Australia
[4] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, Singapore
关键词
Voltage control; Distribution networks; Training; Reinforcement learning; Reactive power; Inverters; Human in the loop; Volt/Var control; human-in-the-loop; safe deep reinforcement learning; soft actor-critic; unbalanced distribution networks; VOLTAGE CONTROL; PENETRATION;
D O I
10.1109/TSG.2023.3337843
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposes a human-in-the-loop deep reinforcement learning (HL-DRL)-based VVC strategy to simultaneously reduce power losses, mitigate voltage violations and compensate for voltage unbalance in three-phase unbalanced distribution networks. Instead of fully trusting DRL actions made by deep neural networks, a human intervention module is proposed to modify dangerous actions that violate operation constraints during offline training. This module refers to well-designed human guidance rules based on voltage-reactive power sensitivities, which regulate PV reactive power to sequentially address local voltage violation and unbalance issues to obtain safe transitions. To efficiently and safely learn the optimal control policy from these training samples, a human-in-the-loop soft actor-critic (HL-SAC) solution method is then developed. Different from the standard SAC algorithm, an online switch mechanism between action exploration and human intervention is designed. The actor network loss function is modified to incorporate human guidance terms, which alleviates the inconsistency of the updating direction of actor and critic networks. A hybrid experience replay buffer including both dangerous and safe transitions is also used to facilitate the learning process towards human actions. Comparative simulation results on a modified IEEE 123-bus unbalanced distribution system demonstrate the effectiveness and superiority of the proposed method in voltage control.
引用
收藏
页码:2639 / 2651
页数:13
相关论文
共 50 条
  • [1] Volt-VAR Control in Power Distribution Systems with Deep Reinforcement Learning
    Wang, Wei
    Yu, Nanpeng
    Shi, Jie
    Gao, Yuanqi
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CONTROL, AND COMPUTING TECHNOLOGIES FOR SMART GRIDS (SMARTGRIDCOMM), 2019,
  • [2] Human-in-the-loop Reinforcement Learning
    Liang, Huanghuang
    Yang, Lu
    Cheng, Hong
    Tu, Wenzhe
    Xu, Mengjie
    [J]. 2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 4511 - 4518
  • [3] Optimal coordinated Volt/Var control in active distribution networks
    Azzouz, Maher A.
    El-Saadany, Ehab F.
    [J]. 2014 IEEE PES GENERAL MEETING - CONFERENCE & EXPOSITION, 2014,
  • [4] Two-Stage Volt/Var Control in Active Distribution Networks With Multi-Agent Deep Reinforcement Learning Method
    Sun, Xianzhuo
    Qiu, Jing
    [J]. IEEE TRANSACTIONS ON SMART GRID, 2021, 12 (04) : 2903 - 2912
  • [5] Two-Stage Deep Reinforcement Learning for Inverter-Based Volt-VAR Control in Active Distribution Networks
    Liu, Haotian
    Wu, Wenchuan
    [J]. IEEE TRANSACTIONS ON SMART GRID, 2021, 12 (03) : 2037 - 2047
  • [6] Two-Critic Deep Reinforcement Learning for Inverter-Based Volt-Var Control in Active Distribution Networks
    Liu, Qiong
    Guo, Ye
    Deng, Lirong
    Liu, Haotian
    Li, Dongyu
    Sun, Hongbin
    Huang, Wenqi
    [J]. IEEE TRANSACTIONS ON SUSTAINABLE ENERGY, 2024, 15 (03) : 1768 - 1781
  • [7] Optimal Use of Decentralized Methods for Volt/var Control in Distribution Networks
    Meyer, Marian
    Cramer, Moritz
    Goergens, Philipp
    Schnettler, Armin
    [J]. 2017 IEEE MANCHESTER POWERTECH, 2017,
  • [8] Model-augmented safe reinforcement learning for Volt-VAR control in power distribution networks
    Gao, Yuanqi
    Yu, Nanpeng
    [J]. APPLIED ENERGY, 2022, 313
  • [9] Volt-VAR Control in Active Distribution Networks Using Multi-Agent Reinforcement Learning
    Su, Shi
    Zhan, Haozhe
    Zhang, Luxi
    Xie, Qingyang
    Si, Ruiqi
    Dai, Yuxin
    Gao, Tianlu
    Wu, Linhan
    Zhang, Jun
    Shang, Lei
    [J]. ELECTRONICS, 2024, 13 (10)
  • [10] Consensus Multi-Agent Reinforcement Learning for Volt-VAR Control in Power Distribution Networks
    Gao, Yuanqi
    Wang, Wei
    Yu, Nanpeng
    [J]. IEEE TRANSACTIONS ON SMART GRID, 2021, 12 (04) : 3594 - 3604