DNN Assisted Projection Based Deep Reinforcement Learning for Safe Control of Distribution Grids

被引:2
|
作者
Zhang, Mengfan [1 ]
Guo, Guodong [2 ]
Zhao, Tianyang [1 ]
Xu, Qianwen [1 ]
机构
[1] KTH Royal Inst Technol, Elect Power & Energy Syst Div, S-11428 Stockholm, Sweden
[2] State Grid Econ & Technol Res Inst Co Ltd, Beijing 102209, Peoples R China
关键词
Deep reinforcement learning; safety; projection; deep neural network; inverter interfaced RESs; distribution grid; AUTONOMOUS VOLTAGE CONTROL; VOLT/VAR CONTROL;
D O I
10.1109/TPWRS.2023.3336614
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Deep reinforcement learning (DRL) is a promising solution for voltage control of distribution grids with high penetration of inverter-based renewable energy sources (RESs). Yet, when adopting the DRL-based control method, the safe and optimal operation of the system cannot be guaranteed at the same time, as the conventional DRL agent is not designed to solve the hard constraint problem. To address this challenge, this paper proposes a deep neural network (DNN) assisted projection based DRL method for safe control of distribution grids. First, a finite iteration projection algorithm is proposed to guarantee hard constraints by converting a non-convex optimization problem into a finite iteration problem. Next, a DNN assisted projection method is proposed to accelerate the calculation of projection and achieve the practical implementation of hard constraints in DRL problem. Finally, a DNN Projection embedded twin-delayed deep deterministic policy gradient (DPe-TD3) method is proposed to achieve optimal operation of distribution grids with guaranteed 100% safety of the distribution grid. The safety of the DRL training is guaranteed via the embedded Projection DNN in TD3 with participation in gradient return process, which could smoothly and effectively project the DRL agent actions into the feasible area, thus guaranteeing the safety of data driven control and the optimal operation at the same time. The case studies and comparisons are conducted in the IEEE 33 bus system to show the effectiveness of the proposed method.
引用
收藏
页码:5687 / 5698
页数:12
相关论文
共 50 条
  • [31] ProAPT: Projection of APTs with Deep Reinforcement Learning
    Dehghan, Motahareh
    Sadeghian, Babak
    Khosravian, Erfan
    Moghaddam, Alireza Sedighi
    Nooshi, Farshid
    ISECURE-ISC INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2025, 17 (01): : 25 - 41
  • [32] Stereoscopic Projection Policy Optimization Method Based on Deep Reinforcement Learning
    An, Jing
    Si, Guang-Ya
    Zhang, Lei
    Liu, Wei
    Zhang, Xue-Chao
    ELECTRONICS, 2022, 11 (23)
  • [33] Coordinated Volt/VAR Control in Distribution Networks Considering Demand Response via Safe Deep Reinforcement Learning
    Hua, Dong
    Peng, Fei
    Liu, Suisheng
    Lin, Qinglin
    Fan, Jiahui
    Li, Qian
    ENERGIES, 2025, 18 (02)
  • [34] Research on Distribution Network Topology Control Based on Deep Reinforcement Learning Combinatorial Optimization
    Yan D.
    Peng G.
    Gao H.
    Chen S.
    Zhou Y.
    Dianwang Jishu/Power System Technology, 2022, 46 (07): : 2547 - 2554
  • [35] Deep Reinforcement Learning for Direct Load Control in Distribution Networks
    Bahrami, Shahab
    Chen, Yu Christine
    Wong, Vincent W. S.
    2020 IEEE POWER & ENERGY SOCIETY GENERAL MEETING (PESGM), 2020,
  • [36] Constrained EV Charging Scheduling Based on Safe Deep Reinforcement Learning
    Li, Hepeng
    Wan, Zhiqiang
    He, Haibo
    IEEE TRANSACTIONS ON SMART GRID, 2020, 11 (03) : 2427 - 2439
  • [37] Online Microgrid Energy Management Based on Safe Deep Reinforcement Learning
    Li, Hepeng
    Wang, Zhenhua
    Li, Lusi
    He, Haibo
    2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
  • [38] Safe Reinforcement Learning via Projection on a Safe Set: How to Achieve Optimality?
    Gros, Sebastien
    Zanon, Mario
    Bemporad, Alberto
    IFAC PAPERSONLINE, 2020, 53 (02): : 8076 - 8081
  • [39] Probabilistic Guarantees for Safe Deep Reinforcement Learning
    Bacci, Edoardo
    Parker, David
    FORMAL MODELING AND ANALYSIS OF TIMED SYSTEMS, FORMATS 2020, 2020, 12288 : 231 - 248
  • [40] Skill Reward for Safe Deep Reinforcement Learning
    Cheng, Jiangchang
    Yu, Fumin
    Zhang, Hongliang
    Dai, Yinglong
    UBIQUITOUS SECURITY, 2022, 1557 : 203 - 213