An Augmented Lagrangian-Based Safe Reinforcement Learning Algorithm for Carbon-Oriented Optimal Scheduling of EV Aggregators

被引:0
|
作者
Shi, Xiaoying [1 ]
Xu, Yinliang [1 ]
Chen, Guibin [1 ]
Guo, Ye [1 ]
机构
[1] Tsinghua Univ, Tsinghua Berkeley Shenzhen Inst, Tsinghua Shenzhen Int Grad Sch, Shenzhen 518055, Peoples R China
关键词
Carbon dioxide; Costs; Optimal scheduling; Markov processes; Energy states; Distribution networks; Carbon; Carbon emission mitigation; deep reinforcement learning; aggregator; carbon flow; augmented Lagrangian function; DEMAND; GENERATION; ENERGY;
D O I
10.1109/TSG.2023.3289211
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposes an augmented Lagrangian-based safe off-policy deep reinforcement learning (DRL) algorithm for the carbon-oriented optimal scheduling of electric vehicle (EV) aggregators in a distribution network. First, practical charging data are employed to formulate an EV aggregation model, and its flexibility in both emission mitigation and energy/power dispatching is demonstrated. Second, a bilevel optimization model is formulated for EV aggregators to participate in day-ahead optimal scheduling, which aims to minimize the total cost without exceeding the given carbon cap. Third, to tackle the nonlinear coupling between the carbon flow and power flow, a bilevel model with a carbon cap constraint is formed as a constrained Markov decision process (CMDP). Finally, the CMDP is efficiently solved by the proposed augmented Lagrangian-based DRL algorithm featuring the soft actor-critic (SAC) method. Comprehensive numerical studies with IEEE distribution test feeders demonstrate that the proposed approach can achieve a fine tradeoff between cost and emission mitigation with a higher computation efficiency compared with the existing DRL methods.
引用
收藏
页码:795 / 809
页数:15
相关论文
共 9 条
  • [1] Augmented Lagrangian-Based Reinforcement Learning for Network Slicing in IIoT
    Qi, Qi
    Lin, Wenbin
    Guo, Boyang
    Chen, Jinshan
    Deng, Chaoping
    Lin, Guodong
    Sun, Xin
    Chen, Youjia
    [J]. ELECTRONICS, 2022, 11 (20)
  • [2] Constrained EV Charging Scheduling Based on Safe Deep Reinforcement Learning
    Li, Hepeng
    Wan, Zhiqiang
    He, Haibo
    [J]. IEEE TRANSACTIONS ON SMART GRID, 2020, 11 (03) : 2427 - 2439
  • [3] Safe reinforcement learning based optimal low-carbon scheduling strategy for multi-energy system
    Jiang, Fu
    Chen, Jie
    Rong, Jieqi
    Liu, Weirong
    Li, Heng
    Peng, Hui
    [J]. SUSTAINABLE ENERGY GRIDS & NETWORKS, 2024, 39
  • [4] Decentralized collaborative optimal scheduling for EV charging stations based on multi-agent reinforcement learning
    Li, Hang
    Han, Bei
    Li, Guojie
    Wang, Keyou
    Xu, Jin
    Khan, Muhammad Waseem
    [J]. IET GENERATION TRANSMISSION & DISTRIBUTION, 2024, 18 (06) : 1172 - 1183
  • [5] A reinforcement learning based mobile charging sequence scheduling algorithm for optimal sensing coverage in wireless rechargeable sensor networks
    Li, Jinglin
    Wang, Haoran
    Xiao, Wendong
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2024, 15 (06) : 2869 - 2881
  • [6] Multi-Time-Scale Optimal Scheduling Strategy for Marine Renewable Energy Based on Deep Reinforcement Learning Algorithm
    Xu, Ren
    Lin, Fei
    Shao, Wenyi
    Wang, Haoran
    Meng, Fanping
    Li, Jun
    [J]. ENTROPY, 2024, 26 (04)
  • [7] Carbon Dioxide Emission Reduction-Oriented Optimal Control of Traffic Signals in Mixed Traffic Flow Based on Deep Reinforcement Learning
    Wang, Zhaowei
    Xu, Le
    Ma, Jianxiao
    [J]. SUSTAINABILITY, 2023, 15 (24)
  • [8] A new resource-constrained project scheduling problem with ladder-type carbon trading prices and its algorithm based on deep reinforcement learning
    Liu, Hao
    Zhang, Jingwen
    Zhang, Xinyue
    Chen, Zhi
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 255
  • [9] Optimal Bidding and Operation of a Power Plant with Solvent-Based Carbon Capture under a CO2 Allowance Market: A Solution with a Reinforcement Learning-Based Sarsa Temporal-Difference Algorithm
    Li, Ziang
    Ding, Zhengtao
    Wang, Meihong
    [J]. ENGINEERING, 2017, 3 (02) : 257 - 265