SMA-PDPPO: Safe Multiagent Primal-Dual Deep Reinforcement Learning for Industrial Parks Energy Trading

被引:0
|
作者
Lu, Renzhi [1 ,2 ]
Wu, Ning [3 ]
Yang, Tao [4 ]
Chen, Ying [5 ]
Sun, Mingyang [6 ,7 ]
Wang, Dong [8 ,9 ]
Peng, Xin [10 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Key Lab Image Proc & Intelligent Control, Wuhan 430074, Peoples R China
[2] Minist Educ, Key Lab Syst Control & Informat Proc, Shanghai 200240, Peoples R China
[3] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan 430074, Peoples R China
[4] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China
[5] Tsinghua Univ, Elect Engn, Beijing 100084, Peoples R China
[6] Peking Univ, Coll Engn, Dept Ind Engn & Management, Beijing 100091, Peoples R China
[7] Imperial Coll London, Dept Elect & Elect Engn, London SW7 2AZ, England
[8] Dalian Univ Technol, Key Lab Intelligent Control & Optimizat Ind Equipm, Minist Educ, Dalian 116024, Peoples R China
[9] Dalian Univ Technol, Sch Control Sci & Engn, Dalian 116024, Peoples R China
[10] East China Univ Sci & Technol, Key Lab Smart Mfg Energy Chem Proc, Minist Educ, Shanghai 200237, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep reinforcement learning (DRL); electricity market; energy management; energy trading; industrial park; DEMAND RESPONSE;
D O I
10.1109/TII.2024.3514128
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Energy trading in industrial parks has great potential for reducing carbon emissions and lowering energy bills. This article proposes a safe multiagent deep reinforcement learning algorithm for optimizing the energy trading strategy in industrial parks to achieve less reliance on the main grid and save energy costs. Specifically, an industrial park that contains multiple industrial users with both thermal and electrical load requirements is considered, in which the different users can trade energy with each other and with the main grid based on their own strategies. Unlike the existing studies, the time-phased energy trading problem is transformed into a constrained partially observable Markov game, which models the industrial users and objectives of the buyers and sellers. Finally, a novel multiagent primal-dual proximal policy optimization algorithm that guarantees safety is developed to achieve the optimal trading strategies between the main grid and multiple users. Numerical simulations with real-world data demonstrate that the proposed algorithm allows higher total revenue for sellers and lower total costs for buyers in the park, limits each user's bid or offer to a relatively safe range, and increases the amount of electricity traded locally, while reducing trading with the grid.
引用
收藏
页码:2640 / 2649
页数:10
相关论文
共 10 条
  • [1] Interpreting Primal-Dual Algorithms for Constrained Multiagent Reinforcement Learning
    Tabas, Daniel
    Zamzam, Ahmed S.
    Zhang, Baosen
    LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
  • [2] Safe Policies for Reinforcement Learning via Primal-Dual Methods
    Paternain, Santiago
    Calvo-Fullana, Miguel
    Chamon, Luiz F. O.
    Ribeiro, Alejandro
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (03) : 1321 - 1336
  • [3] Accelerated Primal-Dual Deep Reinforcement Learning for Efficient Energy Management of Hybrid Electric Vehicles
    Shaik, Jewaliddin
    Karri, Sri Phani Krishna
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024, : 5525 - 5540
  • [4] A projected primal-dual gradient optimal control method for deep reinforcement learning
    Simon Gottschalk
    Michael Burger
    Matthias Gerdts
    Journal of Mathematics in Industry, 10
  • [5] A projected primal-dual gradient optimal control method for deep reinforcement learning
    Gottschalk, Simon
    Burger, Michael
    Gerdts, Matthias
    JOURNAL OF MATHEMATICS IN INDUSTRY, 2020, 10 (01)
  • [6] Primal-Dual Deep Reinforcement Learning for Periodic Coverage-Assisted UAV Secure Communications
    Qin, Yunhui
    Xing, Zhifang
    Li, Xulong
    Zhang, Zhongshan
    Zhang, Haijun
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (12) : 19641 - 19652
  • [7] Real-Time Optimal Power Flow Method via Safe Deep Reinforcement Learning Based on Primal-Dual and Prior Knowledge Guidance
    Wu, Pengfei
    Chen, Chen
    Lai, Dexiang
    Zhong, Jian
    Bie, Zhaohong
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2025, 40 (01) : 597 - 611
  • [8] Indoor Periodic Fingerprint Collections by Vehicular Crowdsensing via Primal-Dual Multi-Agent Deep Reinforcement Learning
    Yang, Haoming
    Zhao, Qiran
    Wang, Hao
    Liu, Chi Harold
    Li, Guozheng
    Wang, Guoren
    Tang, Jian
    Wu, Dapeng
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2024, 42 (10) : 2625 - 2641
  • [9] Energy-Efficient Computation Offloading Based on Multiagent Deep Reinforcement Learning for Industrial Internet of Things Systems
    Chouikhi, Samira
    Esseghir, Moez
    Merghem-Boulahia, Leila
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (07) : 12228 - 12239
  • [10] Dual-attention assisted deep reinforcement learning algorithm for energy-efficient resource allocation in Industrial Internet of Things
    Wang, Ying
    Shang, Fengjun
    Lei, Jianjun
    Zhu, Xiangwei
    Qin, Haoming
    Wen, Jiayu
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 142 : 150 - 164