SMA-PDPPO: Safe Multiagent Primal-Dual Deep Reinforcement Learning for Industrial Parks Energy Trading

被引：0

作者：

Lu, Renzhi ^{[1
,2
]}

Wu, Ning ^{[3
]}

Yang, Tao ^{[4
]}

Chen, Ying ^{[5
]}

Sun, Mingyang ^{[6
,7
]}

Wang, Dong ^{[8
,9
]}

Peng, Xin ^{[10
]}

机构：

[1] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Key Lab Image Proc & Intelligent Control, Wuhan 430074, Peoples R China

[2] Minist Educ, Key Lab Syst Control & Informat Proc, Shanghai 200240, Peoples R China

[3] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan 430074, Peoples R China

[4] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China

[5] Tsinghua Univ, Elect Engn, Beijing 100084, Peoples R China

[6] Peking Univ, Coll Engn, Dept Ind Engn & Management, Beijing 100091, Peoples R China

[7] Imperial Coll London, Dept Elect & Elect Engn, London SW7 2AZ, England

[8] Dalian Univ Technol, Key Lab Intelligent Control & Optimizat Ind Equipm, Minist Educ, Dalian 116024, Peoples R China

[9] Dalian Univ Technol, Sch Control Sci & Engn, Dalian 116024, Peoples R China

[10] East China Univ Sci & Technol, Key Lab Smart Mfg Energy Chem Proc, Minist Educ, Shanghai 200237, Peoples R China

来源：

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS | 2025年 / 21卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Deep reinforcement learning (DRL); electricity market; energy management; energy trading; industrial park; DEMAND RESPONSE;

D O I：

10.1109/TII.2024.3514128

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Energy trading in industrial parks has great potential for reducing carbon emissions and lowering energy bills. This article proposes a safe multiagent deep reinforcement learning algorithm for optimizing the energy trading strategy in industrial parks to achieve less reliance on the main grid and save energy costs. Specifically, an industrial park that contains multiple industrial users with both thermal and electrical load requirements is considered, in which the different users can trade energy with each other and with the main grid based on their own strategies. Unlike the existing studies, the time-phased energy trading problem is transformed into a constrained partially observable Markov game, which models the industrial users and objectives of the buyers and sellers. Finally, a novel multiagent primal-dual proximal policy optimization algorithm that guarantees safety is developed to achieve the optimal trading strategies between the main grid and multiple users. Numerical simulations with real-world data demonstrate that the proposed algorithm allows higher total revenue for sellers and lower total costs for buyers in the park, limits each user's bid or offer to a relatively safe range, and increases the amount of electricity traded locally, while reducing trading with the grid.

引用

页码：2640 / 2649

页数：10

共 10 条

[1] Interpreting Primal-Dual Algorithms for Constrained Multiagent Reinforcement Learning
Tabas, Daniel
Zamzam, Ahmed S.
Zhang, Baosen
LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
[2] Safe Policies for Reinforcement Learning via Primal-Dual Methods
Paternain, Santiago
Calvo-Fullana, Miguel
Chamon, Luiz F. O.
Ribeiro, Alejandro
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (03) : 1321 - 1336
[3] Accelerated Primal-Dual Deep Reinforcement Learning for Efficient Energy Management of Hybrid Electric Vehicles
Shaik, Jewaliddin
Karri, Sri Phani Krishna
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024, : 5525 - 5540
[4] A projected primal-dual gradient optimal control method for deep reinforcement learning
Simon Gottschalk
Michael Burger
Matthias Gerdts
Journal of Mathematics in Industry, 10
[5] A projected primal-dual gradient optimal control method for deep reinforcement learning
Gottschalk, Simon
Burger, Michael
Gerdts, Matthias
JOURNAL OF MATHEMATICS IN INDUSTRY, 2020, 10 (01)
[6] Primal-Dual Deep Reinforcement Learning for Periodic Coverage-Assisted UAV Secure Communications
Qin, Yunhui
Xing, Zhifang
Li, Xulong
Zhang, Zhongshan
Zhang, Haijun
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (12) : 19641 - 19652
[7] Real-Time Optimal Power Flow Method via Safe Deep Reinforcement Learning Based on Primal-Dual and Prior Knowledge Guidance
Wu, Pengfei
Chen, Chen
Lai, Dexiang
Zhong, Jian
Bie, Zhaohong
IEEE TRANSACTIONS ON POWER SYSTEMS, 2025, 40 (01) : 597 - 611
[8] Indoor Periodic Fingerprint Collections by Vehicular Crowdsensing via Primal-Dual Multi-Agent Deep Reinforcement Learning
Yang, Haoming
Zhao, Qiran
Wang, Hao
Liu, Chi Harold
Li, Guozheng
Wang, Guoren
Tang, Jian
Wu, Dapeng
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2024, 42 (10) : 2625 - 2641
[9] Energy-Efficient Computation Offloading Based on Multiagent Deep Reinforcement Learning for Industrial Internet of Things Systems
Chouikhi, Samira
Esseghir, Moez
Merghem-Boulahia, Leila
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (07) : 12228 - 12239
[10] Dual-attention assisted deep reinforcement learning algorithm for energy-efficient resource allocation in Industrial Internet of Things
Wang, Ying
Shang, Fengjun
Lei, Jianjun
Zhu, Xiangwei
Qin, Haoming
Wen, Jiayu
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 142 : 150 - 164

← 1 →