Distributed Multiagent Reinforcement Learning With Action Networks for Dynamic Economic Dispatch

被引：8

作者：

Hu, Chengfang ^{[1
]}

Wen, Guanghui ^{[2
]}

Wang, Shuai ^{[3
,4
]}

Fu, Junjie ^{[2
]}

Yu, Wenwu ^{[2
]}

机构：

[1] Southeast Univ, Sch Cyber Sci & Engn, Nanjing 211189, Peoples R China

[2] Southeast Univ, Sch Math, Dept Syst Sci, Nanjing 211189, Peoples R China

[3] Beihang Univ, Res Inst Frontier Sci, Beijing 100191, Peoples R China

[4] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024年 / 35卷 / 07期

基金：

中国国家自然科学基金;

关键词：

Power demand; Heuristic algorithms; Prediction algorithms; Couplings; Approximation algorithms; Power system stability; Convex functions; Distributed optimization; dynamic economic dispatch; multiagent reinforcement learning (MARL); smart grids; VISIBLE IMAGE FUSION; PERFORMANCE; INFORMATION; ALGORITHM; PROTEIN;

D O I：

10.1109/TNNLS.2023.3234049

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A new class of distributed multiagent reinforcement learning (MARL) algorithm suitable for problems with coupling constraints is proposed in this article to address the dynamic economic dispatch problem (DEDP) in smart grids. Specifically, the assumption made commonly in most existing results on the DEDP that the cost functions are known and/or convex is removed in this article. A distributed projection optimization algorithm is designed for the generation units to find the feasible power outputs satisfying the coupling constraints. By using a quadratic function to approximate the state-action value function of each generation unit, the approximate optimal solution of the original DEDP can be obtained by solving a convex optimization problem. Then, each action network utilizes a neural network (NN) to learn the relationship between the total power demand and the optimal power output of each generation unit, such that the algorithm obtains the generalization ability to predict the optimal power output distribution on an unseen total power demand. Furthermore, an improved experience replay mechanism is introduced into the action networks to improve the stability of the training process. Finally, the effectiveness and robustness of the proposed MARL algorithm are verified by simulation.

引用

页码：9553 / 9564

页数：12

共 50 条

[1] Virtual-Action-Based Coordinated Reinforcement Learning for Distributed Economic Dispatch
Li, Dewen
Yu, Liying
Li, Ning
Lewis, Frank
IEEE TRANSACTIONS ON POWER SYSTEMS, 2021, 36 (06) : 5143 - 5152
[2] Distributed Reinforcement Learning Algorithm for Dynamic Economic Dispatch With Unknown Generation Cost Functions
Dai, Pengcheng
Yu, Wenwu
Wen, Guanghui
Baldi, Simone
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (04) : 2258 - 2267
[3] Distributed Economic Dispatch in Microgrids Based on Cooperative Reinforcement Learning
Liu, Weirong
Zhuang, Peng
Liang, Hao
Peng, Jun
Huang, Zhiwu
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) : 2192 - 2203
[4] Distributed Dynamic Economic Dispatch for Active Distribution Networks
Liang, Junwen
Liu, Mingbo
Lin, Shunjiang
10TH ASIA-PACIFIC POWER AND ENERGY ENGINEERING CONFERENCE (APPEEC 2018), 2018, : 696 - 704
[5] The distributed economic dispatch of smart grid based on deep reinforcement learning
Fu, Yang
Guo, Xiaoyan
Mi, Yang
Yuan, Minghan
Ge, Xiaolin
Su, Xiangjing
Li, Zhenkun
IET GENERATION TRANSMISSION & DISTRIBUTION, 2021, 15 (18) : 2645 - 2658
[6] Multiagent Reinforcement Learning Algorithm for Distributed Dynamic Pricing of Managed Lanes
Pandey, Venktesh
Boyles, Stephen D.
2018 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2018, : 2346 - 2351
[7] Consensus Based Distributed Reinforcement Learning for Nonconvex Economic Power Dispatch in Microgrids
Li, Fangyuan
Qin, Jiahu
Kang, Yu
Zheng, Wei Xing
NEURAL INFORMATION PROCESSING, ICONIP 2017, PT I, 2017, 10634 : 831 - 839
[8] Dynamic Pricing by Multiagent Reinforcement Learning
Han, Wei
Liu, Lingbo
Zheng, Huaili
PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON ELECTRONIC COMMERCE AND SECURITY, 2008, : 226 - 229
[9] Distributed Neural Learning Algorithms for Multiagent Reinforcement Learning
Dai, Pengcheng
Liu, Hongzhe
Yu, Wenwu
Wang, He
IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (23) : 21039 - 21060
[10] Distributed Multiagent Deep Reinforcement Learning for Multiline Dynamic Bus Timetable Optimization
Yan, Haoyang
Cui, Zhiyong
Chen, Xinqiang
Ma, Xiaolei
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (01) : 469 - 479

← 1 2 3 4 5 →