Multi-Agent Reinforcement Learning With Privacy Preservation for Continuous Double Auction-Based P2P Energy Trading

被引:16
|
作者
Zheng, Jiehui [1 ]
Liang, Ze-Ting [1 ]
Li, Yuanzheng [2 ]
Li, Zhigang [1 ]
Wu, Qing-Hua [1 ]
机构
[1] South China Univ Technol, Sch Elect Power Engn, Guangzhou 510640, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Key Lab lmage Informat Proc & Intelligent Control, Minist Educ China, Wuhan 430074, Peoples R China
关键词
Privacy; Training; Tariffs; Peer-to-peer computing; Energy management; Scalability; Power system dynamics; Continue double auction (CDA); dynamic potential based reward shaping; mean-field approximation; multiagent twin delayed deep deterministic policy gradient; peer-to-peer (P2P);
D O I
10.1109/TII.2023.3348823
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With increasing deployment of distributed energy resources, the energy market which aims for local generation and load profile redistribution is facing the challenge to accommodate various types of participants. To realize social welfare maximization with privacy preserving in a dynamic energy market, this article propose a multiagent reinforcement learning (MARL) method for quotation decision optimization in continuous double auction (CDA)-based peer-to-peer (P2P) energy market. To address the nonstationarity and privacy violation brought by multiagent context, we utilize mean-field approximation to abstract the unauthorized local information of other agents from the public market dynamics. An abstract Q-value function is developed for each agent to infer the neighbor agents' local observation and action through the public clearing results in the dynamic CDA market. Moreover, to avoid sparse reward so as to stabilize the learning process, we propose a dynamic potential-based reward shaping term in the reward. Without altering the learnt optimal policies, the agents can be informed with the additional energy storage state as the reward shaping in each time instants. To validate the effectiveness and economy of our proposed method, simulation studies are conducted on a real-world dataset. Simulation results show that the proposed MARL method produces up to 17% more convergent episodic reward and 67% less energy bills which indicates competitive convergence performance and significant economic benefits.
引用
收藏
页码:6582 / 6590
页数:9
相关论文
共 50 条
  • [1] P2P trading of heat and power via a continuous double auction
    Hutty, Timothy D.
    Brown, Solomon
    APPLIED ENERGY, 2024, 369
  • [2] Competitive-Cooperative Multi-Agent Reinforcement Learning for Auction-based Federated Learning
    Tang, Xiaoli
    Yu, Han
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 4262 - 4270
  • [3] An Auction-based Approach to Spectrum Allocation using Multi-agent Reinforcement Learning
    Abji, Nadeem
    Leon-Garcia, Alberto
    2010 IEEE 21ST INTERNATIONAL SYMPOSIUM ON PERSONAL INDOOR AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2010, : 2233 - 2238
  • [4] Flocking-based decentralised double auction for P2P energy trading within neighbourhoods
    Bandara, Kosala Yapa
    Thakur, Subhasis
    Breslin, John
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2021, 129
  • [5] A Multi-Agent Approach for P2P Energy Trading with EV Battery Thermal Profile Management
    Singh, Anshuman
    Sampath, L. P. Mohasha Isuru
    Dinh Hoa Nguyen
    Hoay Beng Gooi
    Hung Dinh Nguyen
    2022 IEEE VEHICLE POWER AND PROPULSION CONFERENCE (VPPC), 2022,
  • [6] Multi-Agent Reinforcement Learning for Automated Peer-to-Peer Energy Trading in Double-Side Auction Market
    Qiu, Dawei
    Wang, Jianhong
    Wang, Junkai
    Strbac, Goran
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 2913 - 2920
  • [7] Physics-Guided Multi-Agent Adversarial Reinforcement Learning for Robust Active Voltage Control With Peer-to-Peer (P2P) Energy Trading
    Chen, Pengcheng
    Liu, Shichao
    Wang, Xiaozhe
    Kamwa, Innocent
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2024, 39 (06) : 7089 - 7101
  • [8] P2P energy trading of multi-energy prosumers: An electricity-heat coupling double auction market
    Fu, Yang
    Shan, Jie
    Li, Zhenkun
    Pan, Jeng-Shyang
    APPLIED ENERGY, 2025, 390
  • [9] P2P power trading based on reinforcement learning for nanogrid clusters
    Jin, Hojun
    Nengroo, Sarvar Hussain
    Jin, Juhee
    Har, Dongsoo
    Lee, Sangkeum
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 255
  • [10] Auction-Based P2P VoD Streaming: Incentives and Optimal Scheduling
    Wu, Chuan
    Li, Zongpeng
    Qiu, Xuanjia
    Lau, Francis C. M.
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2012, 8 (01)