Multi-Agent Reinforcement Learning With Privacy Preservation for Continuous Double Auction-Based P2P Energy Trading

Cited by: 16
Authors
Zheng, Jiehui [1]
Liang, Ze-Ting [1]
Li, Yuanzheng [2]
Li, Zhigang [1]
Wu, Qing-Hua [1]
Affiliations
[1] South China Univ Technol, Sch Elect Power Engn, Guangzhou 510640, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Key Lab Image Informat Proc & Intelligent Control, Minist Educ China, Wuhan 430074, Peoples R China
Keywords
Privacy; Training; Tariffs; Peer-to-peer computing; Energy management; Scalability; Power system dynamics; Continuous double auction (CDA); dynamic potential-based reward shaping; mean-field approximation; multiagent twin delayed deep deterministic policy gradient; peer-to-peer (P2P)
DOI
10.1109/TII.2023.3348823
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
With the increasing deployment of distributed energy resources, the energy market, which aims at local generation and load profile redistribution, faces the challenge of accommodating various types of participants. To maximize social welfare while preserving privacy in a dynamic energy market, this article proposes a multiagent reinforcement learning (MARL) method for quotation decision optimization in a continuous double auction (CDA)-based peer-to-peer (P2P) energy market. To address the nonstationarity and privacy violations introduced by the multiagent context, we use a mean-field approximation to abstract the unauthorized local information of other agents from the public market dynamics. An abstract Q-value function is developed for each agent to infer the neighboring agents' local observations and actions from the public clearing results of the dynamic CDA market. Moreover, to avoid sparse rewards and thereby stabilize the learning process, we add a dynamic potential-based reward shaping term to the reward. Without altering the learned optimal policies, this term informs the agents of the energy storage state at each time instant. To validate the effectiveness and economic benefit of the proposed method, simulation studies are conducted on a real-world dataset. Simulation results show that the proposed MARL method achieves up to 17% higher converged episodic reward and 67% lower energy bills, indicating competitive convergence performance and significant economic benefits.
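The reward shaping idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the potential function, the use of a normalized state of charge `soc`, and every name below are assumptions chosen for illustration. The sketch only shows the general form of dynamic potential-based shaping, F = gamma * Phi(s', t+1) - Phi(s, t), and why its undiscounted sum over an episode telescopes to a value that does not depend on the intermediate states.

```python
# Hypothetical sketch of dynamic potential-based reward shaping.
# The potential below is an illustrative assumption, not the paper's definition.

def potential(soc: float, t: int, horizon: int) -> float:
    """Assumed potential: stored energy is valued more early in the horizon."""
    return soc * (1.0 - t / horizon)


def shaped_reward(r: float, soc: float, soc_next: float,
                  t: int, horizon: int, gamma: float = 0.99) -> float:
    """Return r + gamma * Phi(s', t + 1) - Phi(s, t)."""
    shaping = gamma * potential(soc_next, t + 1, horizon) - potential(soc, t, horizon)
    return r + shaping


def total_shaping(socs: list[float], gamma: float = 1.0) -> float:
    """Sum the shaping terms along one trajectory of storage states.

    socs holds the state of charge at t = 0, 1, ..., horizon.
    """
    horizon = len(socs) - 1
    return sum(
        shaped_reward(0.0, socs[t], socs[t + 1], t, horizon, gamma)
        for t in range(horizon)
    )
```

With `gamma = 1.0`, the sum returned by `total_shaping` collapses to `potential(socs[-1], horizon, horizon) - potential(socs[0], 0, horizon)`, so under this assumed potential it depends only on the initial state of charge. That telescoping property is the usual argument for why potential-based shaping leaves the optimal policy unchanged.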
Pages: 6582-6590
Number of pages: 9
Related papers
50 in total
  • [21] Multi-agent interaction based collaborative P2P system for fighting Spam
    Mo, Guoqing
    Zhao, Wei
    Cao, Haixia
    Dong, Jianshe
    2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2006, : 428 - 431
  • [22] The collaboration alliance mechanism of P2P based on mobile multi-agent technology
    Xu, Xiao-Long
    Wang, Ru-Chuan
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2007, 29 (02): : 345 - 349
  • [23] A resource discovery model based on multi-agent technology in P2P system
    Dan, W
    IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2004, : 548 - 551
  • [24] A multi-agent framework for a P2P data sharing facility
    Deen, SM
    19TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS, VOL 1, PROCEEDINGS: AINA 2005, 2005, : 35 - 40
  • [25] A Three-Stage Multi-Energy Trading Strategy Based on P2P Trading Mode
    Yang, Jie
    Xu, Wenya
    Ma, Kai
    Li, Conghui
    IEEE TRANSACTIONS ON SUSTAINABLE ENERGY, 2023, 14 (01) : 233 - 241
  • [26] Joint Energy and Carbon Trading for Multi-Microgrid System Based on Multi-Agent Deep Reinforcement Learning
    Zhou, Yanting
    Ma, Zhongjing
    Wang, Tianyu
    Zhang, Jinhui
    Shi, Xingyu
    Zou, Suli
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2024, 39 (06) : 7376 - 7388
  • [27] Automatic P2P Energy Trading Model Based on Reinforcement Learning Using Long Short-Term Delayed Reward
    Kim, Jin-Gyeom
    Lee, Bowon
    ENERGIES, 2020, 13 (20)
  • [28] Renewable energy integration and microgrid energy trading using multi-agent deep reinforcement learning
    Harrold, Daniel J. B.
    Cao, Jun
    Fan, Zhong
    APPLIED ENERGY, 2022, 318
  • [29] A Multi-Agent System for Collaborative Editing in Mobile Networks and P2P
    Driss, Mechaoui Moulay
    Fatima, Bendella
    Abdessamad, Imine
    IAMA: 2009 INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT & MULTI-AGENT SYSTEMS, 2009, : 372 - +
  • [30] Implementation of Blockchain based P2P Energy Trading Platform
    Kwak, Subin
    Lee, Joohyung
    35TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2021), 2021, : 5 - 7