Multi-Agent Reinforcement Learning With Privacy Preservation for Continuous Double Auction-Based P2P Energy Trading

被引:16
|
作者
Zheng, Jiehui [1 ]
Liang, Ze-Ting [1 ]
Li, Yuanzheng [2 ]
Li, Zhigang [1 ]
Wu, Qing-Hua [1 ]
机构
[1] South China Univ Technol, Sch Elect Power Engn, Guangzhou 510640, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Key Lab lmage Informat Proc & Intelligent Control, Minist Educ China, Wuhan 430074, Peoples R China
关键词
Privacy; Training; Tariffs; Peer-to-peer computing; Energy management; Scalability; Power system dynamics; Continue double auction (CDA); dynamic potential based reward shaping; mean-field approximation; multiagent twin delayed deep deterministic policy gradient; peer-to-peer (P2P);
D O I
10.1109/TII.2023.3348823
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With increasing deployment of distributed energy resources, the energy market which aims for local generation and load profile redistribution is facing the challenge to accommodate various types of participants. To realize social welfare maximization with privacy preserving in a dynamic energy market, this article propose a multiagent reinforcement learning (MARL) method for quotation decision optimization in continuous double auction (CDA)-based peer-to-peer (P2P) energy market. To address the nonstationarity and privacy violation brought by multiagent context, we utilize mean-field approximation to abstract the unauthorized local information of other agents from the public market dynamics. An abstract Q-value function is developed for each agent to infer the neighbor agents' local observation and action through the public clearing results in the dynamic CDA market. Moreover, to avoid sparse reward so as to stabilize the learning process, we propose a dynamic potential-based reward shaping term in the reward. Without altering the learnt optimal policies, the agents can be informed with the additional energy storage state as the reward shaping in each time instants. To validate the effectiveness and economy of our proposed method, simulation studies are conducted on a real-world dataset. Simulation results show that the proposed MARL method produces up to 17% more convergent episodic reward and 67% less energy bills which indicates competitive convergence performance and significant economic benefits.
引用
收藏
页码:6582 / 6590
页数:9
相关论文
共 50 条
  • [31] A multi-agent mechanism for topolog balancing in unstructured P2P networks
    Dasgupta, Prithviraj
    2006 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, Proceedings, 2006, : 389 - 392
  • [32] An Efficient Task Allocation Protocol for P2P Multi-Agent Systems
    Ye, Dayong
    Bai, Quan
    Zhang, Minjie
    Win, Khin Than
    Shen, Zhiqi
    2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS, PROCEEDINGS, 2009, : 11 - 18
  • [33] Resource assignment algorithm under multi-agent for P2P MMOG
    Luo, Jia
    Chang, Huiyou
    Yi, Yang
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2010, 47 (12): : 2067 - 2074
  • [34] P2P energy trading: Blockchain-enabled P2P energy society with multi-scale flexibility services
    Wu, Ying
    Wu, Yanpeng
    Cimen, Halil
    Vasquez, Juan C.
    Guerrero, Josep M.
    ENERGY REPORTS, 2022, 8 : 3614 - 3628
  • [35] Auction-Based On-Demand P2P Min-Cost Media Streaming with Network Coding
    Chu, Xiaowen
    Zhao, Kaiyong
    Li, Zongpeng
    Mahanti, Anirban
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2009, 20 (12) : 1816 - 1829
  • [36] Multi-Agent Deep Reinforcement Learning for Simulating Centralized Double-Sided Auction Electricity Market
    Yin, Baocai
    Weng, Haoen
    Hu, Yongli
    Xi, Jiayang
    Ding, Pinggang
    Liu, Jia
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2025, 40 (01) : 518 - 529
  • [37] A multi-agent virtual market model for generalization in reinforcement learning based trading strategies
    He, Fei-Fan
    Chen, Chiao-Ting
    Huang, Szu-Hao
    APPLIED SOFT COMPUTING, 2023, 134
  • [38] Quantum Finance and Fuzzy Reinforcement Learning-Based Multi-agent Trading System
    Cheng, Chi
    Chen, Bingshen
    Xiao, Ziting
    Lee, Raymond S. T.
    INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2024, 26 (07) : 2224 - 2245
  • [39] Deep Reinforcement Learning based Multi-Agent Collaborated Network for Distributed Stock Trading
    Kim, Jung-Jae
    Cha, Si-Ho
    Cho, Kuk-Hyun
    Ryu, Minwoo
    INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2018, 11 (02): : 11 - 20
  • [40] A multi-agent reinforcement learning framework for optimizing financial trading strategies based on TimesNet
    Huang, Yuling
    Zhou, Chujin
    Cui, Kai
    Lu, Xiaoping
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237