Evolutionary Game Based Demand Response Bidding Strategy for End-Users Using Q-Learning and Compound Differential Evolution

被引:19
|
作者
Han, Ouzhu [1 ]
Ding, Tao [1 ]
Bai, Linquan [2 ]
He, Yuankang [3 ]
Li, Fangxing [4 ]
Shahidehpour, Mohammad [5 ]
机构
[1] Xi An Jiao Tong Univ, Sch Elect Engn, Xian 710049, Shaanxi, Peoples R China
[2] Univ North Carolina Charlotte, Dept Elect Engn & Comp Sci, Charlotte, NC USA
[3] State Grid Corp China, Northwest Branch, Xian 710048, Shaanxi, Peoples R China
[4] Univ Tennessee, Dept Elect Engn & Comp Sci, Knoxville, TN USA
[5] IIT, Elect & Comp Engn Dept, Chicago, IL 60616 USA
基金
中国国家自然科学基金;
关键词
Cloud computing; evolutionary game; demand response; Q-learning; compound differential evolution; MARKET; AGGREGATORS; ENERGY; MODEL;
D O I
10.1109/TCC.2021.3117956
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Load aggregators (LAs) play a key role in fully tapping the demand response (DR) resources of small and medium-sized end-users to enable a more flexible power grid. In the ancillary service market, the LA can provide DR to the system by aggregating the resources of its users. In response to the issued DR program, end-users offer to provide DR resources. To help optimize the user bidding strategy, an evolutionary game model is presented here in view of the bounded rationality of bidders. A combined Q-learning and compound differential evolution (CDE) algorithm is proposed to deal with the problems of incomplete information and uncertainties in the opponents' decision-making, and prevent the evolutionary stable strategy (ESS) from falling into a local optimum. Moreover, a cloud-computing-based framework is designed and agent servers are introduced to protect data privacy. Numerical results show that by adopting the proposed algorithm, the user's bidding price keeps slightly lower than the opponents' price which guarantees its revenue remains on a high level. This indicates that the proposed algorithm has good adaptability for addressing incomplete information and uncertainties in opponents' decision-making.
引用
收藏
页码:97 / 110
页数:14
相关论文
共 40 条
  • [1] Bonus-Based Demand Response Using Stackelberg Game Approach for Residential End-Users Equipped With HVAC System
    Tavakkoli, Mehdi
    Fattaheian-Dehkordi, Sajjad
    Pourakbari-Kasmaei, Mahdi
    Liski, Matti
    Lehtonen, Matti
    IEEE TRANSACTIONS ON SUSTAINABLE ENERGY, 2021, 12 (01) : 234 - 249
  • [2] The Research on the Mass Incidents Law Based on Evolutionary Game Theory and Q-Learning
    Zhang D.
    Cao Y.
    Zhao C.
    Yao Z.
    Cao Q.
    Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
  • [3] A Bayesian game theoretic based bidding strategy for demand response aggregators in electricity markets
    Abapour, Saeed
    Mohammadi-Ivatloo, Behnam
    Hagh, Mehrdad Tarafdar
    SUSTAINABLE CITIES AND SOCIETY, 2020, 54
  • [4] Robust bidding strategy for demand response aggregators in electricity market based on game theory
    Abapour, Saeed
    Mohammadi-Ivatloo, Behnam
    Hagh, Mehrdad Tarafdar
    JOURNAL OF CLEANER PRODUCTION, 2020, 243 (243)
  • [5] Optimal GWCSO-based home appliances scheduling for demand response considering end-users comfort
    Waseem, Muhammad
    Lin, Zhenzhi
    Liu, Shengyuan
    Sajjad, Intisar Ali
    Aziz, Tarique
    ELECTRIC POWER SYSTEMS RESEARCH, 2020, 187 (187)
  • [6] Individual and cluster demand response in retail electricity trading with end-users in differentiated oligopoly market: A game-theoretical approach
    Yan, Jing
    Zhang, Jun
    Zhang, Luxi
    Deng, Changhong
    Gao, Tianlu
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2024, 161
  • [7] Bidding strategy of thermal power compound differential evolution game under the market mechanism of peak regulation auxiliary service
    Dong, Fugui
    Li, Wanying
    Ji, Zhengsen
    Fatema, Shafaq
    IET GENERATION TRANSMISSION & DISTRIBUTION, 2021, 15 (12) : 1871 - 1883
  • [8] A Robust Q-Learning and Differential Evolution Based Policy Framework for Key Frame Extraction
    Rudra, Sudipta
    Thangavel, Senthil Kumar
    INTELLIGENT COMPUTING, INFORMATION AND CONTROL SYSTEMS, ICICCS 2019, 2020, 1039 : 716 - 728
  • [9] Vibration-based FRP debonding detection using a Q-learning evolutionary algorithm
    Ding, Zhenghao
    Li, Lingfang
    Wang, Xiaoyou
    Yu, Tao
    Xia, Yong
    ENGINEERING STRUCTURES, 2023, 275
  • [10] Interactive Dispatch Modes and Bidding Strategy of Multiple Virtual Power Plants Based on Demand Response and Game Theory
    Wang, Yao
    Ai, Xin
    Tan, Zhongfu
    Yan, Lei
    Liu, Shuting
    IEEE TRANSACTIONS ON SMART GRID, 2016, 7 (01) : 510 - 519