Generalized multi-agent competitive reinforcement learning with differential augmentation

被引:2
|
作者
Liu, Tianyi [1 ]
Chen, Hechang [1 ,2 ]
Hu, Jifeng [1 ]
Yang, Zhejian [1 ]
Yu, Bo [1 ]
Du, Xinqi [1 ]
Miao, Yinxiao [3 ]
Chang, Yi [1 ,2 ,4 ]
机构
[1] Jilin Univ, Sch Artificial Intelligence, Changchun 130015, Peoples R China
[2] Jilin Univ, Knowledge Driven Human Machine Intelligence Educ E, Changchun 130015, Peoples R China
[3] Beijing Aerosp Inst Metrol & Measurement Technol, Key Lab Artificial Intelligence Measurement & Stan, Beijing 100076, Peoples R China
[4] Jilin Univ, Int Ctr Future Sci, Changchun 130015, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Multi-agent reinforcement learning (MARL); Differential privacy; Data augmentation; OPTIMIZATION; ALGORITHM; ML;
D O I
10.1016/j.eswa.2023.121760
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Observation generalization in multi-agent reinforcement learning is essential for real-world applications as enables agents to formulate effective policies to deal with environmental uncertainty. However, most existing studies seldom consider the generalization ability of agents in competitive scenarios, which is not in line with the actual process of intelligent groups in real applications. We propose a generalized multi-agent reinforcement learning framework (GMARL) by incorporating the data augmentation mechanism of differential privacy, which enhances the agent's adaptability and improves the model's stability. Specifically, we introduce a Laplacian mechanism with temporary adjacency sensitivity into the training and add an observation supplement to the agent's input. Under this mechanism, the agent intelligently considers the environment fluctuation, thus guarding against the adverse effects of the uncertain changes in the environment states. We show the rigorous guarantee of the difference balance between the observation supplement and the original data, and propose series of variants upon the GMARL framework. Extensive experimental results on five benchmark multi-agent competitive environments against with five state-of-the-art algorithms validate the effectiveness and robustness of our proposed method.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Reinforcement Learning for Multi-Agent Competitive Scenarios
    Coutinho, Manuel
    Reis, Luis Paulo
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS (ICARSC), 2022, : 130 - 135
  • [2] Generalized learning automata for multi-agent reinforcement learning
    De Hauwere, Yann-Michael
    Vrancx, Peter
    Nowe, Ann
    [J]. AI COMMUNICATIONS, 2010, 23 (04) : 311 - 324
  • [3] Learning competitive pricing strategies by multi-agent reinforcement learning
    Kutschinski, E
    Uthmann, T
    Polani, D
    [J]. JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 2003, 27 (11-12): : 2207 - 2218
  • [4] Competitive Evolution Multi-Agent Deep Reinforcement Learning
    Zhou, Wenhong
    Chen, Yiting
    Li, Jie
    [J]. PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2019), 2019,
  • [5] Multi-agent reinforcement learning with approximate model learning for competitive games
    Park, Young Joon
    Cho, Yoon Sang
    Kim, Seoung Bum
    [J]. PLOS ONE, 2019, 14 (09):
  • [6] Competitive Multi-Agent Deep Reinforcement Learning with Counterfactual Thinking
    Wang, Yue
    Wan, Yao
    Zhang, Chenwei
    Bai, Lu
    Cui, Lixin
    Yu, Philip S.
    [J]. 2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019), 2019, : 1366 - 1371
  • [7] Multi-Agent Reinforcement Learning
    Stankovic, Milos
    [J]. 2016 13TH SYMPOSIUM ON NEURAL NETWORKS AND APPLICATIONS (NEUREL), 2016, : 43 - 43
  • [8] COMPETITIVE MULTI-AGENT REINFORCEMENT LEARNING WITH SELF-SUPERVISED REPRESENTATION
    Su, DiJia
    Lee, Jason D.
    Mulvey, John M.
    Poor, H. Vincent
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4098 - 4102
  • [9] Competitive Multi-agent Inverse Reinforcement Learning with Sub-optimal Demonstrations
    Wang, Xingyu
    Klabjan, Diego
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [10] Mixed Cooperative-Competitive Communication Using Multi-agent Reinforcement Learning
    Vanneste, Astrid
    Van Wijnsberghe, Wesley
    Vanneste, Simon
    Mets, Kevin
    Mercelis, Siegfried
    Latre, Steven
    Hellinckx, Peter
    [J]. ADVANCES ON P2P, PARALLEL, GRID, CLOUD AND INTERNET COMPUTING, 3PGCIC-2021, 2022, 343 : 197 - 206