Deconfounded Opponent Intention Inference for Football Multi-Player Policy Learning

被引:1
|
作者
Wang, Shijie [1 ,2 ]
Pan, Yi [2 ]
Pu, Zhiqiang [1 ,2 ]
Liu, Boyin [1 ,2 ]
Yi, Jianqiang [1 ,2 ]
机构
[1] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
关键词
D O I
10.1109/IROS55552.2023.10341469
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the high complexity of a football match, the opponents' strategies are variable and unknown. Thus predicting the opponents' future intentions accurately based on current situation is crucial for football players' decision-making. To better anticipate the opponents and learn more effective strategies, a deconfounded opponent intention inference (DOII) method for football multi-player policy learning is proposed in this paper. Specifically, opponents' intentions are inferred by an opponent intention supervising module. Furthermore, for some confounders which affect the causal relationship among the players and the opponents, a deconfounded trajectory graph module is designed to mitigate the influence of these confounders and increase the accuracy of the inferences about opponents' intentions. Besides, an opponent-based incentive module is designed to improve the players' sensitivity to the opponents' intentions and further to train reasonable players' strategies. Representative results indicate that DOII can effectively improve the performance of players' strategies in the Google Research Football environment, which validates the superiority of the proposed method.
引用
收藏
页码:8054 / 8061
页数:8
相关论文
共 50 条
  • [1] Long-Term and Short-Term Opponent Intention Inference for Football Multiplayer Policy Learning
    Wang, Shijie
    Pu, Zhiqiang
    Pan, Yi
    Liu, Boyin
    Ma, Hao
    Yi, Jianqiang
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (06) : 2055 - 2069
  • [2] A Fuzzy Inference System for Players Evaluation in Multi-Player Sports: The Football Study Case
    Salabun, Wojciech
    Shekhovtsov, Andrii
    Pamucar, Dragan
    Watrobski, Jaroslaw
    Kizielewicz, Bartlomiej
    Wiceckowski, Jakub
    Bozanic, Darko
    Urbaniak, Karol
    Nyczaj, Bartosz
    SYMMETRY-BASEL, 2020, 12 (12): : 1 - 49
  • [3] Opponent Behavior Prediction in a Multi-Player Game with Imperfect Information
    Chang, Tzu-Le
    Sugiyanto
    Pan, Wei-Cheng
    Tai, Wen-Kai
    Chang, Chin-Chen
    Way, Der-Lor
    2020 IEEE GRAPHICS AND MULTIMEDIA (GAME), 2020, : 43 - 48
  • [4] Heterogeneous multi-player imitation learning
    Lian, Bosen
    Xue, Wenqian
    Lewis, Frank L.
    CONTROL THEORY AND TECHNOLOGY, 2023, 21 (03) : 281 - 291
  • [5] Heterogeneous multi-player imitation learning
    Bosen Lian
    Wenqian Xue
    Frank L. Lewis
    Control Theory and Technology, 2023, 21 (3) : 281 - 291
  • [6] Learning a Multi-Player Chess Game with TreeStrap
    Real, Diogo
    Blair, Alan
    2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 617 - 623
  • [7] A fuzzy inference system with application to player selection and team formation in multi-player sports
    Tavana, Madjid
    Azizi, Farshad
    Azizi, Farzad
    Behzadian, Majid
    SPORT MANAGEMENT REVIEW, 2013, 16 (01) : 97 - 110
  • [8] Multi-player H∞ Differential Game using On-Policy and Off-Policy Reinforcement Learning
    An, Peiliang
    Liu, Mushuang
    Wan, Yan
    Lewis, Frank L.
    2020 IEEE 16TH INTERNATIONAL CONFERENCE ON CONTROL & AUTOMATION (ICCA), 2020, : 1137 - 1142
  • [9] Decentralized Learning for Multi-player Multi-armed Bandits
    Kalathil, Dileep
    Nayyar, Naumaan
    Jain, Rahul
    2012 IEEE 51ST ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2012, : 3960 - 3965
  • [10] Learning the Patterns of Balance in a Multi-Player Shooter Game
    Karavolos, Daniel
    Liapis, Antonios
    Yannakakis, Georgios
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON THE FOUNDATIONS OF DIGITAL GAMES (FDG'17), 2017,