Game Theory for Autonomy: From Min-Max Optimization to Equilibrium and Bounded Rationality Learning

被引:1
|
作者
Vamvoudakis, Kyriakos G. [1 ]
Fotiadis, Filippos [1 ]
Hespanha, Joao P. [2 ]
Chinchilla, Raphael [2 ]
Yang, Guosong [3 ]
Liu, Mushuang [4 ]
Shamma, Jeff S. [5 ]
Pavel, Lacra [6 ]
机构
[1] Georgia Inst Technol, Daniel Guggenheim Sch Aerosp Engn, Atlanta, GA 30332 USA
[2] Univ Calif Santa Barbara, Ctr Control Dynam Syst & Computat, Santa Barbara, CA 93106 USA
[3] Rutgers State Univ, Dept Elect & Comp Engn, Piscataway, NJ 08854 USA
[4] Univ Missouri, Dept Mech & Aerosp Engn, Columbia, MO 65211 USA
[5] Univ Illinois, Dept Ind & Enterprise Syst Engn, Champaign, IL 61801 USA
[6] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON M5S 3G4, Canada
关键词
FICTITIOUS PLAY; STOCHASTIC APPROXIMATIONS; UNCOUPLED DYNAMICS; GRADIENT PLAY; PASSIVITY; CONVERGENCE; REINFORCEMENT; SYSTEMS; MODEL;
D O I
10.23919/ACC55779.2023.10156432
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Finding Nash equilibria in non-cooperative games can be, in general, an exceptionally challenging task. This is owed to various factors, including but not limited to the cost functions of the game being nonconvex/nonconcave, the players of the game having limited information about one another, or even due to issues of computational complexity. The present tutorial draws motivation from this harsh reality and provides methods to approximate Nash or min-max equilibria in non-ideal settings using both optimization- and learning-based techniques. The tutorial acknowledges, however, that such techniques may not always converge, but instead lead to oscillations or even chaos. In that respect, tools from passivity and dissipativity theory are provided, which can offer explanations about these divergent behaviors. Finally, the tutorial highlights that, more frequently than often thought, the search for equilibrium policies is simply vain; instead, bounded rationality and non-equilibrium policies can be more realistic to employ owing to some players' learning imperfectly or being relatively naive - "bounded rational." The efficacy of such plays is demonstrated in the context of autonomous driving systems, where it is explicitly shown that they can guarantee vehicle safety.
引用
收藏
页码:4363 / 4380
页数:18
相关论文
共 50 条
  • [41] Min-Max Optimization of Several Classical Discrete Optimization Problems
    G. Yu
    Journal of Optimization Theory and Applications, 1998, 98 : 221 - 242
  • [42] Min-Max payoffs in a two-player location game
    Chawla, S.
    Rajan, U.
    Ravi, R.
    Sinha, A.
    OPERATIONS RESEARCH LETTERS, 2006, 34 (05) : 499 - 507
  • [43] OPTIMIZATION METHOD IN COUNTER TERRORISM: MIN-MAX ZERO-SUM DIFFERENTIAL GAME APPROACH
    Megahed, Abd El-Monem A.
    Youness, Ebrahim A.
    Arafat, Hebatallah K.
    NUMERICAL ALGEBRA CONTROL AND OPTIMIZATION, 2023, 13 (02): : 299 - 313
  • [44] RIEMANNIAN HAMILTONIAN METHODS FOR MIN-MAX OPTIMIZATION ON MANIFOLDS
    Han, Andi
    Mishra, Bamdev
    Jawanpuria, Pratik
    Kumar, Pawan
    Gao, Junbin
    SIAM JOURNAL ON OPTIMIZATION, 2023, 33 (03) : 1797 - 1827
  • [45] SCENARIO MIN-MAX OPTIMIZATION AND THE RISK OF EMPIRICAL COSTS
    Care, A.
    Garatti, S.
    Campi, M. C.
    SIAM JOURNAL ON OPTIMIZATION, 2015, 25 (04) : 2061 - 2080
  • [46] A SMOOTHING-OUT TECHNIQUE FOR MIN-MAX OPTIMIZATION
    ZANG, I
    MATHEMATICAL PROGRAMMING, 1980, 19 (01) : 61 - 77
  • [47] Robust Asymmetric Recommendation via Min-Max Optimization
    Yang, Peng
    Zhao, Peilin
    Zheng, Vincent W.
    Ding, Lizhong
    Gao, Xin
    ACM/SIGIR PROCEEDINGS 2018, 2018, : 1077 - 1080
  • [48] Nonlinear optimization: on the min-max digraph and global smoothing
    Jongen, HT
    Jhones, AR
    CALCULUS OF VARIATIONS AND DIFFERENTIAL EQUATIONS, 2000, 410 : 119 - 135
  • [49] Adversarial Attack Generation Empowered by Min-Max Optimization
    Wang, Jingkang
    Zhang, Tianyun
    Liu, Sijia
    Chen, Pin-Yu
    Xu, Jiacen
    Fardad, Makan
    Li, Bo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [50] Synchronization of Heterogeneous Agents using Min-Max Optimization
    Strubel, Jan
    Stein, Gregor Lukas
    Konigorski, Ulrich
    2015 AMERICAN CONTROL CONFERENCE (ACC), 2015, : 50 - 55