Game Theory for Autonomy: From Min-Max Optimization to Equilibrium and Bounded Rationality Learning

Cited by: 1
Authors
Vamvoudakis, Kyriakos G. [1]
Fotiadis, Filippos [1]
Hespanha, Joao P. [2]
Chinchilla, Raphael [2]
Yang, Guosong [3]
Liu, Mushuang [4]
Shamma, Jeff S. [5]
Pavel, Lacra [6]
Affiliations
[1] Georgia Inst Technol, Daniel Guggenheim Sch Aerosp Engn, Atlanta, GA 30332 USA
[2] Univ Calif Santa Barbara, Ctr Control Dynam Syst & Computat, Santa Barbara, CA 93106 USA
[3] Rutgers State Univ, Dept Elect & Comp Engn, Piscataway, NJ 08854 USA
[4] Univ Missouri, Dept Mech & Aerosp Engn, Columbia, MO 65211 USA
[5] Univ Illinois, Dept Ind & Enterprise Syst Engn, Champaign, IL 61801 USA
[6] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON M5S 3G4, Canada
Keywords
FICTITIOUS PLAY; STOCHASTIC APPROXIMATIONS; UNCOUPLED DYNAMICS; GRADIENT PLAY; PASSIVITY; CONVERGENCE; REINFORCEMENT; SYSTEMS; MODEL;
DOI
10.23919/ACC55779.2023.10156432
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Finding Nash equilibria in non-cooperative games can be, in general, an exceptionally challenging task. This is owed to various factors, including but not limited to the cost functions of the game being nonconvex/nonconcave, the players of the game having limited information about one another, or even due to issues of computational complexity. The present tutorial draws motivation from this harsh reality and provides methods to approximate Nash or min-max equilibria in non-ideal settings using both optimization- and learning-based techniques. The tutorial acknowledges, however, that such techniques may not always converge, but instead lead to oscillations or even chaos. In that respect, tools from passivity and dissipativity theory are provided, which can offer explanations about these divergent behaviors. Finally, the tutorial highlights that, more frequently than often thought, the search for equilibrium policies is simply vain; instead, bounded rationality and non-equilibrium policies can be more realistic to employ owing to some players' learning imperfectly or being relatively naive - "bounded rational." The efficacy of such plays is demonstrated in the context of autonomous driving systems, where it is explicitly shown that they can guarantee vehicle safety.
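The non-convergence mentioned in the abstract can be seen in a standard textbook case that is not taken from the tutorial itself: simultaneous gradient descent-ascent on the bilinear game f(x, y) = x * y, whose unique min-max equilibrium is (0, 0). The sketch below is a minimal illustration under that assumption; the function names, step size, and starting point are hypothetical choices made for this example.

```python
import math


def gradient_descent_ascent(x0, y0, step=0.1, iterations=100):
    """Simultaneous gradient play on the assumed cost f(x, y) = x * y.

    Player x minimizes f and player y maximizes f.
    Returns the list of iterates (x_k, y_k).
    """
    x, y = x0, y0
    trajectory = [(x, y)]
    for _ in range(iterations):
        grad_x = y  # partial derivative of f with respect to x
        grad_y = x  # partial derivative of f with respect to y
        # Both players update at once, each using the other's previous action.
        x, y = x - step * grad_x, y + step * grad_y
        trajectory.append((x, y))
    return trajectory


def distance_to_equilibrium(point):
    """Euclidean distance from an iterate to the equilibrium (0, 0)."""
    return math.hypot(point[0], point[1])


trajectory = gradient_descent_ascent(1.0, 0.0)
start = distance_to_equilibrium(trajectory[0])
end = distance_to_equilibrium(trajectory[-1])
print(f"distance to (0, 0): start={start:.3f}, end={end:.3f}")
```

Each update multiplies the squared distance to the equilibrium by 1 + step**2, so the iterates rotate around (0, 0) while spiraling outward rather than converging, for any positive step size.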
Pages: 4363-4380
Page count: 18
Related papers
50 in total
  • [21] Separation of joint plan equilibrium payoffs from the min-max functions
    Simon, RS
    GAMES AND ECONOMIC BEHAVIOR, 2002, 41 (01) : 79 - 102
  • [22] Min-max differential game with partial differential equation
    Youness, Ebrahim A.
    Megahed, Abd El-Monem A.
    Eladdad, Elsayed E.
    Madkour, Hanem F. A.
    AIMS MATHEMATICS, 2022, 7 (08): : 13777 - 13789
  • [23] On direct methods for lexicographic min-max optimization
    Ogryczak, Wlodzimierz
    Sliwinski, Tomasz
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2006, PT 3, 2006, 3982 : 802 - 811
  • [24] Global optimization: On pathlengths in min-max graphs
    Günzel, H
    Jongen, HT
    JOURNAL OF GLOBAL OPTIMIZATION, 2000, 17 (1-4) : 161 - 165
  • [25] Diffusion Stochastic Optimization for Min-Max Problems
    Cai, Haoyuan
    Alghunaim, Sulaiman A.
    Sayed, Ali H.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2025, 73 : 259 - 274
  • [26] Global Optimization: On Pathlengths in Min-Max Graphs
    HARALD GÜNZEL
    HUBERTUS TH. Jongen
    Journal of Global Optimization, 2000, 17 : 161 - 165
  • [27] MIN-MAX FORMULATION AS A STRATEGY IN SHAPE OPTIMIZATION
    ESCHENAUER, H
    KNEPPE, G
    ZEITSCHRIFT FUR ANGEWANDTE MATHEMATIK UND MECHANIK, 1986, 66 (05): : T344 - T345
  • [28] A SIMPLE ALGORITHM FOR MIN-MAX NETWORK OPTIMIZATION
    DIMAIO, B
    SORBELLO, F
    ALTA FREQUENZA, 1988, 57 (05): : 259 - 265
  • [29] Min-max theory of minimal surfaces and applications
    Marques, Fernando C.
    Neves, Andre
    MATHEMATICAL CONGRESS OF THE AMERICAS, 2016, 656 : 13 - 25
  • [30] MIN-MAX THEORY FOR MINIMAL HYPERSURFACES WITH BOUNDARY
    De Lellis, Camillo
    Ramic, Jusuf
    ANNALES DE L INSTITUT FOURIER, 2018, 68 (05) : 1909 - 1986