On the Complexity of Adversarial Decision Making

被引:0
|
作者
Foster, Dylan J.
Rakhlin, Alexander
Sekhari, Ayush
Sridharan, Karthik
机构
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A central problem in online learning and decision making-from bandits to reinforcement learning-is to understand what modeling assumptions lead to sample-efficient learning guarantees. We consider a general adversarial decision making framework that encompasses (structured) bandit problems with adversarial rewards and reinforcement learning problems with adversarial dynamics. Our main result is to show-via new upper and lower bounds-that the Decision-Estimation Coefficient, a complexity measure introduced by Foster et al. [17] in the stochastic counterpart to our setting, is necessary and sufficient to obtain low regret for adversarial decision making. However, compared to the stochastic setting, one must apply the Decision-Estimation Coefficient to the convex hull of the class of models (or, hypotheses) under consideration. This establishes that the price of accommodating adversarial rewards or dynamics is governed by the behavior of the model class under convexification, and recovers a number of existing results-both positive and negative. En route to obtaining these guarantees, we provide new structural results that connect the Decision-Estimation Coefficient to variants of other well-known complexity measures, including the Information Ratio of Russo and Van Roy [47] and the Exploration-by-Optimization objective of Lattimore and Gyorgy [32].
引用
收藏
页数:14
相关论文
共 50 条
  • [1] A survey of decision making in adversarial games
    Li, Xiuxian
    Meng, Min
    Hong, Yiguang
    Chen, Jie
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (04)
  • [2] A survey of decision making in adversarial games
    Xiuxian LI
    Min MENG
    Yiguang HONG
    Jie CHEN
    [J]. Science China(Information Sciences), 2024, 67 (04) : 85 - 112
  • [3] A survey of decision making in adversarial games
    Xiuxian Li
    Min Meng
    Yiguang Hong
    Jie Chen
    [J]. Science China Information Sciences, 2024, 67
  • [4] COMPLEXITY AND DECISION-MAKING
    MACKINNON, AJ
    WEARING, AJ
    [J]. BEHAVIORAL SCIENCE, 1980, 25 (04): : 285 - 296
  • [5] The Stress as Adversarial Factor for Cyber Decision Making
    Sandoval Rodriguez-Bermejo, David
    Maestre Vidal, Jorge
    Estevez Tapiador, Juan Manuel
    [J]. ARES 2021: 16TH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY AND SECURITY, 2021,
  • [6] ADVERSARIAL DECISION-MAKING - BENEFITS OR LOSSES
    ELROD, R
    MOSS, SE
    [J]. OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 1994, 22 (03): : 283 - 289
  • [7] Adversarial vulnerabilities of human decision-making
    Dezfouli, Amir
    Nock, Richard
    Dayan, Peter
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (46) : 29221 - 29228
  • [8] COMPLEXITY, INCREASING FAILURE, AND DECISION MAKING
    STREUFERT, S
    STREUFERT, SC
    CASTORE, CH
    [J]. JOURNAL OF EXPERIMENTAL RESEARCH IN PERSONALITY, 1969, 3 (04): : 293 - 300
  • [9] Improving Decision Making in Complexity Environment
    Gorzen-Mitka, Iwona
    Okreglicka, Malgorzata
    [J]. 21ST INTERNATIONAL ECONOMIC CONFERENCE OF SIBIU 2014, IECS 2014 PROSPECTS OF ECONOMIC RECOVERY IN A VOLATILE INTERNATIONAL CONTEXT: MAJOR OBSTACLES, INITIATIVES AND PROJECTS, 2014, 16 : 402 - 409
  • [10] Complexity of ethical decision making in psychiatry
    Morenz, B
    Sales, B
    [J]. ETHICS & BEHAVIOR, 1997, 7 (01) : 1 - 14