When Waiting Is Not an Option: Learning Options with a Deliberation Cost

被引:0
|
作者
Harb, Jean [1 ]
Bacon, Pierre-Luc [1 ]
Klissarov, Martin [1 ]
Precup, Doina [1 ]
机构
[1] McGill Univ, Reasoning & Learning Lab, Montreal, PQ, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent work has shown that temporally extended actions (options) can be learned fully end-to-end as opposed to being specified in advance. While the problem of how to learn options is increasingly well understood, the question of what good options should be has remained elusive. We formulate our answer to what good options should be in the bounded rationality framework (Simon, 1957) through the notion of deliberation cost. We then derive practical gradient-based learning algorithms to implement this objective. Our results in the Arcade Learning Environment (ALE) show increased performance and interpretability.
引用
收藏
页码:3165 / 3172
页数:8
相关论文
共 50 条
  • [1] Decisions with weather warnings when waiting is an option
    Mu, Di
    Kaplan, Todd R.
    Dankers, Rutger
    INTERNATIONAL JOURNAL OF DISASTER RISK REDUCTION, 2024, 102
  • [2] Learning when time is an option
    Beavers, Randy
    Dadzie, Richard
    COGENT BUSINESS & MANAGEMENT, 2020, 7 (01):
  • [3] TECHNICAL NOTE: WAITING COST MODELS FOR REAL OPTIONS
    Eschenbach, Ted G.
    Lewis, Neal A.
    Hartman, Joseph C.
    ENGINEERING ECONOMIST, 2009, 54 (01): : 1 - 21
  • [4] Successor Options: An Option Discovery Framework for Reinforcement Learning
    Ramesh, Rahul
    Tomar, Manan
    Ravindran, Balaraman
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3304 - 3310
  • [5] Milia-Like Idiopathic Calcinosis Cutis: When Waiting Is the Best Option
    Grazzini, Marta
    Bassi, Andrea
    Mazzatenta, Carlo
    JOURNAL OF PEDIATRICS, 2020, 224 : 173 - 174
  • [6] Waiting to Choose: The Role of Deliberation in Intertemporal Choice
    Imas, Alex
    Kuhn, Michael A.
    Mironova, Vera
    AMERICAN ECONOMIC JOURNAL-MICROECONOMICS, 2022, 14 (03) : 414 - 440
  • [7] Network Pricing With Investment Waiting Cost Based on Real Options Under Uncertainties
    Yan, Xiaohe
    Gu, Chenghong
    Zhang, Hongcai
    Liu, Nian
    Li, Furong
    Song, Yonghua
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2023, 38 (01) : 427 - 435
  • [8] Migration and the option value of waiting
    Burda, MC
    ECONOMIC AND SOCIAL REVIEW, 1995, 27 (01) : 1 - 19
  • [9] When trading options is not the only option: The effects of single-stock futures trading on options market quality
    Jiang, George J.
    Shimizu, Yoshiki
    Strong, Cuyler
    JOURNAL OF FUTURES MARKETS, 2020, 40 (09) : 1398 - 1419
  • [10] WHEN DELIBERATION PRODUCES EXTREMISM
    Schkade, David
    Sunstein, Cass R.
    Hastie, Reid
    CRITICAL REVIEW, 2010, 22 (2-3) : 227 - 252