Efficient computation of optimal actions

被引:238
|
作者
Todorov, Emanuel [1 ,2 ]
机构
[1] Univ Washington, Dept Appl Math & Comp Sci, Seattle, WA 98195 USA
[2] Univ Washington, Dept Engn, Seattle, WA 98195 USA
基金
美国国家科学基金会;
关键词
action selection; cost function; linear Bellman equation; stochastic optimal control;
D O I
10.1073/pnas.0710743106
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Optimal choice of actions is a fundamental problem relevant to fields as diverse as neuroscience, psychology, economics, computer science, and control engineering. Despite this broad relevance the abstract setting is similar: we have an agent choosing actions over time, an uncertain dynamical system whose state is affected by those actions, and a performance criterion that the agent seeks to optimize. Solving problems of this kind remains hard, in part, because of overly generic formulations. Here, we propose a more structured formulation that greatly simplifies the construction of optimal control laws in both discrete and continuous domains. An exhaustive search over actions is avoided and the problem becomes linear. This yields algorithms that outperform Dynamic Programming and Reinforcement Learning, and thereby solve traditional problems more efficiently. Our framework also enables computations that were not possible before: composing optimal control laws by mixing primitives, applying deterministic methods to stochastic systems, quantifying the benefits of error tolerance, and inferring goals from behavioral data via convex optimization. Development of a general class of easily solvable problems tends to accelerate progress-as linear systems theory has done, for example. Our framework may have similar impact in fields where optimal choice of actions is relevant.
引用
收藏
页码:11478 / 11483
页数:6
相关论文
共 50 条
  • [1] Efficient Perfectly Secure Computation with Optimal Resilience
    Ittai Abraham
    Gilad Asharov
    Avishay Yanai
    [J]. Journal of Cryptology, 2022, 35
  • [2] Efficient Perfectly Secure Computation with Optimal Resilience
    Abraham, Ittai
    Asharov, Gilad
    Yanai, Avishay
    [J]. JOURNAL OF CRYPTOLOGY, 2022, 35 (04)
  • [4] MONOTONICITY AND EFFICIENT COMPUTATION OF OPTIMAL DICHOTOMOUS SEARCH
    HASSIN, R
    HENIG, M
    [J]. DISCRETE APPLIED MATHEMATICS, 1993, 46 (03) : 221 - 234
  • [5] Efficient Perfectly Secure Computation with Optimal Resiliencen
    Abraham, Ittai
    Asharov, Gilad
    Yanai, Avishay
    [J]. THEORY OF CRYPTOGRAPHY, TCC 2021, PT II, 2021, 13043 : 66 - 96
  • [6] Efficient Computation of Bayesian Optimal Discriminating Designs
    Dette, Holger
    Guchenko, Roman
    Melas, Viatcheslav B.
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2017, 26 (02) : 424 - 433
  • [7] On Scalable and Efficient Computation of Large Scale Optimal Transport
    Xie, Yujia
    Chen, Minshuo
    Jiang, Haoming
    Zhao, Tuo
    Zha, Hongyuan
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [8] Optimal Decision Tree Synthesis for Efficient Neighborhood Computation
    Grana, Costantino
    Borghesani, Daniele
    [J]. AI (ASTERISK) IA 2009: EMERGENT PERSPECTIVES IN ARTIFICIAL INTELLIGENCE, 2009, 5883 : 92 - 101
  • [9] Efficient computation of optimal oligo-RNA binding
    Hodas, NO
    Aalberts, DP
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 (22) : 6636 - 6642
  • [10] Efficient on-line computation of constrained optimal control
    Borrelli, F
    Baotic, M
    Bemporad, A
    Morari, M
    [J]. PROCEEDINGS OF THE 40TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 2001, : 1187 - 1192