Optimistic planning with an adaptive number of action switches for near-optimal nonlinear control

被引:0
|
作者
Mathe, Koppany [1 ]
Busoniu, Lucian [1 ]
Munos, Remi [2 ]
De Schutter, Bart [3 ]
机构
[1] Tech Univ Cluj Napoca, Dept Automat, Cluj Napoca, Romania
[2] Google DeepMind, London, England
[3] Delft Univ Technol, Delft Ctr Syst & Control, Delft, Netherlands
关键词
Optimal control; Planning; Nonlinear predictive control; Near-optimality analysis; MODEL-PREDICTIVE CONTROL; EXPLICIT; OPTIMIZATION;
D O I
10.1016/j.engappai.2017.08.020
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider infinite-horizon optimal control of nonlinear systems where the control actions are discrete, and focus on optimistic planning algorithms from artificial intelligence, which can handle general nonlinear systems with nonquadratic costs. With the main goal of reducing computations, we introduce two such algorithms that only search for constrained action sequences. The constraint prevents the sequences from switching between different actions more than a limited number of times. We call the first method optimistic switch-limited planning (OSP), and develop analysis showing that its fixed number of switches S leads to polynomial complexity in the search horizon, in contrast to the exponential complexity of the existing OP algorithm for deterministic systems; and to a correspondingly faster convergence towards optimality. Since tuning S is difficult, we introduce an adaptive variant called OASP that automatically adjusts S so as to limit computations while ensuring that near-optimal solutions keep being explored. OSP and OASP are analytically evaluated in representative special cases, and numerically illustrated in simulations of a rotational pendulum. To show that the algorithms also work in challenging applications, OSP is used to control the pendulum in real time, while OASP is applied for trajectory control of a simulated quadrotor. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:355 / 367
页数:13
相关论文
共 50 条
  • [21] NEAR-OPTIMAL CONTROL OF DISCRETE-TIME NONLINEAR STOCHASTIC SYSTEMS
    HEALEY, M
    PROCEEDINGS OF THE INSTITUTION OF ELECTRICAL ENGINEERS-LONDON, 1972, 119 (02): : 260 - &
  • [22] Near-Optimal Control for Offshore Structures with Nonlinear Energy Sink Mechanisms
    Chen, Wei
    Du, Xingyu
    Zhang, Bao-Lin
    Cai, Zhihui
    Zheng, Zhongqiang
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (06)
  • [23] Accuracy Estimation of the Synthesis of Near-optimal Control for a Nonlinear Plant.
    Aleksandrov, E.E.
    Trigub, M.V.
    Izvestia vyssih ucebnyh zavedenij. Priborostroenie, 1982, 25 (10): : 27 - 31
  • [24] A near-optimal decentralized control approach for polynomial nonlinear interconnected systems
    Attia, Mohamed Sadok
    Bouafoura, Mohamed Karim
    Braiek, Naceur Benhadj
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2021, 43 (06) : 1363 - 1375
  • [25] Optimal, near-optimal, and robust epidemic control
    Dylan H. Morris
    Fernando W. Rossine
    Joshua B. Plotkin
    Simon A. Levin
    Communications Physics, 4
  • [26] Optimal, near-optimal, and robust epidemic control
    Morris, Dylan H.
    Rossine, Fernando W.
    Plotkin, Joshua B.
    Levin, Simon A.
    COMMUNICATIONS PHYSICS, 2021, 4 (01)
  • [27] Planning Near-Optimal Corridors Amidst Obstacles
    Wein, Ron
    van den Berg, Jur
    Halperin, Dan
    ALGORITHMIC FOUNDATION OF ROBOTICS VII, 2008, 47 : 491 - +
  • [28] Adaptive Event-Triggered Near-Optimal Tracking Control for Unknown Continuous-Time Nonlinear Systems
    Wang, Kunfu
    Gu, Qijia
    Huang, Baiqiao
    Wei, Qinglai
    Zhou, Tianmin
    IEEE ACCESS, 2022, 10 : 9506 - 9518
  • [29] Near-Optimal Sparse Adaptive Group Testing
    Tan, Nelvin
    Scarlett, Jonathan
    2020 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2020, : 1420 - 1425
  • [30] A NEAR-OPTIMAL METHOD FOR REASONING ABOUT ACTION
    PRATT, VR
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1980, 20 (02) : 231 - 254