Optimistic planning with an adaptive number of action switches for near-optimal nonlinear control

Cited by: 0
Authors
Mathe, Koppany [1]
Busoniu, Lucian [1]
Munos, Remi [2]
De Schutter, Bart [3]
Affiliations
[1] Tech Univ Cluj Napoca, Dept Automat, Cluj Napoca, Romania
[2] Google DeepMind, London, England
[3] Delft Univ Technol, Delft Ctr Syst & Control, Delft, Netherlands
Keywords
Optimal control; Planning; Nonlinear predictive control; Near-optimality analysis; MODEL-PREDICTIVE CONTROL; EXPLICIT; OPTIMIZATION
DOI
10.1016/j.engappai.2017.08.020
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
We consider infinite-horizon optimal control of nonlinear systems where the control actions are discrete, and focus on optimistic planning algorithms from artificial intelligence, which can handle general nonlinear systems with nonquadratic costs. With the main goal of reducing computations, we introduce two such algorithms that only search for constrained action sequences. The constraint prevents the sequences from switching between different actions more than a limited number of times. We call the first method optimistic switch-limited planning (OSP), and develop analysis showing that its fixed number of switches S leads to polynomial complexity in the search horizon, in contrast to the exponential complexity of the existing OP algorithm for deterministic systems; and to a correspondingly faster convergence towards optimality. Since tuning S is difficult, we introduce an adaptive variant called OASP that automatically adjusts S so as to limit computations while ensuring that near-optimal solutions keep being explored. OSP and OASP are analytically evaluated in representative special cases, and numerically illustrated in simulations of a rotational pendulum. To show that the algorithms also work in challenging applications, OSP is used to control the pendulum in real time, while OASP is applied for trajectory control of a simulated quadrotor. (C) 2017 Elsevier Ltd. All rights reserved.
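The contrast between polynomial and exponential growth mentioned in the abstract can be illustrated by simply counting candidate action sequences. The sketch below is not the authors' OSP algorithm and does not reproduce their complexity bound; it only counts, by a standard combinatorial argument, how many sequences of a given length stay within a switch limit S, compared with the unconstrained count. The function names and parameters (`num_actions`, `depth`, `max_switches`) are hypothetical and chosen for illustration.

```python
from math import comb


def count_switch_limited(num_actions: int, depth: int, max_switches: int) -> int:
    """Count length-`depth` sequences over `num_actions` discrete actions
    that change action at most `max_switches` times.

    Illustrative assumption: a "switch" is any position where the action
    differs from the previous one.
    """
    if depth == 0:
        return 1  # only the empty sequence
    total = 0
    # A sequence with exactly s switches: choose the s switch positions among
    # the depth - 1 gaps, pick the first action, then pick a different action
    # at every switch.
    for s in range(min(max_switches, depth - 1) + 1):
        total += comb(depth - 1, s) * num_actions * (num_actions - 1) ** s
    return total


def count_unconstrained(num_actions: int, depth: int) -> int:
    """Count all length-`depth` sequences, with no switch limit."""
    return num_actions ** depth


if __name__ == "__main__":
    # Hypothetical setting: 3 discrete actions, at most 2 switches.
    for depth in (5, 10, 20, 40):
        limited = count_switch_limited(3, depth, 2)
        full = count_unconstrained(3, depth)
        print(f"depth={depth:2d}  switch-limited={limited}  unconstrained={full}")
```

For a fixed S, the switch-limited count grows like a polynomial of degree S in the sequence length, while the unconstrained count grows exponentially. When S is at least the length minus one, the two counts coincide.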
Pages: 355-367
Number of pages: 13