Optimistic planning with an adaptive number of action switches for near-optimal nonlinear control

Cited by: 0
Authors
Mathe, Koppany [1]
Busoniu, Lucian [1]
Munos, Remi [2]
De Schutter, Bart [3]
Affiliations
[1] Tech Univ Cluj Napoca, Dept Automat, Cluj Napoca, Romania
[2] Google DeepMind, London, England
[3] Delft Univ Technol, Delft Ctr Syst & Control, Delft, Netherlands
Keywords
Optimal control; Planning; Nonlinear predictive control; Near-optimality analysis; MODEL-PREDICTIVE CONTROL; EXPLICIT; OPTIMIZATION;
DOI
10.1016/j.engappai.2017.08.020
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
We consider infinite-horizon optimal control of nonlinear systems where the control actions are discrete, and focus on optimistic planning algorithms from artificial intelligence, which can handle general nonlinear systems with nonquadratic costs. With the main goal of reducing computations, we introduce two such algorithms that only search for constrained action sequences. The constraint prevents the sequences from switching between different actions more than a limited number of times. We call the first method optimistic switch-limited planning (OSP), and develop an analysis showing that its fixed number of switches S leads to polynomial complexity in the search horizon, in contrast to the exponential complexity of the existing OP algorithm for deterministic systems, and to a correspondingly faster convergence towards optimality. Since tuning S is difficult, we introduce an adaptive variant called OASP that automatically adjusts S so as to limit computations while ensuring that near-optimal solutions keep being explored. OSP and OASP are analytically evaluated in representative special cases, and numerically illustrated in simulations of a rotational pendulum. To show that the algorithms also work in challenging applications, OSP is used to control the pendulum in real time, while OASP is applied for trajectory control of a simulated quadrotor. (C) 2017 Elsevier Ltd. All rights reserved.
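The polynomial-versus-exponential contrast in the abstract can be illustrated with a simple count. The sketch below is not the OSP algorithm and not the paper's complexity bound; it only counts how many action sequences of a given length satisfy a switch limit, assuming M discrete actions, horizon d, and at most S switches. The function names and parameters are hypothetical, chosen for illustration. For fixed S the count is a polynomial of degree S in d, whereas the unconstrained count M^d grows exponentially.

```python
from itertools import product
from math import comb


def count_switch_limited(num_actions: int, horizon: int, max_switches: int) -> int:
    """Count action sequences of length `horizon` with at most `max_switches` switches.

    A sequence with exactly s switches is fixed by its first action (M choices),
    the s switch positions among the horizon - 1 gaps, and a different action
    at each switch (M - 1 choices each).
    """
    if horizon == 0:
        return 1  # only the empty sequence
    return sum(
        num_actions * (num_actions - 1) ** s * comb(horizon - 1, s)
        for s in range(max_switches + 1)
    )


def count_by_enumeration(num_actions: int, horizon: int, max_switches: int) -> int:
    """Brute-force check: enumerate all M**horizon sequences and filter them."""
    total = 0
    for seq in product(range(num_actions), repeat=horizon):
        switches = sum(a != b for a, b in zip(seq, seq[1:]))
        if switches <= max_switches:
            total += 1
    return total


if __name__ == "__main__":
    M, S = 3, 1  # illustrative values, not taken from the paper
    for d in (4, 8, 12):
        print(d, count_switch_limited(M, d, S), M ** d)
```

With M = 3 and S = 1, the constrained count grows linearly in d (3 + 6(d - 1)), while the unconstrained count 3^d grows exponentially. A planner restricted to the constrained set therefore has far fewer candidate sequences to consider at long horizons, at the price of excluding sequences that switch more often.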
Pages: 355-367
Number of pages: 13
Related papers
50 in total
  • [31] Efficient, near-optimal control allocation
    Durham, WC
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 1999, 22 (02) : 369 - 372
  • [32] Efficient, near-optimal control allocation
    Virginia Polytechnic Inst and State, Univ, Blacksburg, United States
    J Guid Control Dyn, 2 (369-372):
  • [33] Near-optimal neural-network robot control with adaptive gravity compensation
    Razmi, M.
    Macnab, C. J. B.
    NEUROCOMPUTING, 2020, 389 : 83 - 92
  • [34] Adaptive Near-Optimal Control of Uncertain Systems With Application to Underactuated Surface Vessels
    Zhang, Yinyan
    Li, Shuai
    Liu, Xiaoping
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2018, 26 (04) : 1204 - 1218
  • [35] Adaptive neural event-triggered near-optimal control for affined uncertain nonlinear discrete-time system
    Li, Xinyu
    Ding, Liang
    Li, Shu
    Yang, Huaiguang
    Qi, Huanan
    Gao, Haibo
    Deng, Zongquan
    ASIAN JOURNAL OF CONTROL, 2024, 26 (06) : 3210 - 3225
  • [36] Neural Network-Based Model-Free Adaptive Near-Optimal Tracking Control for a Class of Nonlinear Systems
    Zhang, Yinyan
    Li, Shuai
    Liu, Xiaoping
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (12) : 6227 - 6241
  • [37] Asymptotically Near-Optimal Planning With Probabilistic Roadmap Spanners
    Marble, James D.
    Bekris, Kostas E.
    IEEE TRANSACTIONS ON ROBOTICS, 2013, 29 (02) : 432 - 444
  • [38] Near-Optimal Coverage Path Planning with Turn Costs
    Krupke, Dominik
    2024 PROCEEDINGS OF THE SYMPOSIUM ON ALGORITHM ENGINEERING AND EXPERIMENTS, ALENEX, 2024, : 118 - 132
  • [39] Asymptotically Near-Optimal Is Good Enough for Motion Planning
    Marble, James D.
    Bekris, Kostas E.
    ROBOTICS RESEARCH, ISRR, 2017, 100
  • [40] Linear time near-optimal planning in the blocks world
    Slaney, J
    Thiébaux, S
    PROCEEDINGS OF THE THIRTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE EIGHTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE, VOLS 1 AND 2, 1996, : 1208 - 1214