Temporal-Logic-Based Intermittent, Optimal, and Safe Continuous-Time Learning for Trajectory Tracking

被引:3
|
作者
Kanellopoulos, Aris [1 ]
Fotiadis, Filippos [1 ]
Sun, Chuangchuang [2 ]
Xu, Zhe [3 ]
Vamvoudakis, Kyriakos G. [1 ]
Topcu, Ufuk [4 ,5 ]
Dixon, Warren E. [6 ]
机构
[1] Georgia Inst Technol, Daniel Guggenheim Sch Aerosp Engn, Atlanta, GA 30332 USA
[2] MIT, Dept Aeronaut & Astronaut, Cambridge, MA 02139 USA
[3] Arizona State Univ, Sch Engn Matter Transport & Energy, Tempe, AZ 85287 USA
[4] Univ Texas Austin, Dept Aerosp Engn & Engn Mech, Austin, TX 78712 USA
[5] Univ Texas Austin, Oden Inst Computat Engn & Sci, Austin, TX 78712 USA
[6] Univ Florida, Dept Mech & Aerosp Engn, Gainesville, FL 32611 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/CDC45484.2021.9683309
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we develop safe reinforcement-learning-based controllers for systems tasked with accomplishing complex missions that can be expressed as linear temporal logic specifications, similar to those required by search-and-rescue missions. We decompose the original mission into a sequence of tracking sub-problems under safety constraints. We impose the safety conditions by utilizing barrier functions to map the constrained optimal tracking problem in the physical space to an unconstrained one in the transformed space. Furthermore, we develop policies that intermittently update the control signal to solve the tracking sub-problems with reduced burden in the communication and computation resources. Subsequently, an actor-critic algorithm is utilized to solve the underlying Hamilton-Jacobi-Bellman equations. Finally, we support our proposed framework with stability proofs and showcase its efficacy via simulation results.
引用
收藏
页码:1263 / 1268
页数:6
相关论文
共 50 条
  • [1] Temporal-logic-based intermittent, optimal, and safe continuous-time learning for trajectory tracking
    Kanellopoulos, Aris
    Fotiadis, Filippos
    Sun, Chuangchuang
    Xu, Zhe
    Vamvoudakis, Kyriakos G.
    Topcu, Ufuk
    Dixon, Warren E.
    [J]. arXiv, 2021,
  • [2] Continuous-Time Safe Learning with Temporal Logic Constraints in Adversarial Environments
    Sun, Chuangchuang
    Vamvoudakis, Kyriakos G.
    [J]. 2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 4786 - 4791
  • [3] Adaptive Optimal Trajectory Tracking Control of Continuous-Time Systems
    Buehrle, Etienne
    Koepf, Florian
    Inga, Jairo
    Hohmann, Soeren
    [J]. 2021 EUROPEAN CONTROL CONFERENCE (ECC), 2021, : 1987 - 1994
  • [4] Approximate optimal trajectory tracking for continuous-time nonlinear systems
    Kamalapurkar, Rushikesh
    Dinh, Huyen
    Bhasin, Shubhendu
    Dixon, Warren E.
    [J]. AUTOMATICA, 2015, 51 : 40 - 48
  • [5] Average-based Robustness for Continuous-Time Signal Temporal Logic
    Mehdipour, Noushin
    Vasile, Cristian-Ioan
    Belta, Calin
    [J]. 2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 5312 - 5317
  • [6] Temporal-Logic-Based Reward Shaping for Continuing Reinforcement Learning Tasks
    Jiang, Yuqian
    Bharadwaj, Suda
    Wu, Bo
    Shah, Rishi
    Topcu, Ufuk
    Stone, Peter
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 7995 - 8003
  • [7] Joint Smoothing and Tracking Based on Continuous-Time Target Trajectory Function Fitting
    Li, Tiancheng
    Chen, Huimin
    Sun, Shudong
    Corchado, Juan M.
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2019, 16 (03) : 1476 - 1483
  • [8] Translating a continuous-time temporal logic into timed automata
    Li, GY
    Tang, ZS
    [J]. PROGRAMMING LANGUAGES AND SYSTEMS, PROCEEDINGS, 2003, 2895 : 322 - 338
  • [9] Robustness of temporal logic specifications for continuous-time signals
    Fainekos, Georgios E.
    Pappas, George J.
    [J]. THEORETICAL COMPUTER SCIENCE, 2009, 410 (42) : 4262 - 4291
  • [10] A Theory of Sampling for Continuous-Time Metric Temporal Logic
    Furia, Carlo A.
    Rossi, Matteo
    [J]. ACM TRANSACTIONS ON COMPUTATIONAL LOGIC, 2010, 12 (01)