A formal methods approach to interpretable reinforcement learning for robotic planning

被引:66
|
作者
Li, Xiao [1 ]
Serlin, Zachary [1 ]
Yang, Guang [2 ]
Belta, Calin [1 ,2 ]
机构
[1] Boston Univ, Dept Mech Engn, Boston, MA 02215 USA
[2] Boston Univ, Div Syst Engn, Boston, MA 02215 USA
基金
美国国家科学基金会;
关键词
Computational framework - Control barriers - Domain-specific knowledge - Generation process - Planning and control - Policy generation - Reinforcement learning approach - Task specifications;
D O I
10.1126/scirobotics.aay6276
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Growing interest in reinforcement learning approaches to robotic planning and control raises concerns of predictability and safety of robot behaviors realized solely through learned control policies. In addition, formally defining reward functions for complex tasks is challenging, and faulty rewards are prone to exploitation by the learning agent. Here, we propose a formal methods approach to reinforcement learning that (i) provides a formal specification language that integrates high-level, rich, task specifications with a priori, domain-specific knowledge; (ii) makes the reward generation process easily interpretable; (iii) guides the policy generation process according to the specification; and (iv) guarantees the satisfaction of the (critical) safety component of the specification. The main ingredients of our computational framework are a predicate temporal logic specifically tailored for robotic tasks and an automaton-guided, safe reinforcement learning algorithm based on control barrier functions. Although the proposed framework is quite general, we motivate it and illustrate it experimentally for a robotic cooking task, in which two manipulators worked together to make hot dogs.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach
    Zhang, Yudi
    Du, Yali
    Huang, Biwei
    Wang, Ziyan
    Wang, Jun
    Fang, Meng
    Pechenizkiy, Mykola
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [2] Optimization Methods for Interpretable Differentiable Decision Trees in Reinforcement Learning
    Silva, Andrew
    Killian, Taylor
    Jimenez, Ivan Rodriguez
    Son, Sung-Hyun
    Gombolay, Matthew
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108
  • [3] A Model-free Deep Reinforcement Learning Approach for Robotic Manipulators Path Planning
    Liu, Wenxing
    Niu, Hanlin
    Mahyuddin, Muhammad Nasiruddin
    Herrmann, Guido
    Carrasco, Joaquin
    [J]. 2021 21ST INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2021), 2021, : 512 - 517
  • [4] The Robotic Arm Velocity Planning Based on Reinforcement Learning
    Hao-Hsuan Huang
    Chih-Kai Cheng
    Yi-Hung Chen
    Hung-Yin Tsai
    [J]. International Journal of Precision Engineering and Manufacturing, 2023, 24 : 1707 - 1721
  • [5] The Robotic Arm Velocity Planning Based on Reinforcement Learning
    Huang, Hao-Hsuan
    Cheng, Chih-Kai
    Chen, Yi-Hung
    Tsai, Hung-Yin
    [J]. INTERNATIONAL JOURNAL OF PRECISION ENGINEERING AND MANUFACTURING, 2023, 24 (09) : 1707 - 1721
  • [6] Advancements in Deep Reinforcement Learning and Inverse Reinforcement Learning for Robotic Manipulation: Toward Trustworthy, Interpretable, and Explainable Artificial Intelligence
    Ozalp, Recep
    Ucar, Aysegul
    Guzelis, Cuneyt
    [J]. IEEE ACCESS, 2024, 12 : 51840 - 51858
  • [7] A survey on interpretable reinforcement learning
    Glanois, Claire
    Weng, Paul
    Zimmer, Matthieu
    Li, Dong
    Yang, Tianpei
    Hao, Jianye
    Liu, Wulong
    [J]. MACHINE LEARNING, 2024, 113 (08) : 5847 - 5890
  • [8] Interpretable Control by Reinforcement Learning
    Hein, Daniel
    Limmer, Steffen
    Runkler, Thomas A.
    [J]. IFAC PAPERSONLINE, 2020, 53 (02): : 8082 - 8089
  • [9] Programmatically Interpretable Reinforcement Learning
    Verma, Abhinav
    Murali, Vijayaraghavan
    Singh, Rishabh
    Kohli, Pushmeet
    Chaudhuri, Swarat
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [10] Action Selection Methods in a Robotic Reinforcement Learning Scenario
    Cruz, Francisco
    Wuppen, Peter
    Fazrie, Alvin
    Weber, Cornelius
    Wermter, Stefan
    [J]. 2018 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2018,