A formal methods approach to interpretable reinforcement learning for robotic planning

Cited by: 66

Authors:

Li, Xiao [1]

Serlin, Zachary [1]

Yang, Guang [2]

Belta, Calin [1,2]

Affiliations:

[1] Boston Univ, Dept Mech Engn, Boston, MA 02215 USA

[2] Boston Univ, Div Syst Engn, Boston, MA 02215 USA

Source:

SCIENCE ROBOTICS | 2019 / Vol. 4 / Issue 37

Funding:

U.S. National Science Foundation;

Keywords:

Computational framework - Control barriers - Domain-specific knowledge - Generation process - Planning and control - Policy generation - Reinforcement learning approach - Task specifications;

DOI:

10.1126/scirobotics.aay6276

Chinese Library Classification (CLC) number:

TP24 [Robotics];

Discipline classification code:

080202 ; 1405 ;

Abstract:

Growing interest in reinforcement learning approaches to robotic planning and control raises concerns of predictability and safety of robot behaviors realized solely through learned control policies. In addition, formally defining reward functions for complex tasks is challenging, and faulty rewards are prone to exploitation by the learning agent. Here, we propose a formal methods approach to reinforcement learning that (i) provides a formal specification language that integrates high-level, rich, task specifications with a priori, domain-specific knowledge; (ii) makes the reward generation process easily interpretable; (iii) guides the policy generation process according to the specification; and (iv) guarantees the satisfaction of the (critical) safety component of the specification. The main ingredients of our computational framework are a predicate temporal logic specifically tailored for robotic tasks and an automaton-guided, safe reinforcement learning algorithm based on control barrier functions. Although the proposed framework is quite general, we motivate it and illustrate it experimentally for a robotic cooking task, in which two manipulators worked together to make hot dogs.

Pages: 15