Optimistic Reinforcement Learning-Based Skill Insertions for Task and Motion Planning

Cited by: 1

Authors:

Liu, Gaoyuan ^{[1,2]}

de Winter, Joris ^{[1]}

Durodie, Yuri ^{[1,2]}

Steckelmacher, Denis ^{[3]}

Nowe, Ann ^{[3]}

Vanderborght, Bram ^{[1,2]}

Affiliations:

[1] Vrije Univ Brussel, Brubot, B-1050 Brussels, Belgium

[2] IMEC, B-3001 Leuven, Belgium

[3] Vrije Univ Brussel, Artificial Intelligence AI Lab, B-1050 Brussels, Belgium

Source:

IEEE ROBOTICS AND AUTOMATION LETTERS | 2024 / Vol. 9 / Issue 06

Keywords:

Manipulation planning; reinforcement learning; task and motion planning; sampling-based methods

DOI:

10.1109/LRA.2024.3398402

Chinese Library Classification (CLC) number:

TP24 [Robotics]

Discipline classification codes:

080202; 1405

Abstract:

Task and motion planning (TAMP) for robotic manipulation necessitates long-horizon reasoning involving versatile actions and skills. While deterministic actions can be crafted by sampling or optimizing under certain constraints, planning actions with uncertainty, i.e., probabilistic actions, remains a challenge for TAMP. In contrast, Reinforcement Learning (RL) excels in acquiring versatile, yet short-horizon, manipulation skills that are robust to uncertainties. In this letter, we design a method that integrates RL skills into TAMP pipelines. Besides the policy, an RL skill is defined with data-driven logical components that enable the skill to be deployed by symbolic planning. A plan refinement sub-routine is designed to further tackle the inevitable effect uncertainties. In the experiments, we compare our method with baseline hierarchical planning methods from both the TAMP and RL fields and illustrate the strength of the method. The results show that by embedding RL skills, we extend the capability of TAMP to domains with probabilistic skills and improve planning efficiency compared to previous methods.

Pages: 5974-5981

Number of pages: 8

50 items in total

[41] Reinforcement Learning-Based Multimodal Model for the Stock Investment Portfolio Management Task
Du, Sha
Shen, Hailong
ELECTRONICS, 2024, 13 (19)
[42] Deep Reinforcement Learning-Based Task Assignment for Cooperative Mobile Edge Computing
Hsieh, Li-Tse
Liu, Hang
Guo, Yang
Gazda, Robert
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (04) : 3156 - 3171
[43] Anomaly Detection for Scalable Task Grouping in Reinforcement Learning-based RAN Optimization
Li, Jimmy
Kozlov, Igor
Wu, Di
Liu, Xue
Dudek, Gregory
2024 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS, ICC WORKSHOPS 2024, 2024, : 1395 - 1400
[44] DRLQ: A Deep Reinforcement Learning-based Task Placement for Quantum Cloud Computing
Nguyen, Hoa T.
Usman, Muhammad
Buyya, Rajkumar
2024 IEEE 17TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, CLOUD 2024, 2024, : 475 - 481
[45] A Reinforcement Learning-Based Incentive Mechanism for Task Allocation Under Spatiotemporal Crowdsensing
Jiang, Kaige
Wang, Yingjie
Wang, Haipeng
Liu, Zhaowei
Han, Qilong
Zhou, Ao
Xiang, Chaocan
Cai, Zhipeng
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (02) : 2179 - 2189
[46] Deep reinforcement learning-based dynamical task offloading for mobile edge computing
Xie, Bo
Cui, Haixia
JOURNAL OF SUPERCOMPUTING, 2025, 81 (01):
[47] Federated Deep Reinforcement Learning-Based Task Allocation in Vehicular Fog Computing
Shi, Jinming
Du, Jun
Wang, Jian
Yuan, Jian
2022 IEEE 95TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-SPRING), 2022,
[48] Deep Reinforcement Learning-based Task Offloading Decision in the Time Varying Channel
Jeong, Jinkyo
Kim, Il-Min
Hong, Daesik
2021 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2021,
[49] Reinforcement Learning-Based Task Scheduling Using DVFS Techniques in Mobile Devices
HajiKhodaverdian, Mohammadamin
Rastaghi, Hamed
Saadat, Milad
Shah-Mansouri, Hamed
2023 IEEE 34TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS, PIMRC, 2023,
[50] Multi-AGV motion planning based on deep reinforcement learning
Sun H.
Yuan W.
Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2024, 30 (02) : 708 - 716
