Optimistic Reinforcement Learning-Based Skill Insertions for Task and Motion Planning

被引:1
|
作者
Liu, Gaoyuan [1 ,2 ]
de Winter, Joris [1 ]
Durodie, Yuri [1 ,2 ]
Steckelmacher, Denis [3 ]
Nowe, Ann [3 ]
Vanderborght, Bram [1 ,2 ]
机构
[1] Vrije Univ Brussel, Brubot, B-1050 Brussels, Belgium
[2] IMEC, B-3001 Leuven, Belgium
[3] Vrije Univ Brussel, Artificial Intelligence AI Lab, B-1050 Brussels, Belgium
来源
关键词
Manipulation planning; reinforcement learning; task and motion planning; SAMPLING-BASED METHODS;
D O I
10.1109/LRA.2024.3398402
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Task and motion planning (TAMP) for robotics manipulation necessitates long-horizon reasoning involving versatile actions and skills. While deterministic actions can be crafted by sampling or optimizing with certain constraints, planning actions with uncertainty, i.e., probabilistic actions, remains a challenge for TAMP. On the contrary, Reinforcement Learning (RL) excels in acquiring versatile, yet short-horizon, manipulation skills that are robust with uncertainties. In this letter, we design a method that integrates RL skills into TAMP pipelines. Besides the policy, a RL skill is defined with data-driven logical components that enable the skill to be deployed by symbolic planning. A plan refinement sub-routine is designed to further tackle the inevitable effect uncertainties. In the experiments, we compare our method with baseline hierarchical planning from both TAMP and RL fields and illustrate the strength of the method. The results show that by embedding RL skills, we extend the capability of TAMP to domains with probabilistic skills, and improve the planning efficiency compared to the previous methods.
引用
收藏
页码:5974 / 5981
页数:8
相关论文
共 50 条
  • [41] Reinforcement Learning-Based Multimodal Model for the Stock Investment Portfolio Management Task
    Du, Sha
    Shen, Hailong
    ELECTRONICS, 2024, 13 (19)
  • [42] Deep Reinforcement Learning-Based Task Assignment for Cooperative Mobile Edge Computing
    Hsieh, Li-Tse
    Liu, Hang
    Guo, Yang
    Gazda, Robert
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (04) : 3156 - 3171
  • [43] Anomaly Detection for Scalable Task Grouping in Reinforcement Learning-based RAN Optimization
    Li, Jimmy
    Kozlov, Igor
    Wu, Di
    Liu, Xue
    Dudek, Gregory
    2024 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS, ICC WORKSHOPS 2024, 2024, : 1395 - 1400
  • [44] DRLQ: A Deep Reinforcement Learning-based Task Placement for Quantum Cloud Computing
    Nguyen, Hoa T.
    Usman, Muhammad
    Buyya, Rajkumar
    2024 IEEE 17TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, CLOUD 2024, 2024, : 475 - 481
  • [45] A Reinforcement Learning-Based Incentive Mechanism for Task Allocation Under Spatiotemporal Crowdsensing
    Jiang, Kaige
    Wang, Yingjie
    Wang, Haipeng
    Liu, Zhaowei
    Han, Qilong
    Zhou, Ao
    Xiang, Chaocan
    Cai, Zhipeng
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (02) : 2179 - 2189
  • [46] Deep reinforcement learning-based dynamical task offloading for mobile edge computing
    Xie, Bo
    Cui, Haixia
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (01):
  • [47] Federated Deep Reinforcement Learning-Based Task Allocation in Vehicular Fog Computing
    Shi, Jinming
    Du, Jun
    Wang, Jian
    Yuan, Jian
    2022 IEEE 95TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-SPRING), 2022,
  • [48] Deep Reinforcement Learning-based Task Offloading Decision in the Time Varying Channel
    Jeong, Jinkyo
    Kim, Il-Min
    Hong, Daesik
    2021 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2021,
  • [49] Reinforcement Learning-Based Task Scheduling Using DVFS Techniques in Mobile Devices
    HajiKhodaverdian, Mohammadamin
    Rastaghi, Hamed
    Saadat, Milad
    Shah-Mansouri, Hamed
    2023 IEEE 34TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS, PIMRC, 2023,
  • [50] Multi-AGV motion planning based on deep reinforcement learning
    Sun H.
    Yuan W.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2024, 30 (02): : 708 - 716