Task-Driven Reinforcement Learning With Action Primitives for Long-Horizon Manipulation Skills

被引:4
|
作者
Wang, Hao [1 ]
Zhang, Hao [1 ]
Li, Lin [1 ]
Kan, Zhen [1 ]
Song, Yongduan [2 ]
机构
[1] Univ Sci & Technol China, Dept Automat, Hefei 230026, Anhui, Peoples R China
[2] Chongqing Univ, Sch Automat, Chongqing 400044, Peoples R China
基金
中国国家自然科学基金;
关键词
Action primitives; linear temporal logic (LTL); long-horizon manipulation skills; task-driven RL;
D O I
10.1109/TCYB.2023.3298195
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
It is an interesting open problem to enable robots to efficiently and effectively learn long-horizon manipulation skills. Motivated to augment robot learning via more effective exploration, this work develops task-driven reinforcement learning with action primitives (TRAPs), a new manipulation skill learning framework that augments standard reinforcement learning algorithms with formal methods and parameterized action space (PAS). In particular, TRAPs uses linear temporal logic (LTL) to specify complex manipulation skills. LTL progression, a semantics-preserving rewriting operation, is then used to decompose the training task at an abstract level, informs the robot about their current task progress, and guides them via reward functions. The PAS, a predefined library of heterogeneous action primitives, further improves the efficiency of robot exploration. We highlight that TRAPs augments the learning of manipulation skills in both learning efficiency and effectiveness (i.e., task constraints). Extensive empirical studies demonstrate that TRAPs outperforms most existing methods.Sign
引用
收藏
页码:4513 / 4526
页数:14
相关论文
共 50 条
  • [1] State-Dependent Maximum Entropy Reinforcement Learning for Robot Long-Horizon Task Learning
    Deshuai Zheng
    Jin Yan
    Tao Xue
    Yong Liu
    [J]. Journal of Intelligent & Robotic Systems, 2024, 110
  • [2] State-Dependent Maximum Entropy Reinforcement Learning for Robot Long-Horizon Task Learning
    Zheng, Deshuai
    Yan, Jin
    Xue, Tao
    Liu, Yong
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2024, 110 (01)
  • [3] Mixline: A Hybrid Reinforcement Learning Framework for Long-Horizon Bimanual Coffee Stirring Task
    Sun, Zheng
    Wang, Zhiqi
    Liu, Junjia
    Li, Miao
    Chen, Fei
    [J]. INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2022), PT I, 2022, 13455 : 627 - 636
  • [4] Task-Driven Semantic Coding via Reinforcement Learning
    Li, Xin
    Shi, Jun
    Chen, Zhibo
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 6307 - 6320
  • [5] LEAGUE: Guided Skill Learning and Abstraction for Long-Horizon Manipulation
    Cheng, Shuo
    Xu, Danfei
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (10) : 6451 - 6458
  • [6] SC-AIRL: Share-Critic in Adversarial Inverse Reinforcement Learning for Long-Horizon Task
    Xiang, Guangyu
    Li, Shaodong
    Shuang, Feng
    Gao, Fang
    Yuan, Xiaogang
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (04) : 3179 - 3186
  • [7] Deep Reinforcement Learning for Task-Driven Discovery of Incomplete Networks
    Morales, Peter
    Caceres, Rajmonda Sulo
    Eliassi-Rad, Tina
    [J]. COMPLEX NETWORKS AND THEIR APPLICATIONS VIII, VOL 1, 2020, 881 : 903 - 914
  • [8] Uncertainty-aware hierarchical reinforcement learning for long-horizon tasks
    Wenning Hu
    Hongbin Wang
    Ming He
    Nianbin Wang
    [J]. Applied Intelligence, 2023, 53 : 28555 - 28569
  • [9] Uncertainty-aware hierarchical reinforcement learning for long-horizon tasks
    Hu, Wenning
    Wang, Hongbin
    He, Ming
    Wang, Nianbin
    [J]. APPLIED INTELLIGENCE, 2023, 53 (23) : 28555 - 28569
  • [10] TaskSum: Task-Driven Extractive Text Summarization for Long News Documents Based on Reinforcement Learning
    Tang, Moming
    Cheng, Dawei
    Chen, Cen
    Liang, Yuqi
    Luo, Yifeng
    Qian, Weining
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2022, PT III, 2022, : 306 - 313