Constraint-aware learning of policies by demonstration

被引:12
|
作者
Armesto, Leopoldo [1 ]
Moura, Joao [2 ,3 ]
Ivan, Vladimir [3 ]
Erden, Mustafa Suphi [2 ]
Sala, Antonio [4 ]
Vijayakumar, Sethu [3 ]
机构
[1] Univ Politecn Valencia, Inst Diseno & Fabricac, C Camino Vera S-N, Valencia 46019, Spain
[2] Heriot Watt Univ, Inst Sensors Signals & Syst, Edinburgh, Midlothian, Scotland
[3] Univ Edinburgh, Inst Act Percept & Behav, Edinburgh, Midlothian, Scotland
[4] Univ Politecn Valencia, Inst Univ Automat & Inf Ind, Valencia, Spain
来源
基金
欧盟地平线“2020”; 英国工程与自然科学研究理事会;
关键词
Direct policy learning; constrained motion; null-space policy; force/torque application; PATH;
D O I
10.1177/0278364918784354
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Many practical tasks in robotic systems, such as cleaning windows, writing, or grasping, are inherently constrained. Learning policies subject to constraints is a challenging problem. In this paper, we propose a method of constraint-aware learning that solves the policy learning problem using redundant robots that execute a policy that is acting in the null space of a constraint. In particular, we are interested in generalizing learned null-space policies across constraints that were not known during the training. We split the combined problem of learning constraints and policies into two: first estimating the constraint, and then estimating a null-space policy using the remaining degrees of freedom. For a linear parametrization, we provide a closed-form solution of the problem. We also define a metric for comparing the similarity of estimated constraints, which is useful to pre-process the trajectories recorded in the demonstrations. We have validated our method by learning a wiping task from human demonstration on flat surfaces and reproducing it on an unknown curved surface using a force- or torque-based controller to achieve tool alignment. We show that, despite the differences between the training and validation scenarios, we learn a policy that still provides the desired wiping motion.
引用
收藏
页码:1673 / 1689
页数:17
相关论文
共 50 条
  • [1] Constraint-Aware XSLT Evaluation
    Li, Ming
    Mani, Murali
    Rundensteiner, Elke A.
    [J]. CONCEPTUAL MODELING - ER 2008, PROCEEDINGS, 2008, 5231 : 524 - 525
  • [2] Constraint-aware Schema Transformation
    Alves, Tiago L.
    Silva, Paulo F.
    Visser, Joost
    [J]. ELECTRONIC NOTES IN THEORETICAL COMPUTER SCIENCE, 2012, 290 : 3 - 18
  • [3] LearnedSQLGen: Constraint-aware SQL Generation using Reinforcement Learning
    Zhang, Lixi
    Chai, Chengliang
    Zhou, Xuanhe
    Li, Guoliang
    [J]. PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22), 2022, : 945 - 958
  • [4] Safe Learning by Constraint-Aware Policy Optimization for Robotic Ultrasound Imaging
    Duan, Anqing
    Yang, Chenguang
    Zhao, Jingyuan
    Huo, Shengzeng
    Zhou, Peng
    Ma, Wanyu
    Zheng, Yongping
    Navarro-Alarcon, David
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, : 1 - 12
  • [5] Constraint-Aware Policy for Compliant Manipulation
    Saito, Daichi
    Sasabuchi, Kazuhiro
    Wake, Naoki
    Kanehira, Atsushi
    Takamatsu, Jun
    Koike, Hideki
    Ikeuchi, Katsushi
    [J]. ROBOTICS, 2024, 13 (01)
  • [6] A Formalisation of Constraint-Aware Model Transformations
    Rutle, Adrian
    Rossini, Alessandro
    Lamo, Yngve
    Wolter, Uwe
    [J]. FUNDAMENTAL APPROACHES TO SOFTWARE ENGINEERING, PROCEEDINGS, 2010, 6013 : 13 - +
  • [7] Constraint-Aware Trajectory for Drone Delivery Services
    Janszen, Jermaine
    Shahzaad, Babar
    Alkouz, Balsam
    Bouguettaya, Athman
    [J]. SERVICE-ORIENTED COMPUTING, ICSOC 2021 WORKSHOPS, 2022, 13236 : 306 - 310
  • [8] Constraint-Aware Approach to Web Service Composition
    Wang, PengWei
    Ding, ZhiJun
    Jiang, ChangJun
    Zhou, MengChu
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2014, 44 (06): : 770 - 784
  • [9] Phoenix: A Constraint-aware Scheduler for Heterogeneous Datacenters
    Thinakaran, Prashanth
    Gunasekaran, Jashwant Raj
    Sharma, Bikash
    Kandemir, Mahmut Taylan
    Das, Chita R.
    [J]. 2017 IEEE 37TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2017), 2017, : 977 - 987
  • [10] Constraint-aware neural networks for Riemann problems
    Magiera, Jim
    Ray, Deep
    Hesthaven, Jan S.
    Rohde, Christian
    [J]. JOURNAL OF COMPUTATIONAL PHYSICS, 2020, 409