Efficient policy search in low-dimensional embedding spaces by generalizing motion primitives with a parameterized skill memory

Cited by: 0

Authors:

René Felix Reinhart

Jochen Jakob Steil

Affiliations:

[1] Bielefeld University, Research Institute for Cognition and Robotics (CoR

Source:

Autonomous Robots | 2015 / Vol. 38

Keywords:

Motion primitives; Policy search; Self-organization; Continuous association

DOI:

Not available

Chinese Library Classification number:

Subject classification number:

Abstract:

Motion primitives are an established paradigm to generate complex motions from simpler building blocks. A much less addressed issue is at which level to encode and how to organize a library of motion primitives. Typically, the intrinsic variability of a skill is significantly lower-dimensional than the parameter space of motion primitive models. This paper therefore proposes a parameterized skill memory in a first step, which organizes a set of motion primitives in a low-dimensional, topology-preserving embedding space. The skill memory acts as a pivotal mechanism that links low-dimensional skill parametrization to motion primitive parameters and complete motion trajectories. The skill memory is implemented by means of a dynamical system which features continuous generalization of motion shapes and the multi-directional retrieval of motion primitive parameters from low-dimensional skill parametrizations. The skill parametrization can be predefined or automatically discovered, e.g. by unsupervised dimension reduction techniques. The paper shows that parameterized skill memories achieve excellent generalization of motion shapes from few training examples in several scenarios, including the bi-manual manipulation of a rod with the humanoid robot iCub. In a second step, the low-dimensional and topological skill parametrization is leveraged for efficient, gradient-based policy search. Policy search by generalizing motion shapes from low-dimensional parametrizations is compared to conventional policy search in the parameter space of a motion primitive model. It turns out that the reduced search space accessible through the skill memory significantly accelerates the policy improvement.

Pages: 331 - 348

Page count: 17

12 records in total

[1] Efficient policy search in low-dimensional embedding spaces by generalizing motion primitives with a parameterized skill memory
Reinhart, Rene Felix
Steil, Jochen Jakob
AUTONOMOUS ROBOTS, 2015, 38 (04) : 331 - 348
[2] Efficient Policy Search with a Parameterized Skill Memory
Reinhart, Rene Felix
Steil, Jochen Jakob
2014 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2014), 2014, : 1400 - 1407
[3] Low-Dimensional Euclidean Embedding for Visualization of Search Spaces in Combinatorial Optimization
Michalak, Krzysztof
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2019, 23 (02) : 232 - 246
[4] Low-Dimensional Euclidean Embedding for Visualization of Search Spaces in Combinatorial Optimization
Michalak, Krzysztof
PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCO'19 COMPANION), 2019, : 27 - 28
[5] Embedding tree metrics into low-dimensional Euclidean spaces
Gupta, A
DISCRETE & COMPUTATIONAL GEOMETRY, 2000, 24 (01) : 105 - 116
[6] Embedding Tree Metrics into Low-Dimensional Euclidean Spaces
A. Gupta
Discrete & Computational Geometry, 2000, 24 : 105 - 116
[7] Motion synthesis and editing in low-dimensional spaces
Shin, Hyun Joon
Lee, Jehee
COMPUTER ANIMATION AND VIRTUAL WORLDS, 2006, 17 (3-4) : 219 - 227
[8] Learning and exploiting low-dimensional structure for efficient holonomic motion planning in high-dimensional spaces
Vernaza, Paul
Lee, Daniel D.
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2012, 31 (14) : 1739 - 1760
[9] An efficient algorithm for isometrically embedding weighted trees into low-dimensional l∞-normed spaces
Queiroz, Jonathan
Januario, Tiago
KNOWLEDGE-BASED SYSTEMS, 2022, 251
[10] Synthesizing physically realistic human motion in low-dimensional, behavior-specific spaces
Safonova, A
Hodgins, JK
Pollard, NS
ACM TRANSACTIONS ON GRAPHICS, 2004, 23 (03) : 514 - 521
