Efficient policy search in low-dimensional embedding spaces by generalizing motion primitives with a parameterized skill memory

被引:0
|
作者
René Felix Reinhart
Jochen Jakob Steil
机构
[1] Bielefeld University,Research Institute for Cognition and Robotics (CoR
来源
Autonomous Robots | 2015年 / 38卷
关键词
Motion primitives; Policy search; Self-organization; Continuous association;
D O I
暂无
中图分类号
学科分类号
摘要
Motion primitives are an established paradigm to generate complex motions from simpler building blocks. A much less addressed issue is at which level to encode and how to organize a library of motion primitives. Typically, the intrinsic variability of a skill is significantly lower-dimensional than the parameter space of motion primitive models. This paper therefore proposes a parameterized skill memory in a first step, which organizes a set of motion primitives in a low-dimensional, topology-preserving embedding space. The skill memory acts as a pivotal mechanism that links low-dimensional skill parametrization to motion primitive parameters and complete motion trajectories. The skill memory is implemented by means of a dynamical system which features continuous generalization of motion shapes and the multi-directional retrieval of motion primitive parameters from low-dimensional skill parametrizations. The skill parametrization can be predefined or automatically discovered, e.g. by unsupervised dimension reduction techniques. The paper shows that parameterized skill memories achieve excellent generalization of motion shapes from few training examples in several scenarios, including the bi-manual manipulation of a rod with the humanoid robot iCub. In a second step, the low-dimensional and topological skill parametrization is leveraged for efficient, gradient-based policy search. Policy search by generalizing motion shapes from low-dimensional parametrizations is compared to conventional policy search in the parameter space of a motion primitive model. It turns out that the reduced search space accessible through the skill memory significantly accelerates the policy improvement.
引用
收藏
页码:331 / 348
页数:17
相关论文
共 12 条