Imitation Learning with Non-Parametric Regression

Cited by: 0
Authors
Vaandrager, Maarten [1 ]
Babuska, Robert [2 ]
Busoniu, Lucian [3 ,4 ]
Lopes, Gabriel A. D. [2 ]
Affiliations
[1] Plotprojects, NL-1078 MN Amsterdam, Netherlands
[2] Delft Univ Technol, DCSC, NL-2628 CD Delft, Netherlands
[3] CNRS, Res Ctr Automat Control CRAN, F-54516 Vandoeuvre Les Nancy, France
[4] Tech Univ Cluj Napoca, Dept Automat, Cluj Napoca 400020, Romania
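Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract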
Humans are very fast learners. Yet, we rarely learn a task completely from scratch. Instead, we usually start with a rough approximation of the desired behavior and take the learning from there. In this paper, we use imitation to quickly generate a rough solution to a robotic task from demonstrations, supplied as a collection of state-space trajectories. Appropriate control actions needed to steer the system along the trajectories are then automatically learned in the form of a (nonlinear) state feedback control law. The learning scheme has two components: a dynamic reference model and an adaptive inverse process model, both based on a data-driven, non-parametric method called local linear regression. The reference model infers the desired behavior from the demonstration trajectories, while the inverse process model provides the control actions to achieve this behavior and is improved online through learning. Experimental results with a pendulum swing-up problem and a robotic arm demonstrate the practical usefulness of this approach. The resulting learned dynamics are not limited to single trajectories, but instead capture the overall dynamics of the motion, making the proposed approach a promising step towards versatile learning machines such as future household robots, or robots for autonomous missions.
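The abstract names local linear regression as the method behind both models. As a rough illustration only, and not the authors' implementation, the sketch below shows the basic idea for a scalar input: keep the samples in memory, pick the k samples nearest to a query, and fit a straight line to just those neighbours. The function name `local_linear_predict`, the parameter `k`, and the nearest-neighbour selection rule are assumptions made for this example. The paper works with multi-dimensional state vectors, and the abstract does not describe those details.

```python
import math


def local_linear_predict(xs, ys, query, k=5):
    """Predict y at `query` by fitting a line to the k nearest stored samples.

    Hypothetical helper for illustration; scalar inputs only.
    """
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need at least two (x, y) samples")
    k = max(2, min(k, len(xs)))
    # Select the indices of the k stored samples closest to the query.
    nearest = sorted(range(len(xs)), key=lambda i: abs(xs[i] - query))[:k]
    nx = [xs[i] for i in nearest]
    ny = [ys[i] for i in nearest]
    # Ordinary least squares for y = a + b * x, using the neighbours only.
    mean_x = sum(nx) / k
    mean_y = sum(ny) / k
    sxx = sum((x - mean_x) ** 2 for x in nx)
    if sxx == 0.0:
        # All neighbours share one x value; fall back to their mean output.
        return mean_y
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(nx, ny))
    slope = sxy / sxx
    return mean_y + slope * (query - mean_x)


if __name__ == "__main__":
    # Memory of samples drawn from a nonlinear function (sine, as a stand-in).
    xs = [i * 0.1 for i in range(63)]
    ys = [math.sin(x) for x in xs]
    print(local_linear_predict(xs, ys, 1.05, k=4))
```

Because the line is refitted for every query, the overall prediction can follow a nonlinear function even though each local fit is linear. New samples can be appended to the memory at any time, which is one plausible reading of how such a model might be "improved online".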
Pages: 91-96
Number of pages: 6
Related papers
50 items in total
  • [1] Non-parametric Imitation Learning of Robot Motor Skills
    Huang, Yanlong
    Rozo, Leonel
    Silverio, Joao
    Caldwell, Darwin G.
    [J]. 2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 5266 - 5272
  • [2] Non-parametric regression for networks
    Severn, Katie E.
    Dryden, Ian L.
    Preston, Simon P.
    [J]. STAT, 2021, 10 (01):
  • [3] Non-parametric regression methods
    Ince H.
    [J]. Computational Management Science, 2006, 3 (2) : 161 - 174
  • [4] A note on combining parametric and non-parametric regression
    Rahman, M
    Gokhale, DV
    Ullah, A
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 1997, 26 (02) : 519 - 529
  • [5] Non-Parametric Regression and Riesz Estimators
    Kountzakis, Christos
    Tsachouridou-Papadatou, Vasileia
    [J]. AXIOMS, 2023, 12 (04)
  • [6] Parametrically guided non-parametric regression
    Glad, IK
    [J]. SCANDINAVIAN JOURNAL OF STATISTICS, 1998, 25 (04) : 649 - 668
  • [7] Non-parametric regression with wavelet kernels
    Rakotomamonjy, A
    Mary, X
    Canu, S
    [J]. APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2005, 21 (02) : 153 - 163
  • [8] Non-parametric Regression for Circular Responses
    Di Marzio, Marco
    Panzera, Agnese
    Taylor, Charles C.
    [J]. SCANDINAVIAN JOURNAL OF STATISTICS, 2013, 40 (02) : 238 - 255
  • [9] A NOTE ON NON-PARAMETRIC CENSORED REGRESSION
    MCLEISH, DL
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 1983, 18 (01) : 1 - 6
  • [10] NON-PARAMETRIC ESTIMATION OF A REGRESSION FUNCTION
    SCHUSTER, EF
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1968, 39 (02): 695 - +