Imitation Learning with Non-Parametric Regression

被引:0
|
作者
Vaandrager, Maarten [1 ]
Babuska, Robert [2 ]
Busoniu, Lucian [3 ,4 ]
Lopes, Gabriel A. D. [2 ]
机构
[1] Plotprojects, NL-1078 MN Amsterdam, Netherlands
[2] Delft Univ Technol, DCSC, NL-2628 CD Delft, Netherlands
[3] CNRS, Res Ctr Automat Control CRAN, F-54516 Vandoeuvre Les Nancy, France
[4] Tech Univ Cluj Napoca, Dept Automat, Cluj Napoca 400020, Romania
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Humans are very fast learners. Yet, we rarely learn a task completely from scratch. Instead, we usually start with a rough approximation of the desired behavior and take the learning from there. In this paper, we use imitation to quickly generate a rough solution to a robotic task from demonstrations, supplied as a collection of state-space trajectories. Appropriate control actions needed to steer the system along the trajectories are then automatically learned in the form of a (nonlinear) state feedback control law. The learning scheme has two components: a dynamic reference model and an adaptive inverse process model, both based on a data-driven, non-parametric method called local linear regression. The reference model infers the desired behavior from the demonstration trajectories, while the inverse process model provides the control actions to achieve this behavior and is improved online using learning. Experimental results with a pendulum swing-up problem and a robotic arm demonstrate the practical usefulness of this approach. The resulting learned dynamics are not limited to single trajectories, but capture instead the overall dynamics of the motion, making the proposed approach a promising step towards versatile learning machines such as future household robots, or robots for autonomous missions.
引用
收藏
页码:91 / 96
页数:6
相关论文
共 50 条
  • [31] ON CLASS OF NON-PARAMETRIC TESTS FOR REGRESSION PARAMETERS
    SRIVASTAVA, MS
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1968, 39 (02): : 697 - +
  • [32] On the Validity of the Bootstrap in Non-Parametric Functional Regression
    Ferraty, Frederic
    Van Keilegom, Ingrid
    Vieu, Philippe
    [J]. SCANDINAVIAN JOURNAL OF STATISTICS, 2010, 37 (02) : 286 - 306
  • [33] Test for Linearity in Non-Parametric Regression Models
    Khedidja, Djaballah-Djeddour
    Moussa, Tazerouti
    [J]. AUSTRIAN JOURNAL OF STATISTICS, 2022, 51 (01) : 16 - 34
  • [34] Non-parametric regression with a latent time series
    Linton, Oliver
    Nielsen, Jens Perch
    Nielsen, Soren Feodor
    [J]. ECONOMETRICS JOURNAL, 2009, 12 (02): : 187 - 207
  • [35] Local Dimensionality Reduction for Non-Parametric Regression
    Hoffmann, Heiko
    Schaal, Stefan
    Vijayakumar, Sethu
    [J]. NEURAL PROCESSING LETTERS, 2009, 29 (02) : 109 - 131
  • [36] Non-parametric quantile regression with censored data
    Gannoun, A
    Saracco, J
    Yuan, A
    Bonney, GE
    [J]. SCANDINAVIAN JOURNAL OF STATISTICS, 2005, 32 (04) : 527 - 550
  • [37] Comparing non-parametric regression lines via regression depth
    Wilcox, Rand R.
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2010, 80 (04) : 379 - 387
  • [38] LEARNING NON-PARAMETRIC MODELS OF PRONUNCIATION
    Hutchinson, Brian
    Droppo, Jasha
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4904 - 4907
  • [39] Non-parametric Representation Learning with Kernels
    Esser, Pascal
    Fleissner, Maximilian
    Ghoshdastidar, Debarghya
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 11, 2024, : 11910 - 11918
  • [40] A new test for the parametric form of the variance function in non-parametric regression
    Dette, Holger
    Neurneyer, Natalie
    Van Keilegorn, Ingrid
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2007, 69 : 903 - 917