Small-variance asymptotics for non-parametric online robot learning

被引:8
|
作者
Tanwani, Ajay Kumar [1 ,2 ,3 ]
Calinon, Sylvain [1 ,2 ]
机构
[1] Idiap Res Inst, Martigny, Switzerland
[2] EPFL, Martigny, Switzerland
[3] Univ Calif Berkeley, 2111 Etcheverry Hall,2505 Hearst Ave, Berkeley, CA 94709 USA
来源
基金
欧盟地平线“2020”;
关键词
Learning and adaptive systems; Bayesian non-parametrics; online learning; hidden semi-Markov model; subspace clustering; teleoperation; HIGH-DIMENSIONAL DATA; HIDDEN MARKOV-MODELS; MANIPULATION TASKS; MIXTURE; DIRICHLET; TUTORIAL;
D O I
10.1177/0278364918816374
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Small-variance asymptotics is emerging as a useful technique for inference in large-scale Bayesian non-parametric mixture models. This paper analyzes the online learning of robot manipulation tasks with Bayesian non-parametric mixture models under small-variance asymptotics. The analysis yields a scalable online sequence clustering (SOSC) algorithm that is non-parametric in the number of clusters and the subspace dimension of each cluster. SOSC groups the new datapoint in low-dimensional subspaces by online inference in a non-parametric mixture of probabilistic principal component analyzers (MPPCA) based on a Dirichlet process, and captures the state transition and state duration information online in a hidden semi-Markov model (HSMM) based on a hierarchical Dirichlet process. A task-parameterized formulation of our approach autonomously adapts the model to changing environmental situations during manipulation. We apply the algorithm in a teleoperation setting to recognize the intention of the operator and remotely adjust the movement of the robot using the learned model. The generative model is used to synthesize both time-independent and time-dependent behaviors by relying on the principles of shared and autonomous control. Experiments with the Baxter robot yield parsimonious clusters that adapt online with new demonstrations and assist the operator in performing remote manipulation tasks.
引用
收藏
页码:3 / 22
页数:20
相关论文
共 50 条
  • [1] Small-Variance Asymptotics for Dirichlet Process Mixtures of SVMs
    Wang, Yining
    Zhu, Jun
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 2135 - 2141
  • [2] Combinatorial Topic Models using Small-Variance Asymptotics
    Jiang, Ke
    Sra, Suvrit
    Kulis, Brian
    [J]. ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 54, 2017, 54 : 421 - 429
  • [3] Small-Variance Asymptotics for Bayesian Nonparametric Models with Constraints
    Li, Cheng
    Rana, Santu
    Dinh Phung
    Venkatesh, Svetha
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PART II, 2015, 9078 : 92 - 105
  • [4] Small-Variance Asymptotics for Nonparametric Bayesian Overlapping Stochastic Blockmodels
    Arora, Gundeep
    Porwal, Anupreet
    Agarwal, Kanupriya
    Samdariya, Avani
    Rai, Piyush
    [J]. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2000 - 2006
  • [5] Bayesian Hierarchical Clustering with Exponential Family: Small-Variance Asymptotics and Reducibility
    Lee, Juho
    Choi, Seungjin
    [J]. ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 38, 2015, 38 : 581 - 589
  • [6] Non-parametric residual variance estimation in supervised learning
    Liitiaeinen, Elia
    Lendasse, Amaury
    Corona, Francesco
    [J]. COMPUTATIONAL AND AMBIENT INTELLIGENCE, 2007, 4507 : 63 - +
  • [7] JUMP-Means: Small-Variance Asymptotics for Markov Jump Processes
    Huggins, Jonathan H.
    Narasimhan, Karthik
    Saeedi, Ardavan
    Mansinghka, Vikash K.
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 693 - 701
  • [8] Non-parametric Imitation Learning of Robot Motor Skills
    Huang, Yanlong
    Rozo, Leonel
    Silverio, Joao
    Caldwell, Darwin G.
    [J]. 2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 5266 - 5272
  • [9] DP-space: Bayesian Nonparametric Subspace Clustering with Small-variance Asymptotics
    Wang, Yining
    Zhu, Jun
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 862 - 870
  • [10] A Non-Parametric Learning Approach to Identify Online Human Trafficking
    Alvari, Hamidreza
    Shakarian, Paulo
    Snyder, J. E. Kelly
    [J]. IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS: CYBERSECURITY AND BIG DATA, 2016, : 133 - 138