Self-Improvement of Learned Action Models with Learned Goal Models

被引:0
|
作者
Akgun, Baris
Thomaz, Andrea L.
机构
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a new method for robots to further improve upon skills acquired through Learning from Demonstration. Previously, we have introduced a method to learn both an action model to execute the skill and a goal model to monitor the execution of the skill. In this paper we show how to use the learned goal models to improve the learned action models autonomously, without further user interaction. Trajectories are sampled from the action model and executed on the robot. The goal model then labels them as success or failure and the successful ones are used to update the action model. We introduce an adaptive sampling method to speed up convergence. We show through both simulation and real robot experiments that our method can fix a failed action model.
引用
收藏
页码:5259 / 5264
页数:6
相关论文
共 50 条
  • [1] Speed adaptation for self-improvement of skills learned from user demonstrations
    Vuga, Rok
    Nemec, Bojan
    Ude, Ales
    [J]. ROBOTICA, 2016, 34 (12) : 2806 - 2822
  • [2] Velocity adaptation for self-improvement of skills learned from user demonstrations
    Nemec, Bojan
    Gams, Andrej
    Ude, Ales
    [J]. 2013 13TH IEEE-RAS INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2013, : 423 - 428
  • [3] Refining the execution of abstract actions with learned action models
    Stulp, Freek
    Beetz, Michael
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2008, 32 : 487 - 523
  • [4] Refining the execution of abstract actions with learned action models
    Stulp, Freek
    Beetz, Michael
    [J]. Journal of Artificial Intelligence Research, 1600, 32 : 487 - 523
  • [5] Learned trafficability models
    Digney, BL
    [J]. UNMANNED GROUND VEHICLE TECHNOLOGY III, 2001, 4364 : 51 - 60
  • [6] Perception for learned trafficability models
    Broten, GS
    Digney, BL
    [J]. UNMANNED GROUND VEHICLE TECHNOLOGY IV, 2002, 4715 : 149 - 160
  • [7] Learned models for continuous planning
    Schmill, MD
    Oates, T
    Cohen, PR
    [J]. ARTIFICIAL INTELLIGENCE AND STATISTICS 99, PROCEEDINGS, 1999, : 278 - 282
  • [8] Fusion of Self-supervised Learned Models for MOS Prediction
    Yang, Zhengdong
    Zhou, Wangjin
    Chu, Chenhui
    Li, Sheng
    Dabre, Raj
    Rubino, Raphael
    Zhao, Yi
    [J]. INTERSPEECH 2022, 2022, : 5443 - 5447
  • [9] Vitamin D action Lessons learned from genetic mouse models
    Goltzman, David
    [J]. SKELETAL BIOLOGY AND MEDICINE, 2010, 1192 : 145 - 152
  • [10] Self-Improvement
    Wreschner, Arthur
    [J]. ARCHIV FUR DIE GESAMTE PSYCHOLOGIE, 1930, 78 (1-2): : 232 - 233