Trial and Error Using Previous Experiences as Simulation Models in Humanoid Motor Learning

被引:4
|
作者
Sugimoto, Norikazu [1 ]
Tangkaratt, Voot [2 ]
Wensveen, Thijs [3 ]
Zhao, Tingting [4 ]
Sugiyama, Masashi [2 ]
Morimoto, Jun [5 ]
机构
[1] Natl Inst Informat & Commun Technol, Osaka, Japan
[2] Univ Tokyo, Tokyo 1138654, Japan
[3] Delft Univ Technol, NL-2600 AA Delft, Netherlands
[4] Tianjin Univ Sci & Technol, Tianjin, Peoples R China
[5] ATR Computat Neurosci Labs, Kyoto, Japan
关键词
POLICY GRADIENTS; SAMPLE REUSE; ROBOTICS;
D O I
10.1109/MRA.2015.2511681
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Since biological systems have the ability to efficiently reuse previous experiences to change their behavioral strategies to avoid enemies or find food, the number of required samples from real environments to improve behavioral policy is greatly reduced. Even for real robotic systems, it is desirable to use only a limited number of samples from real environments due to the limited durability of real systems to reduce the required time to improve control performance. In this article, we used previous experiences as environmental local models so that the movement policy of a humanoid robot can be efficiently improved with a limited number of samples from its real environment. We applied our proposed learning method to a real humanoid robot and successfully achieve two challenging control tasks. We applied our proposed learning approach to acquire a policy for a cart-pole swing-up task in a real-virtual hybrid task environment, where the robot waves a PlayStation (PS) Move motion controller to move a cart-pole in a virtual simulator. Furthermore, we applied our proposed method to a challenging basketball-shooting task in a real environment. © 1994-2011 IEEE.
引用
收藏
页码:96 / 105
页数:10
相关论文
共 50 条
  • [41] SIMULATION EDUCATION IN THE INTERNET AGE: SOME EXPERIENCES ON THE USE OF PURE ONLINE AND BLENDED LEARNING MODELS
    Fonseca, Pau
    Juan, Angel A.
    Pla, Lluis M.
    Rodriguez, Sara V.
    Faulin, Javier
    PROCEEDINGS OF THE 2009 WINTER SIMULATION CONFERENCE (WSC 2009 ), VOL 1-4, 2009, : 291 - +
  • [42] Using Film to Elucidate Leadership Effectiveness Models: Reflection on Authentic Learning Experiences
    Rajendran, Diana
    Andrew, Martin
    JOURNAL OF UNIVERSITY TEACHING AND LEARNING PRACTICE, 2014, 11 (01):
  • [43] Using International Nursing Association for Clinical Simulation and Learning Standards to Evaluate the Rigor of High-Fidelity Simulation Learning Experiences
    Grota, Patti G.
    O'Neal, Cynthia
    CLINICAL SIMULATION IN NURSING, 2020, 46 : 50 - 56
  • [44] Coordinating multi-agent (creature) systems using ecological models of motor control and motor learning
    Bennett, K
    AUSTRALIAN JOURNAL OF PSYCHOLOGY, 2001, 53 : 43 - 43
  • [45] Reduction in overlay error from mark asymmetry using simulation, ORION, and alignment models
    Menchtchikov, Boris
    Socha, Robert
    Zheng, Chumeng
    Raghunathan, Sudhar
    Aarts, Igor
    Shome, Krishanu
    Lee, Jonathan
    de Ruiter, Chris
    Rijpstra, Manouk
    Megens, Henry
    Brinkhof, Ralph
    Teeuwisse, Floris
    Karssemeijer, Leendertjan
    Lyulina, Irina
    Li, Chung-Tien
    Hermans, Jan
    Leray, Philippe
    OPTICAL MICROLITHOGRAPHY XXXI, 2018, 10587
  • [46] A CONCURRENT LEARNING ALGORITHM OF FORWARD AND INVERSE MODELS USING FEEDBACK ERROR LEARNING IN THE EARLY-STAGE
    YAMAGUCHI, S
    OKAZAKI, N
    ITAKURA, H
    SYSTEMS AND COMPUTERS IN JAPAN, 1995, 26 (03) : 65 - 73
  • [47] Students' engagement and learning experiences using virtual patient simulation in a computer supported collaborative learning environment
    Adefila, A.
    Opie, J.
    Ball, S.
    Bluteau, P.
    INNOVATIONS IN EDUCATION AND TEACHING INTERNATIONAL, 2020, 57 (01) : 50 - 61
  • [48] INITIATION IN THE DIGESTIVE ENDOSCOPY, PEDAGOGIC EXPERIENCES USING 6 SIMULATION-MODELS AND CINEMATOGRAPHY
    LUCERO, RS
    ESPINIELLA, F
    ZARATE, J
    PADORNO, L
    TOMAS, J
    APUD, A
    GROSSO, M
    DIGESTIVE DISEASES AND SCIENCES, 1986, 31 (10) : S257 - S257
  • [49] LEARNING SEQUENTIAL DECISION RULES USING SIMULATION-MODELS AND COMPETITION
    GREFENSTETTE, JJ
    RAMSEY, CL
    SCHULTZ, AC
    MACHINE LEARNING, 1990, 5 (04) : 355 - 381
  • [50] Robotic learning of motion using demonstrations and statistical models for surgical simulation
    Yang, Tao
    Chui, Chee Kong
    Liu, Jiang
    Huang, Weimin
    Su, Yi
    Chang, Stephen K. Y.
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2014, 9 (05) : 813 - 823