Trial and Error Using Previous Experiences as Simulation Models in Humanoid Motor Learning

被引:4
|
作者
Sugimoto, Norikazu [1 ]
Tangkaratt, Voot [2 ]
Wensveen, Thijs [3 ]
Zhao, Tingting [4 ]
Sugiyama, Masashi [2 ]
Morimoto, Jun [5 ]
机构
[1] Natl Inst Informat & Commun Technol, Osaka, Japan
[2] Univ Tokyo, Tokyo 1138654, Japan
[3] Delft Univ Technol, NL-2600 AA Delft, Netherlands
[4] Tianjin Univ Sci & Technol, Tianjin, Peoples R China
[5] ATR Computat Neurosci Labs, Kyoto, Japan
关键词
POLICY GRADIENTS; SAMPLE REUSE; ROBOTICS;
D O I
10.1109/MRA.2015.2511681
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Since biological systems have the ability to efficiently reuse previous experiences to change their behavioral strategies to avoid enemies or find food, the number of required samples from real environments to improve behavioral policy is greatly reduced. Even for real robotic systems, it is desirable to use only a limited number of samples from real environments due to the limited durability of real systems to reduce the required time to improve control performance. In this article, we used previous experiences as environmental local models so that the movement policy of a humanoid robot can be efficiently improved with a limited number of samples from its real environment. We applied our proposed learning method to a real humanoid robot and successfully achieve two challenging control tasks. We applied our proposed learning approach to acquire a policy for a cart-pole swing-up task in a real-virtual hybrid task environment, where the robot waves a PlayStation (PS) Move motion controller to move a cart-pole in a virtual simulator. Furthermore, we applied our proposed method to a challenging basketball-shooting task in a real environment. © 1994-2011 IEEE.
引用
收藏
页码:96 / 105
页数:10
相关论文
共 50 条
  • [21] Error Estimation Models Integrating Previous Models and Using Artificial Neural Networks for Embedded Software Development Projects
    Iwata, Kazunori
    Nakashima, Toyoshiro
    Anan, Yoshiyuki
    Ishii, Naohiro
    20TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, VOL 2, PROCEEDINGS, 2008, : 371 - +
  • [22] State Estimation for Force-Controlled Humanoid Balance using Simple Models in the Presence of Modeling Error
    Stephens, Benjamin J.
    2011 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2011,
  • [23] Using Reinforcement Learning and Error Models for Drone Precise Landing
    Saryazdi, Sepehr
    Alkouz, Balsam
    Bouguettaya, Athman
    Lakhdari, Abdallah
    ACM Transactions on Internet Technology, 2024, 24 (03)
  • [24] Angular Glint Error Simulation Using Attributed Scattering Center Models
    Guo, Kun-Yi
    Xiao, Guang-Liang
    Zhai, Yun
    Sheng, Xin-Qing
    IEEE ACCESS, 2018, 6 : 35194 - 35205
  • [25] Sparse Identification of Motor Learning Using Proxy Process Models
    Parmar, Pritesh N.
    Patton, James L.
    2019 IEEE 16TH INTERNATIONAL CONFERENCE ON REHABILITATION ROBOTICS (ICORR), 2019, : 855 - 860
  • [26] Distributed Power Allocation with SINR Constraints Using Trial and Error Learning
    Rose, Luca
    Perlaza, Samir M.
    Debbah, Merouane
    Le Martret, Christophe J.
    2012 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2012, : 1835 - 1840
  • [27] Machine Learning Approaches for Error Correction of Hydraulic Simulation Models for Canal Flow Schemes
    Torres-Rua, Alfonso F.
    Ticlavilca, Andres M.
    Walker, Wynn R.
    McKee, Mac
    JOURNAL OF IRRIGATION AND DRAINAGE ENGINEERING, 2012, 138 (11) : 999 - 1010
  • [28] Performance analysis of PID control in DC Brushless motor using trial and error method
    Sekarsari, K.
    Tata, T.
    5TH ANNUAL APPLIED SCIENCE AND ENGINEERING CONFERENCE (AASEC 2020), 2021, 1098
  • [29] Practical experiences in using the simulation method of learning the sportive technique in swimming
    Mihailescu, Liliana
    Dubit, Nicoleta Simona
    6TH INTERNATIONAL CONFERENCE EDU WORLD 2014: EDUCATION FACING CONTEMPORARY WORLD ISSUES, 2015, 180 : 1276 - 1282
  • [30] Learning How to Generate Kinesthetic Motor Imagery Using a BCI-based Learning Environment: a Comparative Study Based on Guided or Trial-and-Error Approaches
    Rimbert, Sebastien
    Bougrain, Laurent
    Fleck, Stephanie
    2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 2483 - 2489