BIPED WALKING PATTERN GENERATION USING REINFORCEMENT LEARNING

被引:0
|
作者
Lee, Jungho [1 ]
Oh, Jun Ho [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Humanoid Robot Res Ctr, Taejon 305701, South Korea
关键词
Biped walking; reinforcement learning; walking pattern; robot learning; humanoid robot; DYNAMIC WALKING;
D O I
10.1142/S021984360900167X
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In this research, a stable biped walking pattern is generated using reinforcement learning. The biped walking pattern is chosen as a simple third order polynomial. To complete it, four boundary conditions are needed. The initial position and velocity and the final position and velocity of the joint are selected as boundary conditions. In order to find the proper boundary condition value, a reinforcement learning algorithm is used. Also desired motion or posture can be achieved using the initial and final positions. The final velocity of the walking pattern is chosen as a learning parameter. To test the algorithm, a simulator that takes into consideration the whole model of the robot and the environment is developed. The algorithm is verified through a simulation.
引用
收藏
页码:1 / 21
页数:21
相关论文
共 50 条
  • [31] ONLINE WALKING PATTERN GENERATION AND CONTROL FOR BIPED ROBOT WITH CONTACT CONSISTENCY
    Hou, Wenqi
    Wang, Jian
    Wang, Jianwen
    Ma, Hongxu
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION, 2015, 30 (04): : 407 - 416
  • [32] A realtime pattern generator for biped walking
    Kajita, S
    Kanehiro, F
    Kaneko, K
    Fujiwara, K
    Yokoi, K
    Hirukawa, H
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS I-IV, PROCEEDINGS, 2002, : 31 - 37
  • [33] Reinforcement learning for quasi-passive dynamic walking of an unstable biped robot
    Hitomi, Kentarou
    Shibata, Tomohiro
    Nakamura, Yutaka
    Ishii, Shin
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2006, 54 (12) : 982 - 988
  • [34] Walking Pattern Modification using Manipulability Ellipsoid for Biped Robot
    Maneewarn, Thavida
    Boonprakob, Apinya
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS, VOLS 1-4, 2009, : 160 - 165
  • [35] ZMP Based Pattern Generation for Biped Walking Using Optimal Preview Integral Sliding Mode Control
    Williams, Markimba M.
    Loukianov, Alexander G.
    Bayro-Corrochano, E.
    [J]. 2013 13TH IEEE-RAS INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2013, : 100 - 105
  • [36] Neuro-fuzzy gait synthesis with reinforcement learning for a biped walking robot
    Zhou C.
    [J]. Soft Computing, 2000, 4 (04) : 238 - 250
  • [37] Optimal three-dimensional biped walking pattern generation based on geodesics
    Zhang, Liandong
    Zhou, Changjiu
    [J]. INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2017, 14 (02):
  • [38] Generation of adaptive splitbelt treadmill walking of a biped robot using learning of intralimb and interlimb coordinations
    Fujiki, Soichiro
    Aoi, Shinya
    Senda, Kei
    Tsuchiya, Kazuo
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 2392 - 2397
  • [39] A walking pattern generation for biped robot with parallel mechanism by considering contact force
    Morisawa, M
    Fujimoto, Y
    Murakami, T
    Ohnishi, K
    [J]. IECON'01: 27TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, VOLS 1-3, 2001, : 2184 - 2189
  • [40] Reinforcement learning for biped locomotion
    Sato, M
    Nakamura, Y
    Ishii, S
    [J]. ARTIFICIAL NEURAL NETWORKS - ICANN 2002, 2002, 2415 : 777 - 782