Kicking Motion Design of Humanoid Robots Using Gradual Accumulation Learning Method Based on Q-learning

被引:0
|
作者
Wang, Jiawen [1 ]
Liang, Zhiwei [1 ]
Zhou, Zixuan [1 ]
Zhang, Yunfei [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210046, Jiangsu, Peoples R China
关键词
machine learning; Q-learning; kicking design; reinforcement learning;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper manly presented kicking design motion of humanoid robots using a reinforcement learning method which is based on the Q-learning. First, this method build a multidirectional fixed-point kicking model, which is based on the offset of kicking point, the foot space motion trajectory and ZMP stability criterion, and that makes subsequent train costs much less time. Besides, discretization of state set is also used to improve the training method. Compared to other machine learning algorithms, this method reduces the dimension of the system and solves the problem of excessive train when kicking in long distance. A series of experiments proves that the method described in this paper is feasible and effective.
引用
收藏
页码:5274 / 5279
页数:6
相关论文
共 50 条
  • [1] Model-based Q-Learning for Humanoid Robots
    Le, Than D.
    Le, An T.
    Nguyen, Duy T.
    2017 18TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), 2017, : 608 - 613
  • [2] Learning Motion Policy for Mobile Robots using Deep Q-Learning
    Kwak, Nosan
    Yoon, Sukjune
    Roh, Kyungshik
    PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), 2017, : 805 - 810
  • [3] Intelligent Fuzzy Q-Learning control of humanoid robots
    Er, MJ
    Zhou, Y
    ADVANCES IN NEURAL NETWORKS - ISNN 2005, PT 3, PROCEEDINGS, 2005, 3498 : 216 - 221
  • [4] A SHOOTING STRATEGY WHEN MOVING ON HUMANOID ROBOTS USING INVERSE KINEMATICS AND Q-LEARNING
    Rezaeipanah, Amin
    Jamshidi, Zahra
    Jafari, Shahram
    INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION, 2021, 36 (03): : 133 - 139
  • [5] A Hybrid Q-learning Algorithm to Score a Moving Ball for Humanoid Robots
    Jafari, Masoumeh
    Saeedvand, Saeed
    Aghdasi, Hadi S.
    2019 IEEE 5TH CONFERENCE ON KNOWLEDGE BASED ENGINEERING AND INNOVATION (KBEI 2019), 2019, : 498 - 503
  • [6] Q-learning based univector field navigation method for mobile robots
    Vien, Ngo Anh
    Viet, Nguyen Hoang
    Park, HyunJeong
    Lee, SeungGwan
    Chung, TaeChoong
    ADVANCES AND INNOVATIONS IN SYSTEMS, COMPUTING SCIENCES AND SOFTWARE ENGINEERING, 2007, : 463 - +
  • [7] Dynamic fuzzy q-learning control of humanoid robots for automatic gait synthesis
    Zhou, Yi
    Er, Meng Joo
    International Journal of Fuzzy Systems, 2006, 8 (04) : 190 - 199
  • [8] Motion control for humanoid robots based on the concept learning
    Kuwayama, K
    Kato, S
    Seki, H
    Yamakita, T
    Itoh, H
    MHS2003: PROCEEDINGS OF 2003 INTERNATIONAL SYMPOSIUM ON MICROMECHATRONICS AND HUMAN SCIENCE, 2003, : 259 - 263
  • [9] Power Usage Reduction of Humanoid Standing Process Using Q-Learning
    Elibol, Ercan
    Calderon, Juan
    Llofriu, Martin
    Quintero, Carlos
    Moreno, Wilfrido
    Weitzenfeld, Alfredo
    ROBOCUP 2015: ROBOT WORLD CUP XIX, 2015, 9513 : 251 - 263
  • [10] Study on motion forms of mobile robots generated by Q-Learning process based on reward databases
    Hara, Masayuki
    Inoue, Masashi
    Motoyama, Haruhisa
    Huang, Jian
    Yabuta, Tetsuro
    2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS, 2006, : 5112 - +