Kicking Motion Design of Humanoid Robots Using Gradual Accumulation Learning Method Based on Q-learning

被引:0
|
作者
Wang, Jiawen [1 ]
Liang, Zhiwei [1 ]
Zhou, Zixuan [1 ]
Zhang, Yunfei [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210046, Jiangsu, Peoples R China
来源
PROCEEDINGS OF THE 28TH CHINESE CONTROL AND DECISION CONFERENCE (2016 CCDC) | 2016年
关键词
machine learning; Q-learning; kicking design; reinforcement learning;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper manly presented kicking design motion of humanoid robots using a reinforcement learning method which is based on the Q-learning. First, this method build a multidirectional fixed-point kicking model, which is based on the offset of kicking point, the foot space motion trajectory and ZMP stability criterion, and that makes subsequent train costs much less time. Besides, discretization of state set is also used to improve the training method. Compared to other machine learning algorithms, this method reduces the dimension of the system and solves the problem of excessive train when kicking in long distance. A series of experiments proves that the method described in this paper is feasible and effective.
引用
收藏
页码:5274 / 5279
页数:6
相关论文
共 50 条
  • [41] Navigation method for autonomous mobile robots based on ROS and multi-robot improved Q-learning
    Hamed, Oussama
    Hamlich, Mohamed
    PROGRESS IN ARTIFICIAL INTELLIGENCE, 2024,
  • [42] MOTION CONTROL OF A ROBOT BY MEANS OF Q-LEARNING USING THE EXAMPLE OF LOCOMOTION
    Bussmann, Tobias
    Schilberg, Daniel
    PROCEEDINGS OF ASME 2023 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2023, VOL 3, 2023,
  • [43] ADAPTIVE CONTENTION WINDOW DESIGN USING DEEP Q-LEARNING
    Kumar, Abhishek
    Verma, Gunjan
    Rao, Chirag
    Swami, Ananthram
    Segarra, Santiago
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4950 - 4954
  • [44] Technology Enhanced Learning Using Humanoid Robots
    Recupero, Diego Reforgiato
    FUTURE INTERNET, 2021, 13 (02): : 1 - 17
  • [45] Optimal path planning approach based on Q-learning algorithm for mobile robots
    Maoudj, Abderraouf
    Hentout, Abdelfetah
    APPLIED SOFT COMPUTING, 2020, 97
  • [46] Design of cognitive radar jamming based on Q-learning algorithm
    Li, Yun-Jie
    Zhu, Yun-Peng
    Gao, Mei-Guo
    Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2015, 35 (11): : 1194 - 1199
  • [47] Molecular design based on Q-learning and maximum likelihood estimation
    Liu, Ying
    Zhang, Bingfeng
    Zhao, Jun
    Wang, Wei
    Lv, Zheng
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 2119 - 2124
  • [48] Incremental Learning of Full Body Motion Primitives for Humanoid Robots
    Kulic, Dana
    Lee, Dongheui
    Ott, Christian
    Nakamura, Yoshihiko
    2008 8TH IEEE-RAS INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS 2008), 2008, : 508 - +
  • [49] Self-learning control of cooperative motion for humanoid robots
    School of Mechatronics, Changwon National University, 9 Sarimdong, Changwon 641-773, Korea, Republic of
    不详
    Int. J. Control Autom. Syst., 2006, 6 (725-735):
  • [50] Learning Complex Stand-Up Motion for Humanoid Robots
    Jeong, Heejin
    Lee, Daniel D.
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 4218 - 4219