Kicking Motion Design of Humanoid Robots Using Gradual Accumulation Learning Method Based on Q-learning

被引：0

作者：

Wang, Jiawen ^{[1
]}

Liang, Zhiwei ^{[1
]}

Zhou, Zixuan ^{[1
]}

Zhang, Yunfei ^{[1
]}

机构：

[1] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210046, Jiangsu, Peoples R China

来源：

PROCEEDINGS OF THE 28TH CHINESE CONTROL AND DECISION CONFERENCE (2016 CCDC) | 2016年

关键词：

machine learning; Q-learning; kicking design; reinforcement learning;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper manly presented kicking design motion of humanoid robots using a reinforcement learning method which is based on the Q-learning. First, this method build a multidirectional fixed-point kicking model, which is based on the offset of kicking point, the foot space motion trajectory and ZMP stability criterion, and that makes subsequent train costs much less time. Besides, discretization of state set is also used to improve the training method. Compared to other machine learning algorithms, this method reduces the dimension of the system and solves the problem of excessive train when kicking in long distance. A series of experiments proves that the method described in this paper is feasible and effective.

引用

页码：5274 / 5279

页数：6

共 50 条

[41] Navigation method for autonomous mobile robots based on ROS and multi-robot improved Q-learning
Hamed, Oussama
Hamlich, Mohamed
PROGRESS IN ARTIFICIAL INTELLIGENCE, 2024,
[42] MOTION CONTROL OF A ROBOT BY MEANS OF Q-LEARNING USING THE EXAMPLE OF LOCOMOTION
Bussmann, Tobias
Schilberg, Daniel
PROCEEDINGS OF ASME 2023 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2023, VOL 3, 2023,
[43] ADAPTIVE CONTENTION WINDOW DESIGN USING DEEP Q-LEARNING
Kumar, Abhishek
Verma, Gunjan
Rao, Chirag
Swami, Ananthram
Segarra, Santiago
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4950 - 4954
[44] Technology Enhanced Learning Using Humanoid Robots
Recupero, Diego Reforgiato
FUTURE INTERNET, 2021, 13 (02): : 1 - 17
[45] Optimal path planning approach based on Q-learning algorithm for mobile robots
Maoudj, Abderraouf
Hentout, Abdelfetah
APPLIED SOFT COMPUTING, 2020, 97
[46] Design of cognitive radar jamming based on Q-learning algorithm
Li, Yun-Jie
Zhu, Yun-Peng
Gao, Mei-Guo
Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2015, 35 (11): : 1194 - 1199
[47] Molecular design based on Q-learning and maximum likelihood estimation
Liu, Ying
Zhang, Bingfeng
Zhao, Jun
Wang, Wei
Lv, Zheng
PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 2119 - 2124
[48] Incremental Learning of Full Body Motion Primitives for Humanoid Robots
Kulic, Dana
Lee, Dongheui
Ott, Christian
Nakamura, Yoshihiko
2008 8TH IEEE-RAS INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS 2008), 2008, : 508 - +
[49] Self-learning control of cooperative motion for humanoid robots
School of Mechatronics, Changwon National University, 9 Sarimdong, Changwon 641-773, Korea, Republic of
不详
Int. J. Control Autom. Syst., 2006, 6 (725-735):
[50] Learning Complex Stand-Up Motion for Humanoid Robots
Jeong, Heejin
Lee, Daniel D.
THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 4218 - 4219

← 1 2 3 4 5 →