Kicking Motion Design of Humanoid Robots Using Gradual Accumulation Learning Method Based on Q-learning

Cited by: 0

Authors:

Wang, Jiawen [1]

Liang, Zhiwei [1]

Zhou, Zixuan [1]

Zhang, Yunfei [1]

Affiliations:

[1] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210046, Jiangsu, Peoples R China

Source:

PROCEEDINGS OF THE 28TH CHINESE CONTROL AND DECISION CONFERENCE (2016 CCDC) | 2016

Keywords:

machine learning; Q-learning; kicking design; reinforcement learning;

DOI:

Not available

Chinese Library Classification:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

This paper mainly presents a kicking motion design for humanoid robots using a reinforcement learning method based on Q-learning. First, the method builds a multidirectional fixed-point kicking model based on the offset of the kicking point, the spatial motion trajectory of the foot, and the ZMP (zero moment point) stability criterion, which makes subsequent training take much less time. In addition, discretization of the state set is used to improve the training method. Compared with other machine learning algorithms, this method reduces the dimension of the system and solves the problem of excessive training when kicking over long distances. A series of experiments shows that the method described in this paper is feasible and effective.

Citation

Pages: 5274 - 5279

Page count: 6

50 items in total

[21] An Enhanced Ensemble Learning Method for Sentiment Analysis based on Q-learning
Savargiv, Mohammad
Masoumi, Behrooz
Keyvanpour, Mohammad Reza
IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY-TRANSACTIONS OF ELECTRICAL ENGINEERING, 2024, 48 (03) : 1261 - 1277
[22] Cyclic error correction based Q-learning for mobile robots navigation
Rongkuan Tang
Hongliang Yuan
International Journal of Control, Automation and Systems, 2017, 15 : 1790 - 1798
[23] Cyclic Error Correction based Q-learning for Mobile Robots Navigation
Tang, Rongkuan
Yuan, Hongliang
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2017, 15 (04) : 1790 - 1798
[24] Hexagon-based Q-learning for object search with multiple robots
Yoon, HU
Sim, KB
ADVANCES IN NATURAL COMPUTATION, PT 3, PROCEEDINGS, 2005, 3612 : 713 - 722
[25] Enhancing Nash Q-learning and Team Q-learning mechanisms by using bottlenecks
Ghazanfari, Behzad
Mozayani, Nasser
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2014, 26 (06) : 2771 - 2783
[26] Incremental Learning Framework for Autonomous Robots Based on Q-Learning and the Adaptive Kernel Linear Model
Hu, Yanming
Li, Decai
He, Yuqing
Han, Jianda
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (01) : 64 - 74
[27] A type of Q-learning method based on Elman network
Liu Chang-you
Sun Guang-yu
Proceedings of 2004 Chinese Control and Decision Conference, 2004, : 562 - 564
[28] Cooperative Q-Learning Based on Learning Automata
Yang, Mao
Tian, Yantao
Qi, Xinyue
2009 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND LOGISTICS ( ICAL 2009), VOLS 1-3, 2009, : 1972 - 1977
[29] An Efficient Initialization Approach of Q-learning for Mobile Robots
Song, Yong
Li, Yi-bin
Li, Cai-hong
Zhang, Gui-fang
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2012, 10 (01) : 166 - 172
[30] An efficient initialization approach of Q-learning for mobile robots
Yong Song
Yi-bin Li
Cai-hong Li
Gui-fang Zhang
International Journal of Control, Automation and Systems, 2012, 10 : 166 - 172
