Kicking Motion Design of Humanoid Robots Using Gradual Accumulation Learning Method Based on Q-learning

被引:0
|
作者
Wang, Jiawen [1 ]
Liang, Zhiwei [1 ]
Zhou, Zixuan [1 ]
Zhang, Yunfei [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210046, Jiangsu, Peoples R China
来源
PROCEEDINGS OF THE 28TH CHINESE CONTROL AND DECISION CONFERENCE (2016 CCDC) | 2016年
关键词
machine learning; Q-learning; kicking design; reinforcement learning;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper manly presented kicking design motion of humanoid robots using a reinforcement learning method which is based on the Q-learning. First, this method build a multidirectional fixed-point kicking model, which is based on the offset of kicking point, the foot space motion trajectory and ZMP stability criterion, and that makes subsequent train costs much less time. Besides, discretization of state set is also used to improve the training method. Compared to other machine learning algorithms, this method reduces the dimension of the system and solves the problem of excessive train when kicking in long distance. A series of experiments proves that the method described in this paper is feasible and effective.
引用
收藏
页码:5274 / 5279
页数:6
相关论文
共 50 条
  • [21] An Enhanced Ensemble Learning Method for Sentiment Analysis based on Q-learning
    Savargiv, Mohammad
    Masoumi, Behrooz
    Keyvanpour, Mohammad Reza
    IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY-TRANSACTIONS OF ELECTRICAL ENGINEERING, 2024, 48 (03) : 1261 - 1277
  • [22] Cyclic error correction based Q-learning for mobile robots navigation
    Rongkuan Tang
    Hongliang Yuan
    International Journal of Control, Automation and Systems, 2017, 15 : 1790 - 1798
  • [23] Cyclic Error Correction based Q-learning for Mobile Robots Navigation
    Tang, Rongkuan
    Yuan, Hongliang
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2017, 15 (04) : 1790 - 1798
  • [24] Hexagon-based Q-learning for object search with multiple robots
    Yoon, HU
    Sim, KB
    ADVANCES IN NATURAL COMPUTATION, PT 3, PROCEEDINGS, 2005, 3612 : 713 - 722
  • [25] Enhancing Nash Q-learning and Team Q-learning mechanisms by using bottlenecks
    Ghazanfari, Behzad
    Mozayani, Nasser
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2014, 26 (06) : 2771 - 2783
  • [26] Incremental Learning Framework for Autonomous Robots Based on Q-Learning and the Adaptive Kernel Linear Model
    Hu, Yanming
    Li, Decai
    He, Yuqing
    Han, Jianda
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (01) : 64 - 74
  • [27] A type of Q-learning method based on Elman network
    Liu Chang-you
    Sun Guang-yu
    Proceedings of 2004 Chinese Control and Decision Conference, 2004, : 562 - 564
  • [28] Cooperative Q-Learning Based on Learning Automata
    Yang, Mao
    Tian, Yantao
    Qi, Xinyue
    2009 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND LOGISTICS ( ICAL 2009), VOLS 1-3, 2009, : 1972 - 1977
  • [29] An Efficient Initialization Approach of Q-learning for Mobile Robots
    Song, Yong
    Li, Yi-bin
    Li, Cai-hong
    Zhang, Gui-fang
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2012, 10 (01) : 166 - 172
  • [30] An efficient initialization approach of Q-learning for mobile robots
    Yong Song
    Yi-bin Li
    Cai-hong Li
    Gui-fang Zhang
    International Journal of Control, Automation and Systems, 2012, 10 : 166 - 172