A robot demonstration method based on LWR and Q-learning algorithm

被引:4
|
作者
Zhao, Guangzhe [1 ,2 ,3 ]
Tao, Yong [4 ]
Liu, Hui [4 ]
Deng, Xianling [5 ]
Chen, Youdong [4 ]
Xiong, Hegen [6 ]
Xie, Xianwu [6 ]
Fang, Zengliang [4 ]
机构
[1] Beijing Univ Civil Engn & Architecture, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Yanbian Univ, Yanji, Peoples R China
[4] Beihang Univ, Beijing 100191, Peoples R China
[5] Chongqing Univ Sci & Technol, Chongqing, Peoples R China
[6] Wuhan Univ Sci & Technol, Wuhan, Hubei, Peoples R China
关键词
Reinforcement learning; Q-learning; locally weighted regression; program by demonstration;
D O I
10.3233/JIFS-169564
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A robot demonstration method is proposed based on the combination of locally weighted regression (LWR) and Q-learning algorithm. It is applied on a 6-DOF hitting-ball-system. This method can adapt to the work task by learning from demonstration and generating new actions. With the LWR algorithm, the mapping between target values and actions is established. According to deviation of landing position, a Q-learning algorithm is proposed to adjust the parameters of manipulator and compensate the errors caused by model and the controller. The model of LWR fits a local small space to approximate the global state and decision space. It turns out to reduce the dimension and simplify the training of Qlearning. The convergence rate is enhanced and the precision of performing task is improved. The simulation and experiment demonstrate the applicability of the proposed method.
引用
收藏
页码:35 / 46
页数:12
相关论文
共 50 条
  • [1] Mobile robot path planning based on Q-learning algorithm
    Li, Shaochuan
    Wang, Xuiqing
    Hu, Liwei
    Liu, Ying
    [J]. 2019 WORLD ROBOT CONFERENCE SYMPOSIUM ON ADVANCED ROBOTICS AND AUTOMATION (WRC SARA 2019), 2019, : 160 - 165
  • [2] Behavior Control Algorithm for Mobile Robot Based on Q-Learning
    Yang, Shiqiang
    Li, Congxiao
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTER NETWORK, ELECTRONIC AND AUTOMATION (ICCNEA), 2017, : 45 - 48
  • [3] A search and rescue robot search method based on flower pollination algorithm and Q-learning fusion algorithm
    Hao, Bing
    Zhao, Jianshuo
    Du, He
    Wang, Qi
    Yuan, Qi
    Zhao, Shuo
    [J]. PLOS ONE, 2023, 18 (03):
  • [4] Autonomous Navigation based on a Q-learning algorithm for a Robot in a Real Environment
    Strauss, Clement
    Sahin, Ferat
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEM OF SYSTEMS ENGINEERING (SOSE), 2008, : 361 - 365
  • [5] Study of Cooperation Strategy of Robot Based on Parallel Q-Learning Algorithm
    Wang, Shuda
    Si, Feng
    Yang, Jing
    Wang, Shuoning
    Yang, Jun
    [J]. INTELLIGENT ROBOTICS AND APPLICATIONS, PT I, PROCEEDINGS, 2008, 5314 : 633 - 642
  • [6] PATH PLANNING OF MOBILE ROBOT BASED ON THE IMPROVED Q-LEARNING ALGORITHM
    Chen, Chaorui
    Wang, Dongshu
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2022, 18 (03): : 687 - 702
  • [7] A Hybrid Fuzzy Q-Learning algorithm for robot navigation
    Gordon, Sean W.
    Reyes, Napoleon H.
    Barczak, Andre
    [J]. 2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 2625 - 2631
  • [8] Q-learning based method of adaptive path planning for mobile robot
    Li, Yibin
    Li, Caihong
    Zhang, Zijian
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON INFORMATION ACQUISITION, VOLS 1 AND 2, CONFERENCE PROCEEDINGS, 2006, : 983 - 987
  • [9] The Improved Q-Learning Algorithm based on Pheromone Mechanism for Swarm Robot System
    Shi, Zhiguo
    Tu, Jun
    Zhang, Qiao
    Zhang, Xiaomeng
    Wei, Junming
    [J]. 2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 6033 - 6038
  • [10] Hybrid control for robot navigation - A hierarchical Q-learning algorithm
    Chen, Chunlin
    Li, Han-Xiong
    Dong, Daoyi
    [J]. IEEE ROBOTICS & AUTOMATION MAGAZINE, 2008, 15 (02) : 37 - 47