Learning mixed behaviours with parallel Q-Learning

被引:0
|
作者
Laurent, GJ [1 ]
Piat, E [1 ]
机构
[1] CNRS, Lab Automat Besancon, UMR 6596, F-25000 Besancon, France
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a reinforcement learning algorithm based on a parallel approach of the Watkins's Q-Learning. This algorithm is used to control a two axis micro-manipulator system. The aim is to learn complex behaviours as reaching target positions and avoiding obstacles at the same time. The simulations and the tests with the real manipulator show that this algorithm is able to learn simultaneously opposite behaviours and that it generates interesting action policies with regard to the global path optimization.
引用
收藏
页码:1002 / 1007
页数:6
相关论文
共 50 条
  • [41] Cooperative Q-Learning Based on Learning Automata
    Yang, Mao
    Tian, Yantao
    Qi, Xinyue
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND LOGISTICS ( ICAL 2009), VOLS 1-3, 2009, : 1972 - 1977
  • [42] Introspective Q-learning and learning from demonstration
    Li, Mao
    Brys, Tim
    Kudenko, Daniel
    [J]. KNOWLEDGE ENGINEERING REVIEW, 2019, 34
  • [43] Study of Cooperation Strategy of Robot Based on Parallel Q-Learning Algorithm
    Wang, Shuda
    Si, Feng
    Yang, Jing
    Wang, Shuoning
    Yang, Jun
    [J]. INTELLIGENT ROBOTICS AND APPLICATIONS, PT I, PROCEEDINGS, 2008, 5314 : 633 - 642
  • [44] Q-learning with Logarithmic Regret
    Yang, Kunhe
    Yang, Lin F.
    Du, Simon S.
    [J]. 24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [45] Two mode Q-learning
    Park, KH
    Kim, JH
    [J]. CEC: 2003 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-4, PROCEEDINGS, 2003, : 2449 - 2454
  • [46] Underestimation estimators to Q-learning
    Abliz, Patigul
    Ying, Shi
    [J]. INFORMATION SCIENCES, 2022, 607 : 173 - 185
  • [47] Q-learning with Nearest Neighbors
    Shah, Devavrat
    Xie, Qiaomin
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [48] q-Learning in Continuous Time
    Jia, Yanwei
    Zhou, Xun Yu
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [49] Q-Learning With Kalman Filters
    Riakaiiaia, Kei
    Miura, Takao
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 2939 - 2947
  • [50] Distributionally Robust Q-Learning
    Liu, Zijian
    Bai, Qinxun
    Blanchet, Jose
    Dong, Perry
    Xu, Wei
    Zhou, Zhengqing
    Zhou, Zhengyuan
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,