Learning mixed behaviours with parallel Q-Learning

Cited by: 0

Authors:

Laurent, GJ [1]

Piat, E [1]

Affiliation:

[1] CNRS, Lab Automat Besancon, UMR 6596, F-25000 Besancon, France

Source:

2002 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-3, PROCEEDINGS | 2002

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

This paper presents a reinforcement learning algorithm based on a parallel approach to Watkins's Q-Learning. The algorithm is used to control a two-axis micro-manipulator system. The aim is to learn complex behaviours, such as reaching target positions and avoiding obstacles at the same time. Simulations and tests with the real manipulator show that the algorithm can learn opposing behaviours simultaneously and that it generates interesting action policies with regard to global path optimization.


Pages: 1002-1007

Number of pages: 6

