Robotic Table Tennis with Model-Free Reinforcement Learning

被引：9

作者：

Gao, Wenbo ^{[3
]}

Graesser, Laura ^{[1
]}

Choromanski, Krzysztof ^{[1
]}

Song, Xingyou ^{[1
]}

Lazic, Nevena ^{[2
]}

Sanketi, Pannag ^{[1
]}

Sindhwani, Vikas ^{[1
]}

Jaitly, Navdeep ^{[4
]}

机构：

[1] Google, Robot, Mountain View, CA 94043 USA

[2] DeepMind, London, England

[3] Columbia Univ, New York, NY 10027 USA

[4] Google, Mountain View, CA 94043 USA

来源：

2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2020年

关键词：

D O I：

10.1109/IROS45743.2020.9341191

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We propose a model-free algorithm for learning efficient policies capable of returning table tennis balls by controlling robot joints at a rate of 100Hz. We demonstrate that evolutionary search (ES) methods acting on CNN-based policy architectures for non-visual inputs and convolving across time learn compact controllers leading to smooth motions. Furthermore, we show that with appropriately tuned curriculum learning on the task and rewards, policies are capable of developing multi-modal styles, specifically forehand and backhand stroke, whilst achieving 80% return rate on a wide range of ball throws. We observe that multi-modality does not require any architectural priors, such as multi-head architectures or hierarchical policies.

引用

页码：5556 / 5563

页数：8

共 50 条

[1] A Model-free Approach to Stroke Learning for Robotic Table Tennis
Gao, Yapeng
Tebbe, Jonas
Zell, Andreas
[J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[2] MODEL-FREE ONLINE REINFORCEMENT LEARNING OF A ROBOTIC MANIPULATOR
Sweafford, Jerry, Jr.
Fahimi, Farbod
[J]. MECHATRONIC SYSTEMS AND CONTROL, 2019, 47 (03): : 136 - 143
[3] Reinforcement learning with model-based feedforward inputs for robotic table tennis
Ma, Hao
Buechler, Dieter
Schoelkopf, Bernhard
Muehlebach, Michael
[J]. AUTONOMOUS ROBOTS, 2023, 47 (08) : 1387 - 1403
[4] Reinforcement learning with model-based feedforward inputs for robotic table tennis
Hao Ma
Dieter Büchler
Bernhard Schölkopf
Michael Muehlebach
[J]. Autonomous Robots, 2023, 47 : 1387 - 1403
[5] Sample-efficient Reinforcement Learning in Robotic Table Tennis
Tebbe, Jonas
Krauch, Lukas
Gao, Yapeng
Zell, Andreas
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 4171 - 4178
[6] A Model-free Deep Reinforcement Learning Approach for Robotic Manipulators Path Planning
Liu, Wenxing
Niu, Hanlin
Mahyuddin, Muhammad Nasiruddin
Herrmann, Guido
Carrasco, Joaquin
[J]. 2021 21ST INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2021), 2021, : 512 - 517
[7] Reinforcement Learning for VR Table Tennis
Seah, H. S.
Jiang, D. Q.
Tandianus, B.
Sui, Y. L.
Wang, H.
[J]. INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY, IWAIT 2024, 2024, 13164
[8] A learning approach to robotic table tennis
Matsushima, M
Hashimoto, T
Takeuchi, M
Miyazaki, F
[J]. IEEE TRANSACTIONS ON ROBOTICS, 2005, 21 (04) : 767 - 771
[9] Reinforcement Learning-Based Model-Free Controller for Feedback Stabilization of Robotic Systems
Singh, Rupam
Bhushan, Bharat
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 7059 - 7073
[10] Learning Representations in Model-Free Hierarchical Reinforcement Learning
Rafati, Jacob
Noelle, David C.
[J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 10009 - 10010

← 1 2 3 4 5 →