Robotic Table Tennis with Model-Free Reinforcement Learning

被引:9
|
作者
Gao, Wenbo [3 ]
Graesser, Laura [1 ]
Choromanski, Krzysztof [1 ]
Song, Xingyou [1 ]
Lazic, Nevena [2 ]
Sanketi, Pannag [1 ]
Sindhwani, Vikas [1 ]
Jaitly, Navdeep [4 ]
机构
[1] Google, Robot, Mountain View, CA 94043 USA
[2] DeepMind, London, England
[3] Columbia Univ, New York, NY 10027 USA
[4] Google, Mountain View, CA 94043 USA
关键词
D O I
10.1109/IROS45743.2020.9341191
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a model-free algorithm for learning efficient policies capable of returning table tennis balls by controlling robot joints at a rate of 100Hz. We demonstrate that evolutionary search (ES) methods acting on CNN-based policy architectures for non-visual inputs and convolving across time learn compact controllers leading to smooth motions. Furthermore, we show that with appropriately tuned curriculum learning on the task and rewards, policies are capable of developing multi-modal styles, specifically forehand and backhand stroke, whilst achieving 80% return rate on a wide range of ball throws. We observe that multi-modality does not require any architectural priors, such as multi-head architectures or hierarchical policies.
引用
收藏
页码:5556 / 5563
页数:8
相关论文
共 50 条
  • [1] A Model-free Approach to Stroke Learning for Robotic Table Tennis
    Gao, Yapeng
    Tebbe, Jonas
    Zell, Andreas
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [2] MODEL-FREE ONLINE REINFORCEMENT LEARNING OF A ROBOTIC MANIPULATOR
    Sweafford, Jerry, Jr.
    Fahimi, Farbod
    [J]. MECHATRONIC SYSTEMS AND CONTROL, 2019, 47 (03): : 136 - 143
  • [3] Reinforcement learning with model-based feedforward inputs for robotic table tennis
    Ma, Hao
    Buechler, Dieter
    Schoelkopf, Bernhard
    Muehlebach, Michael
    [J]. AUTONOMOUS ROBOTS, 2023, 47 (08) : 1387 - 1403
  • [4] Reinforcement learning with model-based feedforward inputs for robotic table tennis
    Hao Ma
    Dieter Büchler
    Bernhard Schölkopf
    Michael Muehlebach
    [J]. Autonomous Robots, 2023, 47 : 1387 - 1403
  • [5] Sample-efficient Reinforcement Learning in Robotic Table Tennis
    Tebbe, Jonas
    Krauch, Lukas
    Gao, Yapeng
    Zell, Andreas
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 4171 - 4178
  • [6] A Model-free Deep Reinforcement Learning Approach for Robotic Manipulators Path Planning
    Liu, Wenxing
    Niu, Hanlin
    Mahyuddin, Muhammad Nasiruddin
    Herrmann, Guido
    Carrasco, Joaquin
    [J]. 2021 21ST INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2021), 2021, : 512 - 517
  • [7] Reinforcement Learning for VR Table Tennis
    Seah, H. S.
    Jiang, D. Q.
    Tandianus, B.
    Sui, Y. L.
    Wang, H.
    [J]. INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY, IWAIT 2024, 2024, 13164
  • [8] A learning approach to robotic table tennis
    Matsushima, M
    Hashimoto, T
    Takeuchi, M
    Miyazaki, F
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2005, 21 (04) : 767 - 771
  • [9] Reinforcement Learning-Based Model-Free Controller for Feedback Stabilization of Robotic Systems
    Singh, Rupam
    Bhushan, Bharat
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 7059 - 7073
  • [10] Learning Representations in Model-Free Hierarchical Reinforcement Learning
    Rafati, Jacob
    Noelle, David C.
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 10009 - 10010