ROBOPIANIST: Dexterous Piano Playing with Deep Reinforcement Learning

Authors
Zakka, Kevin [1, 2]
Wu, Philipp [1]
Smith, Laura [1]
Gileadi, Nimrod [2]
Howell, Taylor [3]
Peng, Xue Bin [4]
Singh, Sumeet [2]
Tassa, Yuval [2]
Florence, Pete [2]
Zeng, Andy [2]
Abbeel, Pieter [1]
Affiliations
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Google DeepMind, London, England
[3] Stanford Univ, Stanford, CA 94305 USA
[4] Simon Fraser Univ, Burnaby, BC, Canada
Keywords
high-dimensional control; bi-manual dexterity; manipulation
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Replicating human-like dexterity in robot hands represents one of the largest open problems in robotics. Reinforcement learning is a promising approach that has achieved impressive progress in the last few years; however, the class of problems it has typically addressed corresponds to a rather narrow definition of dexterity as compared to human capabilities. To address this gap, we investigate piano-playing, a skill that challenges even the human limits of dexterity, as a means to test high-dimensional control, and which requires high spatial and temporal precision, and complex finger coordination and planning. We introduce ROBOPIANIST, a system that enables simulated anthropomorphic hands to learn an extensive repertoire of 150 piano pieces where traditional model-based optimization struggles. We additionally introduce an open-sourced environment, benchmark of tasks, interpretable evaluation metrics, and open challenges for future study. Our website featuring videos, code, and datasets is available at https://kzakka.com/robopianist/.
Pages: 20