Continuous Control with Action Quantization from Demonstrations

被引:0
|
作者
Dadashi, Robert [1 ]
Hussenot, Leonard [1 ,2 ]
Vincent, Damien [1 ]
Girgin, Sertan [1 ]
Raichuk, Anton [1 ]
Geist, Matthieu [1 ]
Pietquin, Olivier [1 ]
机构
[1] Google REs, Brain Team, Mountain View, CA 94043 USA
[2] Univ Lille, CNRS, UMR CRIStAL 9189, Inria Scool, Lille, France
关键词
NEURAL-NETWORKS; LEVEL;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel Reinforcement Learning (RL) framework for problems with continuous action spaces: Action Quantization from Demonstrations (AQuaDem). The proposed approach consists in learning a discretization of continuous action spaces from human demonstrations. This discretization returns a set of plausible actions (in light of the demonstrations) for each input state, thus capturing the priors of the demonstrator and their multimodal behavior. By discretizing the action space, any discrete action deep RL technique can be readily applied to the continuous control problem. Experiments show that the proposed approach outperforms state-of-the-art methods such as SAC in the RL setup, and GAIL in the Imitation Learning setup. We provide a website with interactive videos: https://google- research. github.io/aquadem/ and make the code available: https://github.com/ google- research/google- research/ tree/master/aquadem.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Action Recognition and Interpretation from Virtual Demonstrations
    Haidu, Andrei
    Beetz, Michael
    2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 2833 - 2838
  • [2] Learning Traffic Signal Control from Demonstrations
    Xiong, Yuanhao
    Zheng, Guanjie
    Xu, Kai
    Li, Zhenhui
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2289 - 2292
  • [3] Learning the Stiffness of a Continuous Soft Manipulator from Multiple Demonstrations
    Bruno, Danilo
    Calinon, Sylvain
    Malekzadeh, Milad S.
    Caldwell, Darwin G.
    INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2015), PT III, 2015, 9246 : 185 - 195
  • [4] PERCEPTION OF A THROWING ACTION FROM POINT-LIGHT DEMONSTRATIONS
    WILLIAMS, JG
    PERCEPTUAL AND MOTOR SKILLS, 1988, 67 (01) : 273 - 274
  • [5] Quantization from an exponential distribution of infinitesimal action
    Budiyono, Agung
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2013, 392 (02) : 307 - 313
  • [6] Quantum mechanics from the quantization of the action variable
    Hushwater, V
    FORTSCHRITTE DER PHYSIK-PROGRESS OF PHYSICS, 1998, 46 (6-8): : 863 - 871
  • [7] TRACKING ACTION IMPROVES CONTINUOUS CONTROL
    COX, RK
    SHUNTA, JP
    CHEMICAL ENGINEERING PROGRESS, 1973, 69 (09) : 56 - 61
  • [8] Inverse Optimal Control from Demonstrations with Mixed Qualities
    Lee, Kyungjae
    Choi, Yunho
    Oh, Songhwai
    2020 17TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS (UR), 2020, : 581 - 586
  • [9] Learning Continuous Grasping Function With a Dexterous Hand From Human Demonstrations
    Ye, Jianglong
    Wang, Jiashun
    Huang, Binghao
    Qin, Yuzhe
    Wang, Xiaolong
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (05) : 2882 - 2889
  • [10] Task Parameterization Using Continuous Constraints Extracted From Human Demonstrations
    Ureche, Ana Lucia Pais
    Umezawa, Keisuke
    Nakamura, Yoshihiko
    Billard, Aude
    IEEE TRANSACTIONS ON ROBOTICS, 2015, 31 (06) : 1458 - 1471