Neural Dynamic Policies for End-to-End Sensorimotor Learning

被引:0
|
作者
Bahl, Shikhar [1 ]
Mukadam, Mustafa [2 ]
Gupta, Abhinav [1 ]
Pathak, Deepak [1 ]
机构
[1] CMU, Pittsburgh, PA 15213 USA
[2] FAIR, Seattle, WA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The current dominant paradigm in sensorimotor control, whether imitation or reinforcement learning, is to train policies directly in raw action spaces such as torque, joint angle, or end-effector position. This forces the agent to make decision at each point in training, and hence, limit the scalability to continuous, high-dimensional, and long-horizon tasks. In contrast, research in classical robotics has, for a long time, exploited dynamical systems as a policy representation to learn robot behaviors via demonstrations. These techniques, however, lack the flexibility and generalizability provided by deep learning or deep reinforcement learning and have remained under-explored in such settings. In this work, we begin to close this gap and embed dynamics structure into deep neural network-based policies by reparameterizing action spaces with differential equations. We propose Neural Dynamic Policies (NDPs) that make predictions in trajectory distribution space as opposed to prior policy learning methods where action represents the raw control space. The embedded structure allow us to perform end-to-end policy learning under both reinforcement and imitation learning setups. We show that NDPs achieve better or comparable performance to state-of-the-art approaches on many robotic control tasks using both reward-based training and demonstrations. Project video and code are available at: https://shikharbahl.github.io/neural-dynamic-policies/.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] End-to-End Learning Driver Policy using Moments Deep Neural Network
    Qian, Deheng
    Ren, Dongchun
    Meng, Yingying
    Zhu, Yanliang
    Ding, Shuguang
    Fu, Sheng
    Wang, Zhichao
    Xia, Huaxia
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2018, : 1533 - 1538
  • [32] An End-to-End Encrypted Neural Network for Gradient Updates Transmission in Federated Learning
    Li, Hongyu
    Han, Tianqi
    2019 DATA COMPRESSION CONFERENCE (DCC), 2019, : 589 - 589
  • [33] A Vocabulary-Free Multilingual Neural Tokenizer for End-to-End Task Learning
    Islam, Md Mofijul
    Aguilar, Gustavo
    Ponnusamy, Pragaash
    Mathialagan, Clint Solomon
    Ma, Chengyuan
    Guo, Chenlei
    PROCEEDINGS OF THE 7TH WORKSHOP ON REPRESENTATION LEARNING FOR NLP, 2022, : 91 - 99
  • [34] Leukocyte Segmentation via End-to-End Learning of Deep Convolutional Neural Networks
    Lu, Yan
    Fan, Haoyi
    Li, Zuoyong
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: VISUAL DATA ENGINEERING, PT I, 2019, 11935 : 191 - 200
  • [35] CONVOLUTIONAL ANALYSIS OPERATOR LEARNING BY END-TO-END TRAINING OF ITERATIVE NEURAL NETWORKS
    Kofler, Andreas
    Wald, Christian
    Schaeffter, Tobias
    Haltmeier, Markus
    Kolbitsch, Christoph
    2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022,
  • [36] END-TO-END LEARNING OF COMPRESSIBLE FEATURES
    Singh, Saurabh
    Abu-El-Haija, Sami
    Johnston, Nick
    Balle, Johannes
    Shrivastava, Abhinav
    Toderici, George
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 3349 - 3353
  • [37] QTN-VQC: an end-to-end learning framework for quantum neural networks
    Qi, Jun
    Yang, Chao-Han
    Chen, Pin-Yu
    PHYSICA SCRIPTA, 2024, 99 (01)
  • [38] A Theoretical Framework for End-to-End Learning of Deep Neural Networks With Applications to Robotics
    Li, Sitan
    Nguyen, Huu-Thiet
    Cheah, Chien Chern
    IEEE ACCESS, 2023, 11 : 21992 - 22006
  • [39] End-to-end Learning Approach for Autonomous Driving: A Convolutional Neural Network Model
    Wang, Yaqin
    Liu, Dongfang
    Jeon, Hyewon
    Chu, Zhiwei
    Matson, Eric T.
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2, 2019, : 833 - 839
  • [40] End-to-End Visuomotor Learning of Drawing Sequences using Recurrent Neural Networks
    Sasaki, Kazuma
    Ogata, Tetsuya
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,