AnyMorph: Learning Transferable Polices By Inferring Agent Morphology

Citations: 0
Authors
Trabucco, Brandon [1, 2]
Phielipp, Mariano [2]
Berseth, Glen [3]
Affiliations
[1] Carnegie Mellon Univ, Machine Learning Dept, Pittsburgh, PA 15213 USA
[2] Intel AI, San Diego, CA USA
[3] Mila, Montreal, PQ, Canada
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The prototypical approach to reinforcement learning involves training policies tailored to a particular agent from scratch for every new morphology. Recent work aims to eliminate the re-training of policies by investigating whether a morphology-agnostic policy, trained on a diverse set of agents with similar task objectives, can be transferred to new agents with unseen morphologies without re-training. This is a challenging problem that required previous approaches to use hand-designed descriptions of the new agent's morphology. Instead of hand-designing this description, we propose a data-driven method that learns a representation of morphology directly from the reinforcement learning objective. Ours is the first reinforcement learning algorithm that can train a policy to generalize to new agent morphologies without requiring a description of the agent's morphology in advance. We evaluate our approach on the standard benchmark for agent-agnostic control, and improve over the current state of the art in zero-shot generalization to new agents. Importantly, our method attains good performance without an explicit description of morphology.
Pages: 15