Neural Task Graphs: Generalizing to Unseen Tasks from a Single Video Demonstration

被引:40
|
作者
Huang, De-An [1 ]
Nair, Suraj [1 ]
Xu, Danfei [1 ]
Zhu, Yuke [1 ]
Garg, Animesh [1 ]
Li Fei-Fei [1 ]
Savarese, Silvio [1 ]
Niebles, Juan Carlos [1 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
关键词
D O I
10.1109/CVPR.2019.00876
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Our goal is to generate a policy to complete an unseen task given just a single video demonstration of the task in a given domain. We hypothesize that to successfully generalize to unseen complex tasks from a single video demonstration, it is necessary to explicitly incorporate the compositional structure of the tasks into the model. To this end, we propose Neural Task Graph (NTG) Networks, which use conjugate task graph as the intermediate representation to modularize both the video demonstration and the derived policy. We empirically show NTG achieves inter-task generalization on two complex tasks: Block Stacking in BulletPhysics and Object Collection in AI2-THOR. NTG improves data efficiency with visual input as well as achieve strong generalization without the need for dense hierarchical supervision. We further show that similar performance trends hold when applied to real-world data. We show that NTG can effectively predict task structure on the JIGSAWS surgical dataset and generalize to unseen tasks.
引用
收藏
页码:8557 / 8566
页数:10
相关论文
共 22 条
  • [1] Generalizing Topological Task Graphs From Multiple Symbolic Demonstrations in Programming by Demonstration (PbD) Processes
    Abbas, Tanveer
    MacDonald, Bruce A.
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2011,
  • [2] Learning tasks from a single demonstration
    Atkeson, CG
    Schaal, S
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION - PROCEEDINGS, VOLS 1-4, 1997, : 1706 - 1712
  • [3] Interactive Hierarchical Task Learning from a Single Demonstration
    Mohseni-Kabir, Anahita
    Rich, Charles
    Chernova, Sonia
    Sidner, Candace L.
    Miller, Daniel
    [J]. PROCEEDINGS OF THE 2015 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION (HRI'15), 2015, : 205 - 212
  • [4] Towards Learning to Imitate from a Single Video Demonstration
    Berseth, Glen
    Golemo, Florian
    Pal, Christopher
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24 : 1 - 26
  • [5] A ROS-integrated architecture to learn manipulation tasks from a single demonstration
    Peppoloni, Lorenzo
    Di Fava, Alessandro
    Ruffaldi, Emanuele
    Avizzano, Carlo Alberto
    [J]. 2014 23RD IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (IEEE RO-MAN), 2014, : 537 - 542
  • [6] Learning Behaviors from a Single Video Demonstration Using Human Feedback
    Gandhi, Sunil
    Oates, Tim
    Mohsenin, Tinoosh
    Waytowich, Nicholas R.
    [J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1970 - 1972
  • [7] SQUIRL: Robust and Efficient Learning from Video Demonstration of Long-Horizon Robotic Manipulation Tasks
    Wu, Bohan
    Xu, Feng
    He, Zhanpeng
    Gupta, Abhi
    Allen, Peter K.
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 9720 - 9727
  • [8] NeuMan: Neural Human Radiance Field from a Single Video
    Jiang, Wei
    Yi, Kwang Moo
    Samei, Golnoosh
    Tuzel, Oncel
    Ranjan, Anurag
    [J]. COMPUTER VISION - ECCV 2022, PT XXXII, 2022, 13692 : 402 - 418
  • [9] Neural Implicit Representations for Physical Parameter Inference from a Single Video
    Hofherr, Florian
    Koestler, Lukas
    Bernard, Florian
    Cremers, Daniel
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2092 - 2102
  • [10] Detection of preclinical neural dysfunction from functional connectivity graphs derived from task fMRI. An example from degeneration
    Vives-Gilabert, Yolanda
    Abdulkadir, Ahmed
    Kaller, Christoph P.
    Mader, Wolfgang
    Wolf, Robert C.
    Schelter, Bjoern
    Kloeppel, Stefan
    [J]. PSYCHIATRY RESEARCH-NEUROIMAGING, 2013, 214 (03) : 322 - 330