Vision-Based Navigation With Language-Based Assistance via Imitation Learning With Indirect Intervention

Cited by: 47
Authors
Nguyen, Khanh [1]
Dey, Debadeepta [2]
Brockett, Chris [2]
Dolan, Bill [2]
Affiliations
[1] Univ Maryland, College Pk, MD 20742 USA
[2] Microsoft Res, Redmond, WA USA
DOI
10.1109/CVPR.2019.01281
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present Vision-based Navigation with Language-based Assistance (VNLA), a grounded vision-language task where an agent with visual perception is guided via language to find objects in photorealistic indoor environments. The task emulates a real-world scenario in that (a) the requester may not know how to navigate to the target objects and thus makes requests by only specifying high-level end-goals, and (b) the agent is capable of sensing when it is lost and querying an advisor, who is more qualified at the task, to obtain language subgoals to make progress. To model language-based assistance, we develop a general framework termed Imitation Learning with Indirect Intervention (I3L), and propose a solution that is effective on the VNLA task. Empirical results show that this approach significantly improves the success rate of the learning agent over other baselines on both seen and unseen environments.
Pages: 12519-12529
Page count: 11
Related papers
50 in total
  • [21] Review of vision-based reinforcement learning for drone navigation
    Aburaya, Anas
    Selamat, Hazlina
    Muslim, Mohd Taufiq
    INTERNATIONAL JOURNAL OF INTELLIGENT ROBOTICS AND APPLICATIONS, 2024 : 974 - 992
  • [22] Efficient vision-based navigation
    Hornung, Armin
    Bennewitz, Maren
    Strasdat, Hauke
    AUTONOMOUS ROBOTS, 2010, 29 (02) : 137 - 149
  • [23] Vision-based wheelchair navigation using geometric AdaBoost learning
    Kim, Eun Yi
    ELECTRONICS LETTERS, 2017, 53 (08) : 534 - 536
  • [24] Vision-Based Autonomous Navigation Using Supervised Learning Techniques
    Souza, Jefferson R.
    Pessin, Gustavo
    Osorio, Fernando S.
    Wolf, Denis F.
    ENGINEERING APPLICATIONS OF NEURAL NETWORKS, PT I, 2011, 363 : 11 - 20
  • [25] Learning of Sensorimotor Behaviors by a SASE Agent for Vision-based Navigation
    Ji, Zhengping
    Huang, Xiao
    Weng, Juyang
    2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008 : 3374 - 3381
  • [26] Language-based learning disorders
    Wegner, LM
    Reed, M
    PEDIATRIC ANNALS, 2005, 34 (04) : 300 - 309
  • [27] A survey on vision-based UAV navigation
    Lu, Yuncheng
    Xue, Zhucun
    Xia, Gui-Song
    Zhang, Liangpei
    GEO-SPATIAL INFORMATION SCIENCE, 2018, 21 (01) : 21 - 32
  • [28] Landmark selection for vision-based navigation
    Sala, P
    Sim, R
    Shokoufandeh, A
    Dickinson, S
    IEEE TRANSACTIONS ON ROBOTICS, 2006, 22 (02) : 334 - 349
  • [29] Motion and structure for vision-based navigation
    Sagüés, C
    Guerrero, JJ
    ROBOTICA, 1999, 17 : 355 - 364
  • [30] Vision-Based UAV Navigation in Orchards
    Stefas, Nikolaus
    Bayram, Haluk
    Isler, Volkan
    IFAC PAPERSONLINE, 2016, 49 (16) : 10 - 15