Vision-Based Navigation With Language-Based Assistance via Imitation Learning With Indirect Intervention

被引:47
|
作者
Nguyen, Khanh [1 ]
Dey, Debadeepta [2 ]
Brockett, Chris [2 ]
Dolan, Bill [2 ]
机构
[1] Univ Maryland, College Pk, MD 20742 USA
[2] Microsoft Res, Redmond, WA USA
关键词
D O I
10.1109/CVPR.2019.01281
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present Vision-based Navigation with Language-based Assistance (VNLA), a grounded vision-language task where an agent with visual perception is guided via language to find objects in photorealistic indoor environments. The task emulates a real-world scenario in that (a) the requester may not know how to navigate to the target objects and thus makes requests by only specifying high-level end-goals, and (b) the agent is capable of sensing when it is lost and querying an advisor, who is more qualified at the task, to obtain language subgoals to make progress. To model language-based assistance, we develop a general framework termed Imitation Learning with Indirect Intervention (I3L), and propose a solution that is effective on the VNLA task. Empirical results show that this approach significantly improves the success rate of the learning agent over other baselines on both seen and unseen environments.
引用
下载
收藏
页码:12519 / 12529
页数:11
相关论文
共 50 条
  • [41] Evaluation of a vision-based parking assistance system
    Vestri, C
    Bougnoux, S
    Bendahan, R
    Fintzel, K
    Wybo, S
    Abad, F
    Kakinami, T
    2005 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2005, : 56 - 60
  • [42] Developing a Vision-Based Driving Assistance System
    Shibli, Ashfak Md
    Hoque, Mohammed Moshiul
    Alam, Lamia
    EMERGING TECHNOLOGIES IN DATA MINING AND INFORMATION SECURITY, IEMIS 2018, VOL 1, 2019, 755 : 799 - 812
  • [43] Vision-based Distributed Multi-UAV Collision Avoidance via Deep Reinforcement Learning for Navigation
    Huang, Huaxing
    Zhu, Guijie
    Fan, Zhun
    Zhai, Hao
    Cai, Yuwei
    Shi, Ze
    Dong, Zhaohui
    Hao, Zhifeng
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 13745 - 13752
  • [44] LANGUAGE-BASED LEARNING-DISABILITIES
    HOOK, PE
    ANNALS OF OTOLOGY RHINOLOGY AND LARYNGOLOGY, 1980, 89 (05): : 179 - 181
  • [45] Competition in learning language-based categories
    Taraban, R
    Roark, B
    APPLIED PSYCHOLINGUISTICS, 1996, 17 (02) : 125 - 148
  • [46] Dyschronic language-based learning disability
    Llinás, R
    Ribary, U
    Tallal, P
    BASIC MECHANISMS IN COGNITION AND LANGUAGE: WITH SPECIAL REFERENCE TO PHONOLOGICAL PROBLEMS IN DYSLEXIA, 1998, 70 : 101 - 108
  • [47] Connecting Language and Vision for Natural Language-Based Vehicle Retrieval
    Bai, Shuai
    Zheng, Zhedong
    Wang, Xiaohan
    Lin, Junyang
    Zhang, Zhu
    Zhou, Chang
    Yang, Hongxia
    Yang, Yi
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 4029 - 4038
  • [48] Vision-based terrain learning
    Karlsen, Robert E.
    Witus, Gary
    UNMANNED SYSTEMS TECHNOLOGY VIII, PTS 1 AND 2, 2006, 6230
  • [49] Language-based translation and prediction of surgical navigation steps for endoscopic wayfinding assistance in minimally invasive surgery
    Richard Bieck
    Katharina Heuermann
    Markus Pirlich
    Juliane Neumann
    Thomas Neumuth
    International Journal of Computer Assisted Radiology and Surgery, 2020, 15 : 2089 - 2100
  • [50] ESA Technology Developments in Vision-Based Navigation
    Dubois-Matra, Olivier
    Casasco, Massimo
    Gestido, Manuel Sanchez
    Garcia, Irene Huertas
    PROCEEDINGS OF THE IUTAM SYMPOSIUM ON OPTIMAL GUIDANCE AND CONTROL FOR AUTONOMOUS SYSTEMS 2023, 2024, 40 : 39 - 50