Vision-Based Navigation With Language-Based Assistance via Imitation Learning With Indirect Intervention

Cited by: 47

Authors:

Nguyen, Khanh [1]

Dey, Debadeepta [2]

Brockett, Chris [2]

Dolan, Bill [2]

Affiliations:

[1] Univ Maryland, College Pk, MD 20742 USA

[2] Microsoft Res, Redmond, WA USA

Source:

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019

Keywords:

DOI:

10.1109/CVPR.2019.01281

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

We present Vision-based Navigation with Language-based Assistance (VNLA), a grounded vision-language task where an agent with visual perception is guided via language to find objects in photorealistic indoor environments. The task emulates a real-world scenario in that (a) the requester may not know how to navigate to the target objects and thus makes requests by only specifying high-level end-goals, and (b) the agent is capable of sensing when it is lost and querying an advisor, who is more qualified at the task, to obtain language subgoals to make progress. To model language-based assistance, we develop a general framework termed Imitation Learning with Indirect Intervention (I3L), and propose a solution that is effective on the VNLA task. Empirical results show that this approach significantly improves the success rate of the learning agent over other baselines on both seen and unseen environments.

Pages: 12519-12529

Page count: 11
