Navigating to objects in the real world

Cited by: 18
Authors
Gervet, Theophile [1]
Chintala, Soumith [2]
Batra, Dhruv [2, 3]
Malik, Jitendra [2, 4]
Chaplot, Devendra Singh [2]
Affiliations
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Meta AI Res, Menlo Pk, CA USA
[3] Georgia Inst Technol, Atlanta, GA USA
[4] Univ Calif Berkeley, Berkeley, CA USA
Keywords
SIM2REAL; VISION; ROBOTICS; SYSTEM
DOI
10.1126/scirobotics.adf6991
Chinese Library Classification (CLC) number
TP24 [Robotics]
Discipline classification code
080202; 1405
Abstract
Semantic navigation is necessary to deploy mobile robots in uncontrolled environments such as homes or hospitals. Many learning-based approaches have been proposed in response to the lack of semantic understanding of the classical pipeline for spatial navigation, which builds a geometric map using depth sensors and plans to reach point goals. Broadly, end-to-end learning approaches reactively map sensor inputs to actions with deep neural networks, whereas modular learning approaches enrich the classical pipeline with learning-based semantic sensing and exploration. However, learned visual navigation policies have predominantly been evaluated in sim, with little known about what works on a robot. We present a large-scale empirical study of semantic visual navigation methods comparing representative methods with classical, modular, and end-to-end learning approaches across six homes with no prior experience, maps, or instrumentation. We found that modular learning works well in the real world, attaining a 90% success rate. In contrast, end-to-end learning does not, dropping from 77% sim to a 23% real-world success rate because of a large image domain gap between sim and reality. For practitioners, we show that modular learning is a reliable approach to navigate to objects: Modularity and abstraction in policy design enable sim-to-real transfer. For researchers, we identify two key issues that prevent today's simulators from being reliable evaluation benchmarks (a large sim-to-real gap in images and a disconnect between sim and real-world error modes) and propose concrete steps forward.
Pages: 14