Efficient 3D Instance Mapping and Localization with Neural Fields

被引:0
|
作者
Tang, George [1 ]
Jatavallabhula, Krishna Murthy [1 ]
Torralba, Antonio [1 ]
机构
[1] MIT, Cambridge, MA 02139 USA
关键词
D O I
10.1109/ICRA57147.2024.10611715
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We tackle the problem of learning an implicit scene representation for 3D instance segmentation from a sequence of posed RGB images. Towards this, we introduce 3DIML, a novel framework that efficiently learns a label field that may be rendered from novel viewpoints to produce view-consistent instance segmentation masks. 3DIML significantly improves upon training and inference runtimes of existing implicit scene representation based methods. Opposed to prior art that optimizes a neural field in a self-supervised manner, requiring complicated training procedures and loss function design, 3DIML leverages a two-phase process. The first phase, InstanceMap, takes as input 2D segmentation masks of the image sequence generated by a frontend instance segmentation model, and associates corresponding masks across images to 3D labels. These almost view-consistent pseudolabel masks are then used in the second phase, InstanceLift, to supervise the training of a neural label field, which interpolates regions missed by InstanceMap and resolves ambiguities. Additionally, we introduce InstanceLoc, which enables near realtime localization of instance masks given a trained label field and an off-the-shelf image segmentation model by fusing outputs from both. We evaluate 3DIML on sequences from the Replica and ScanNet datasets and demonstrate 3DIML's effectiveness under mild assumptions for the image sequences. We achieve a 14-24x speedup over existing implicit scene representation methods with comparable quality, showcasing its potential to facilitate faster and more effective 3D scene understanding.
引用
收藏
页码:1818 / 1824
页数:7
相关论文
共 50 条
  • [1] An efficient 3D mapping framework
    Tihonkih, Dmitrii
    Kober, Vitaly
    Makovetskii, Artyom
    Voronin, Aleksei
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XLII, 2019, 11137
  • [2] 3D Mapping and Localization on Aerial Images
    Eker, Onur
    Cevikalp, Hakan
    Uzun, Bedirhan
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [3] 3D neural fields and steerable filters for contour-based person localization
    Corradini, A
    Braumann, UD
    Boehme, HJ
    Gross, HM
    NINTH WORKSHOP ON VIRTUAL INTELLIGENCE/DYNAMIC NEURAL NETWORKS: ACADEMIC/INDUSTRIAL/NASA/DEFENSE TECHNICAL INTERCHANGE AND TUTORIALS, 1999, 3728 : 476 - 485
  • [4] 3D Concept Grounding on Neural Fields
    Hong, Yining
    Du, Yilun
    Lin, Chunru
    Tenenbaum, Joshua B.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [5] Hierarchies of Octrees for Efficient 3D Mapping
    Wurm, Kai M.
    Hennes, Daniel
    Holz, Dirk
    Rusu, Radu B.
    Stachniss, Cyrill
    Konolige, Kurt
    Burgard, Wolfram
    2011 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2011, : 4249 - 4255
  • [6] 3D Cooperative Localization and Mapping: Observability Analysis
    Cristofaro, Andrea
    Martinelli, Agostino
    2011 AMERICAN CONTROL CONFERENCE, 2011,
  • [7] Cooperative mutual 3D laser mapping and localization
    Ryde, Julian
    Hu, Huosheng
    2006 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS, VOLS 1-3, 2006, : 1048 - +
  • [8] 3D mapping of complex flow fields in biomaterials
    Mack, Julia
    Youssef, Khalid
    Noel, Onika
    Lake, Michael
    Wu, Ashley
    Iruela-Arispe, Luisa
    Bouchard, Louis
    GLYCOBIOLOGY, 2012, 22 (11) : 1581 - 1582
  • [9] Object spatial localization by fusing 3D point clouds and instance segmentation
    Xia, Chenfei
    Han, Shoudong
    Pan, Xiaofeng
    SN APPLIED SCIENCES, 2020, 2 (03):
  • [10] Object spatial localization by fusing 3D point clouds and instance segmentation
    Chenfei Xia
    Shoudong Han
    Xiaofeng Pan
    SN Applied Sciences, 2020, 2