Efficient 3D Instance Mapping and Localization with Neural Fields

Citations: 0
Authors
Tang, George [1]
Jatavallabhula, Krishna Murthy [1]
Torralba, Antonio [1]
Affiliations
[1] MIT, Cambridge, MA 02139 USA
DOI
10.1109/ICRA57147.2024.10611715
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Subject classification code
0812
Abstract
We tackle the problem of learning an implicit scene representation for 3D instance segmentation from a sequence of posed RGB images. To this end, we introduce 3DIML, a novel framework that efficiently learns a label field that can be rendered from novel viewpoints to produce view-consistent instance segmentation masks. 3DIML significantly improves upon the training and inference runtimes of existing methods based on implicit scene representations. In contrast to prior art, which optimizes a neural field in a self-supervised manner and requires complicated training procedures and loss function design, 3DIML uses a two-phase process. The first phase, InstanceMap, takes as input 2D segmentation masks of the image sequence generated by a frontend instance segmentation model and associates corresponding masks across images to 3D labels. These almost view-consistent pseudolabel masks are then used in the second phase, InstanceLift, to supervise the training of a neural label field, which interpolates regions missed by InstanceMap and resolves ambiguities. Additionally, we introduce InstanceLoc, which enables near-real-time localization of instance masks given a trained label field and an off-the-shelf image segmentation model by fusing the outputs of both. We evaluate 3DIML on sequences from the Replica and ScanNet datasets and demonstrate its effectiveness under mild assumptions for the image sequences. 3DIML achieves a 14-24x speedup over existing implicit scene representation methods with comparable quality, showcasing its potential to facilitate faster and more effective 3D scene understanding.
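The association step described for InstanceMap (linking per-image 2D masks to shared labels) can be illustrated with a minimal sketch. This is not the paper's procedure: the record does not say how correspondences are found, so the sketch assumes a hypothetical list of pairwise mask matches is already available and merges them with a union-find structure. The function name `associate_masks` and the `(frame, local_id)` key format are assumptions made for illustration only.

```python
def associate_masks(mask_ids, matches):
    """Assign one global label per group of matched masks.

    mask_ids: iterable of hypothetical (frame, local_id) keys, one per 2D mask.
    matches:  pairs of keys assumed to show the same object instance
              (how these pairs are obtained is outside this sketch).
    Returns a dict mapping each key to an integer global label.
    """
    parent = {m: m for m in mask_ids}

    def find(x):
        # Follow parent links to the root, halving the path as we go.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # Merge the sets of every matched pair.
    for a, b in matches:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    # Number the connected components in sorted key order.
    root_to_label = {}
    labels = {}
    for m in sorted(parent):
        root = find(m)
        if root not in root_to_label:
            root_to_label[root] = len(root_to_label)
        labels[m] = root_to_label[root]
    return labels


masks = [(0, 0), (0, 1), (1, 0), (1, 1), (2, 0)]
matches = [((0, 0), (1, 1)), ((1, 1), (2, 0))]
labels = associate_masks(masks, matches)
```

In this toy input, masks `(0, 0)`, `(1, 1)`, and `(2, 0)` are chained together by the two matches and therefore receive the same global label, while the unmatched masks each keep a label of their own.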
Pages: 1818-1824
Page count: 7