Efficient 3D Instance Mapping and Localization with Neural Fields

被引:0
|
作者
Tang, George [1 ]
Jatavallabhula, Krishna Murthy [1 ]
Torralba, Antonio [1 ]
机构
[1] MIT, Cambridge, MA 02139 USA
关键词
D O I
10.1109/ICRA57147.2024.10611715
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We tackle the problem of learning an implicit scene representation for 3D instance segmentation from a sequence of posed RGB images. Towards this, we introduce 3DIML, a novel framework that efficiently learns a label field that may be rendered from novel viewpoints to produce view-consistent instance segmentation masks. 3DIML significantly improves upon training and inference runtimes of existing implicit scene representation based methods. Opposed to prior art that optimizes a neural field in a self-supervised manner, requiring complicated training procedures and loss function design, 3DIML leverages a two-phase process. The first phase, InstanceMap, takes as input 2D segmentation masks of the image sequence generated by a frontend instance segmentation model, and associates corresponding masks across images to 3D labels. These almost view-consistent pseudolabel masks are then used in the second phase, InstanceLift, to supervise the training of a neural label field, which interpolates regions missed by InstanceMap and resolves ambiguities. Additionally, we introduce InstanceLoc, which enables near realtime localization of instance masks given a trained label field and an off-the-shelf image segmentation model by fusing outputs from both. We evaluate 3DIML on sequences from the Replica and ScanNet datasets and demonstrate 3DIML's effectiveness under mild assumptions for the image sequences. We achieve a 14-24x speedup over existing implicit scene representation methods with comparable quality, showcasing its potential to facilitate faster and more effective 3D scene understanding.
引用
收藏
页码:1818 / 1824
页数:7
相关论文
共 50 条
  • [41] Noninvasive 3D Field Mapping of Complex Static Electric Fields
    Kainz, Andreas
    Keplinger, Franz
    Hortschitz, Wilfried
    Kahr, Matthias
    Steiner, Harald
    Stifter, Michael
    Hunt, James R.
    Resta-Lopez, Javier
    Rodin, Volodymyr
    Welsch, Carsten P.
    Borburgh, Jan
    Fraser, Matthew Alexander
    Bartmann, Wolfgang
    PHYSICAL REVIEW LETTERS, 2019, 122 (24)
  • [42] In operando 3D mapping of elastic deformation fields in crystalline solids
    Amini, Shahrouz
    Zhu, Tingting
    Razi, Hajar
    Griesshaber, Erika
    Werner, Peter
    Fratzl, Peter
    MATTER, 2024, 7 (07) : 2591 - 2608
  • [43] 3D Stochastic Completion Fields for Mapping Connectivity in Diffusion MRI
    MomayyezSiahkal, Parya
    Siddiqi, Kaleem
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (04) : 983 - 995
  • [44] IAN: Instance-Augmented Net for 3D Instance Segmentation
    Wan, Zihao
    Hu, Jianhua
    Zhang, Haojian
    Wang, Yunkuan
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (07) : 4354 - 4361
  • [45] Sewer defect instance segmentation, localization, and 3D reconstruction for sewer floating capsule robots
    Fang, Xu
    Li, Qing
    Zhu, Jiasong
    Chen, Zhipeng
    Zhang, Dejin
    Wu, Kechun
    Ding, Kai
    Li, Qingquan
    AUTOMATION IN CONSTRUCTION, 2022, 142
  • [46] Localization and 3D Mapping using 1D Automotive Radar Sensor
    van Gaalen, Robin
    Uysal, Faruk
    Yarovoy, Alexander
    2020 IEEE RADAR CONFERENCE (RADARCONF20), 2020,
  • [47] Modified Fast-SLAM For 2D Mapping And 3D Localization
    Gharatappeh, Soheil
    Ghorbanian, Mohammad
    Keshmiri, Mehdi
    Taghirad, Hamid D.
    2015 3RD RSI INTERNATIONAL CONFERENCE ON ROBOTICS AND MECHATRONICS (ICROM), 2015, : 108 - 113
  • [48] SatelliteRF: Accelerating 3D Reconstruction in Multi-View Satellite Images with Efficient Neural Radiance Fields
    Zhou, Xin
    Wang, Yang
    Lin, Daoyu
    Cao, Zehao
    Li, Biqing
    Liu, Junyi
    APPLIED SCIENCES-BASEL, 2024, 14 (07):
  • [49] Hierarchical Aggregation for 3D Instance Segmentation
    Chen, Shaoyu
    Fang, Jiemin
    Zhang, Qian
    Liu, Wenyu
    Wang, Xinggang
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15447 - 15456
  • [50] Neural 3D Gaze: 3D Pupil Localization and Gaze Tracking based on Anatomical Eye Model and Neural Refraction Correction
    Lu, Conny
    Chakravarthula, Praneeth
    Liu, Kaihao
    Liu, Xixiang
    Li, Siyuan
    Fuchs, Henry
    2022 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY (ISMAR 2022), 2022, : 375 - 383