SGAligner: 3D Scene Alignment with Scene Graphs

被引:0
|
作者
Sarkar, Sayan Deb [1 ]
Miksik, Ondrej [2 ]
Pollefeys, Marc [1 ,2 ]
Barath, Daniel [1 ]
Armeni, Iro [1 ]
机构
[1] Swiss Fed Inst Technol, Dept Comp Sci, Zurich, Switzerland
[2] Microsoft Mixed Real & AI Lab, Zurich, Switzerland
关键词
D O I
10.1109/ICCV51070.2023.02004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Building 3D scene graphs has recently emerged as a topic in scene representation for several embodied AI applications to represent the world in a structured and rich manner. With their increased use in solving downstream tasks (e.g., navigation and room rearrangement), can we leverage and recycle them for creating 3D maps of environments, a pivotal step in agent operation? We focus on the fundamental problem of aligning pairs of 3D scene graphs whose overlap can range from zero to partial and can contain arbitrary changes. We propose SGAligner, the first method for aligning pairs of 3D scene graphs that is robust to in-the-wild scenarios (i.e., unknown overlap - if any and changes in the environment). We get inspired by multi-modality knowledge graphs and use contrastive learning to learn a joint, multi-modal embedding space. We evaluate on the 3RScan dataset and further showcase that our method can be used for estimating the transformation between pairs of 3D scenes. Since benchmarks for these tasks are missing, we create them on this dataset. The code, benchmark, and trained models are available on the project website.
引用
收藏
页码:21870 / 21880
页数:11
相关论文
共 50 条
  • [1] 3D Scene Management Method Combined with Scene Graphs
    Wang, Xiang
    Shen, Tao
    Hu, Liang
    Guo, Congnan
    Gao, Su
    [J]. SENSORS AND MATERIALS, 2022, 34 (01) : 277 - 287
  • [2] 3D Scene Management Method Combined with Scene Graphs
    Wang, Xiang
    Shen, Tao
    Huo, Liang
    Guo, Congnan
    Gao, Su
    [J]. Sensors and Materials, 2021, 34 (01) : 277 - 287
  • [3] Unsupervised Traffic Scene Generation with Synthetic 3D Scene Graphs
    Savkin, Artem
    Ellouze, Rachid
    Navab, Nassir
    Tombari, Federico
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 1229 - 1235
  • [4] 3D VSG: Long-term Semantic Scene Change Prediction through 3D Variable Scene Graphs
    Looper, Samuel
    Rodriguez-Puigvert, Javier
    Siegwart, Roland
    Cadena, Cesar
    Schmid, Lukas
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 8179 - 8186
  • [5] Automatic alignment of paintings and photographs depicting a 3D scene
    Russell, Bryan C.
    Sivic, Josef
    Ponce, Jean
    Dessales, Helene
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,
  • [6] Learning 3D Semantic Scene Graphs with Instance Embeddings
    Johanna Wald
    Nassir Navab
    Federico Tombari
    [J]. International Journal of Computer Vision, 2022, 130 : 630 - 651
  • [7] Learning 3D Semantic Scene Graphs with Instance Embeddings
    Wald, Johanna
    Navab, Nassir
    Tombari, Federico
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (03) : 630 - 651
  • [8] Learning 3D Semantic Scene Graphs from 3D Indoor Reconstructions
    Wald, Johanna
    Dhamo, Helisa
    Navab, Nassir
    Tombari, Federico
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3960 - 3969
  • [9] 3D Scene Reconstruction and Object Recognition for Indoor Scene
    Shen, Yangping
    Manabe, Yoshitsugu
    Yata, Noriko
    [J]. INTERNATIONAL WORKSHOP ON ADVANCED IMAGE TECHNOLOGY (IWAIT) 2019, 2019, 11049
  • [10] Monte Carlo Scene Search for 3D Scene Understanding
    Hampali, Shreyas
    Stekovic, Sinisa
    Sarkar, Sayan Deb
    Kumar, Chetan S.
    Fraundorfer, Friedrich
    Lepetit, Vincent
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13799 - 13808