Uni-3D: A Universal Model for Panoptic 3D Scene Reconstruction

被引:0
|
作者
Zhang, Xiang [1 ]
Chen, Zeyuan [1 ]
Wei, Fangyin [2 ]
Tu, Zhuowen [1 ]
机构
[1] Univ Calif San Diego, La Jolla, CA 92093 USA
[2] Princeton Univ, Princeton, NJ 08544 USA
关键词
D O I
10.1109/ICCV51070.2023.00849
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Performing holistic 3D scene understanding from a single-view observation, involving generating instance shapes and 3D scene segmentation, is a long-standing challenge. Prevailing works either focus only on geometry or segmentation, or model the task in two folds by separate modules, whose results are merged later to form the final prediction. Inspired by recent advances in 2D vision that unify image segmentation and detection by Transformer-based models, we present Uni-3D, a holistic 3D scene parsing/reconstruction system for a single RGB image. Uni-3D features a universal model with query-based representations for predicting segments of both object instances and scene layout. In Uni-3D, we also introduce a single Transformer for 2D depth-aware panoptic segmentation, which offers queries that serve as strong shape priors in 3D. Uni-3D seamlessly integrates 2D and 3D in its architecture and it outperforms previous methods significantly.
引用
收藏
页码:9222 / 9232
页数:11
相关论文
共 50 条
  • [1] Panoptic 3D Scene Reconstruction From a Single RGB Image
    Dahnert, Manuel
    Hou, Ji
    Niessner, Matthias
    Dai, Angela
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [2] PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving
    Shi, Yining
    Li, Jiusi
    Jiang, Kun
    Wang, Ke
    Wang, Yunlong
    Yang, Mengmeng
    Yang, Diange
    [J]. 2024 INTERNATIONAL CONFERENCE IN 3D VISION, 3DV 2024, 2024, : 1219 - 1228
  • [3] Panoptic Lifting for 3D Scene Understanding with Neural Fields
    Siddiqui, Yawar
    Porzi, Lorenzo
    Bulo, Samuel Rota
    Mueller, Norman
    Niessner, Matthias
    Dai, Angela
    Kontschieder, Peter
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 9043 - 9052
  • [4] 3D crime scene reconstruction
    Buck, Ursula
    [J]. FORENSIC SCIENCE INTERNATIONAL, 2019, 304
  • [5] Forensic 3D scene reconstruction
    Little, CQ
    Small, DE
    Peters, RR
    Rigdon, JB
    [J]. 28TH AIPR WORKSHOP: 3D VISUALIZATION FOR DATA EXPLORATION AND DECISION MAKING, 2000, 3905 : 67 - 73
  • [6] Online Panoptic 3D Reconstruction as a Linear Assignment Problem
    Raivio, Leevi
    Rahtu, Esa
    [J]. IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT II, 2022, 13232 : 39 - 50
  • [7] Dynamic 3D Scene Reconstruction and Enhancement
    Jiang, Cansen
    Fougerolle, Yohan
    Fofi, David
    Demonceaux, Cedric
    [J]. IMAGE ANALYSIS AND PROCESSING,(ICIAP 2017), PT I, 2017, 10484 : 518 - 529
  • [8] 3D scene reconstruction using Kinect
    [J]. Morana, M. (marco.morana@unipa.it), 1600, Springer Verlag (260):
  • [9] 3D Scene Reconstruction and Object Recognition for Indoor Scene
    Shen, Yangping
    Manabe, Yoshitsugu
    Yata, Noriko
    [J]. INTERNATIONAL WORKSHOP ON ADVANCED IMAGE TECHNOLOGY (IWAIT) 2019, 2019, 11049
  • [10] 3D Building Scene Reconstruction Based on 3D LiDAR Point Cloud
    Yang, Shih-Chi
    Fan, Yu-Cheng
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TW), 2017,