PlaneFormers: From Sparse View Planes to 3D Reconstruction

被引:9
|
作者
Agarwala, Samir [1 ]
Jin, Linyi [1 ]
Rockwell, Chris [1 ]
Fouhey, David F. [1 ]
机构
[1] Univ Michigan, Ann Arbor, MI USA
来源
关键词
D O I
10.1007/978-3-031-20062-5_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present an approach for the planar surface reconstruction of a scene from images with limited overlap. This reconstruction task is challenging since it requires jointly reasoning about single image 3D reconstruction, correspondence between images, and the relative camera pose between images. Past work has proposed optimization-based approaches. We introduce a simpler approach, the PlaneFormer, that uses a transformer applied to 3D-aware plane tokens to perform 3D reasoning. Our experiments show that our approach is substantially more effective than prior work, and that several 3D-specific design decisions are crucial for its success. Code is available at https://github.com/samiragarwala/PlaneFormers.
引用
收藏
页码:192 / 209
页数:18
相关论文
共 50 条
  • [41] 3D curve structure reconstruction from a sparse set of unordered images
    Zheng Jian-dong
    Zhang Li-yan
    Du Xiao-yu
    Ding Zhi-an
    COMPUTERS IN INDUSTRY, 2009, 60 (02) : 126 - 134
  • [42] Automated Reconstruction of 3D Open Surfaces from Sparse Point Clouds
    Arshad, Mohammad Samiul
    Beksi, William J.
    2022 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY ADJUNCT (ISMAR-ADJUNCT 2022), 2022, : 216 - 221
  • [43] Sparse-view planar 3D reconstruction method based on hierarchical token pooling Transformer
    Zhang, Jiahui
    Yang, Jinfu
    Fu, Fuji
    Ma, Jiaqi
    APPLIED SOFT COMPUTING, 2025, 174
  • [44] An FPGA Accelerator for 3D Cone-beam Sparse-view Computed Tomography Reconstruction
    Gu, Yuhan
    Wu, Qing
    Yuan, Zhechen
    Zhang, Xiangyu
    Su, Wenyan
    Zhang, Yuyao
    Lou, Xin
    2024 IEEE 6TH INTERNATIONAL CONFERENCE ON AI CIRCUITS AND SYSTEMS, AICAS 2024, 2024, : 577 - 581
  • [45] High-precision 3D reconstruction of terahertz computed tomography under extremely sparse view
    Dou, Jiazhen
    Fang, Jiongshen
    Jiang, Wenjun
    Di, Jianglei
    Qin, Yuwen
    OPTICS AND LASERS IN ENGINEERING, 2025, 186
  • [46] Learning 3D Gaussians for Extremely Sparse-View Cone-Beam CT Reconstruction
    Lin, Yiqun
    Wang, Hualiang
    Chen, Jixiang
    Li, Xiaomeng
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT VII, 2024, 15007 : 425 - 435
  • [47] Learning View Priors for Single-view 3D Reconstruction
    Kato, Hiroharu
    Harada, Tatsuya
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9770 - 9779
  • [48] A View-Planning Approach to 3D Reconstruction
    Turkar, Yash
    Aluckal, Christo
    Adhivarahan, Charuvahan
    Sebastiani, Alessandro
    Dantu, Karthik
    EXTENDED REALITY, PT III, XR SALENTO 2024, 2024, 15029 : 340 - 350
  • [49] Multi-view 3D Reconstruction with Transformers
    Wang, Dan
    Cui, Xinrui
    Chen, Xun
    Zou, Zhengxia
    Shi, Tianyang
    Salcudean, Septimiu
    Wang, Z. Jane
    Ward, Rabab
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5702 - 5711
  • [50] Single View Facial Hair 3D Reconstruction
    Rotger, Gemma
    Moreno-Noguer, Francesc
    Lumbreras, Felipe
    Agudo, Antonio
    PATTERN RECOGNITION AND IMAGE ANALYSIS, PT I, 2020, 11867 : 423 - 436