PlaneFormers: From Sparse View Planes to 3D Reconstruction

被引:9
|
作者
Agarwala, Samir [1 ]
Jin, Linyi [1 ]
Rockwell, Chris [1 ]
Fouhey, David F. [1 ]
机构
[1] Univ Michigan, Ann Arbor, MI USA
来源
关键词
D O I
10.1007/978-3-031-20062-5_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present an approach for the planar surface reconstruction of a scene from images with limited overlap. This reconstruction task is challenging since it requires jointly reasoning about single image 3D reconstruction, correspondence between images, and the relative camera pose between images. Past work has proposed optimization-based approaches. We introduce a simpler approach, the PlaneFormer, that uses a transformer applied to 3D-aware plane tokens to perform 3D reasoning. Our experiments show that our approach is substantially more effective than prior work, and that several 3D-specific design decisions are crucial for its success. Code is available at https://github.com/samiragarwala/PlaneFormers.
引用
收藏
页码:192 / 209
页数:18
相关论文
共 50 条
  • [1] 3D Clothed Human Reconstruction from Sparse Multi-View Images
    Hong, Jin Gyu
    Noh, Seung Young
    Lee, Hee Kyung
    Cheong, Won Sik
    Chang, Ju Yong
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2024, : 677 - 687
  • [2] Single and sparse view 3D reconstruction by learning shape priors
    Chen, Yu
    Cipolla, Roberto
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2011, 115 (05) : 586 - 602
  • [3] 3D Reconstruction and Analysis of Bat Flight Maneuvers from Sparse Multiple View Video
    Bergou, A. J.
    Swartz, S.
    Breuer, K.
    Taubin, G.
    INTEGRATIVE AND COMPARATIVE BIOLOGY, 2012, 52 : E211 - E211
  • [4] Volume reconstruction from sparse 3D ultrasonography
    Gooding, MJ
    Kennedy, S
    Noble, JA
    MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI 2003, PT 2, 2003, 2879 : 416 - 423
  • [5] A 3D Freehand Ultrasound System for Multi-view Reconstructions from Sparse 2D Scanning Planes
    Honggang Yu
    Marios S Pattichis
    Carla Agurto
    M Beth Goens
    BioMedical Engineering OnLine, 10
  • [6] A 3D Freehand Ultrasound System for Multi-view Reconstructions from Sparse 2D Scanning Planes
    Yu, Honggang
    Pattichis, Marios S.
    Agurto, Carla
    Goens, M. Beth
    BIOMEDICAL ENGINEERING ONLINE, 2011, 10
  • [7] 3D road reconstruction from a single view
    Istituto Elettrotecnico Nazionale, `Galileo Ferraris', Torino, Italy
    Comput Vision Image Undersanding, 2 (212-226):
  • [8] 3D road reconstruction from a single view
    Guiducci, A
    COMPUTER VISION AND IMAGE UNDERSTANDING, 1998, 70 (02) : 212 - 226
  • [9] 3D hand reconstruction from a monocular view
    Lee, SU
    Cohen, I
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, 2004, : 310 - 313
  • [10] Outlier removal for sparse 3D reconstruction from video
    Vural, Elif
    Alatan, A. Aydin
    2008 3DTV-CONFERENCE: THE TRUE VISION - CAPTURE, TRANSMISSION AND DISPLAY OF 3D VIDEO, 2008, : 321 - 324