TMVNet : Using Transformers for Multi-view Voxel-based 3D Reconstruction

被引:8
|
作者
Peng, Kebin [1 ]
Islam, Rifatul [1 ]
Quarles, John [1 ]
Desai, Kevin [1 ]
机构
[1] Univ Texas San Antonio, San Antonio, TX 78249 USA
关键词
D O I
10.1109/CVPRW56347.2022.00036
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Previous research in multi-view 3D reconstruction have used different convolution neural network (CNN) architectures to obtain a 3D voxel representation. Even though CNN works well, they have limitations in exploiting the long-range dependencies in sequence transduction tasks such as multi-view 3D reconstruction. In this paper, we propose TMVNet - a two-layer transformer encoder that can better use long-range dependencies information. In contrast to using a 2D CNN decoder by the previous approaches, our model uses a 3D CNN encoder to capture the relations between the voxels in the 3D space. Also, our proposed 3D feature fusion network aggregates 3D position feature from CNN and long-range dependencies feature from transformer together. The proposed TMVNet is trained and tested on the ShapeNet dataset. Comparison against ten state-of-the-art multi-view 3D reconstruction methods and the reported quantitative and qualitative results showcase the superiority of our method.
引用
收藏
页码:221 / 229
页数:9
相关论文
共 50 条
  • [1] Multi-view 3D Reconstruction with Transformers
    Wang, Dan
    Cui, Xinrui
    Chen, Xun
    Zou, Zhengxia
    Shi, Tianyang
    Salcudean, Septimiu
    Wang, Z. Jane
    Ward, Rabab
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5702 - 5711
  • [2] 3D ear reconstruction attempts: Using multi-view
    Liu, Heng
    Yan, Jingqi
    Zhang, David
    [J]. INTELLIGENT COMPUTING IN SIGNAL PROCESSING AND PATTERN RECOGNITION, 2006, 345 : 578 - 583
  • [3] Overview of 3D Reconstruction Methods Based on Multi-view
    Li, Mengxin
    Zheng, Dai
    Zhang, Rui
    Yin, Jiadi
    Tian, Xiangqian
    [J]. 2015 7TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS IHMSC 2015, VOL II, 2015,
  • [4] Underwater 3D reconstruction based on multi-view stereo
    Gu, Feifei
    Zhao, Juan
    Xu, Pei
    Huang, Shulan
    Zhang, Gaopeng
    Song, Zhan
    [J]. OCEAN OPTICS AND INFORMATION TECHNOLOGY, 2018, 10850
  • [5] 3D Reconstruction for Multi-view Objects
    Yu, Jun
    Yin, Wenbin
    Hu, Zhiyi
    Liu, Yabin
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2023, 106
  • [6] Research on Multi-View 3D Reconstruction Technology Based on SFM
    Gao, Lei
    Zhao, Yingbao
    Han, Jingchang
    Liu, Huixian
    [J]. SENSORS, 2022, 22 (12)
  • [7] FLAME-Based Multi-view 3D Face Reconstruction
    Zheng, Wenzhuo
    Zhao, Junhao
    Liu, Xiaohong
    Pan, Yongyang
    Gan, Zhenghao
    Han, Haozhe
    Liu, Ning
    [J]. ADVANCES IN COMPUTER GRAPHICS, CGI 2023, PT IV, 2024, 14498 : 327 - 339
  • [8] Unsupervised 3D reconstruction method based on multi-view propagation
    Luo J.
    Yuan D.
    Zhang L.
    Qu Y.
    Su S.
    [J]. Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University, 2024, 42 (01): : 129 - 137
  • [9] 3D Face Reconstruction based on Multi-View Stereo Algorithm
    Peng, Keju
    Guan, Tao
    Xu, Chao
    Zhou, Dongxiang
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS IEEE-ROBIO 2014, 2014, : 2310 - 2314
  • [10] COMPRESSED SENSING FOR MULTI-VIEW TRACKING AND 3-D VOXEL RECONSTRUCTION
    Reddy, Dikpal
    Sankaranarayanan, Aswin C.
    Cevher, Volkan
    Chellappa, Rama
    [J]. 2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2008, : 221 - 224