MULTI-VIEW 3D RECONSTRUCTION FROM VIDEO WITH TRANSFORMER

被引:0
|
作者
Zhong, Yijie [1 ]
Sun, Zhengxing [1 ]
Sun, Yunhan [1 ]
Luo, Shoutong [1 ]
Wang, Yi [1 ]
Zhang, Wei [1 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R China
关键词
Multi-view 3D reconstruction; Sequential modeling; Transformer-based model;
D O I
10.1109/ICIP46576.2022.9897753
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-view 3D reconstruction is the base for many other applications in computer vision. Video provides multi-view images and temporal information, which can help us better complete the reconstruction goal. Redundant information handling in video and multi-view feature extraction and fusion become the key issues in the shape prior extraction for reconstruction. In this paper, inspired by the recent great success in Transformer models, we propose a transformer-based 3D reconstruction network. We formulate the multi-view 3D reconstruction into three parts: frame encoder, fusion module, and shape decoder. We apply several special used tokens and perform the fusion progressively in the encoder phase, called patch-level progressive fusion module. These tokens describe which part of the object the frame should focus on and the local structural detail progressively. Then we further design a transformer fusion module to aggregate the structure information. Finally, multi-head attention is utilized to build the transformer-based decoder to reuse the shallow features from encoder. In experiments not only can ours method achieve competitive performance, but it also has low model complexity and computation cost.
引用
收藏
页码:1661 / 1665
页数:5
相关论文
共 50 条
  • [41] 3D Shape Reconstruction from Sketches via Multi-view Convolutional Networks
    Lun, Zhaoliang
    Gadelha, Matheus
    Kalogerakis, Evangelos
    Maji, Subhransu
    Wang, Rui
    [J]. PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2017, : 67 - 77
  • [42] Combining Photometric Normals and Multi-View Stereo for 3D Reconstruction
    Grochulla, Martin
    Thormaehlen, Thorsten
    [J]. CVMP 2015: PROCEEDINGS OF THE 12TH EUROPEAN CONFERENCE ON VISUAL MEDIA PRODUCTION, 2015,
  • [43] Research on Multi-View 3D Reconstruction Technology Based on SFM
    Gao, Lei
    Zhao, Yingbao
    Han, Jingchang
    Liu, Huixian
    [J]. SENSORS, 2022, 22 (12)
  • [44] Prior-Guided Multi-View 3D Head Reconstruction
    Wang, Xueying
    Guo, Yudong
    Yang, Zhongqi
    Zhang, Juyong
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 4028 - 4040
  • [45] Improvement on Matching Breakage of Multi-View Stereo 3D Reconstruction
    Lin, Hung-Lin
    Lin, Tsung-Yi
    Li, Yi-Xuan
    Tseng, Yu-Sheng
    Li, Xin-Yi
    Cal, Qlan-Wen
    Chen, Zheng
    Shi, Yi-Rou
    [J]. PROCEEDINGS OF THE IEEE INTERNATIONAL CONFERENCE ON ADVANCED MATERIALS FOR SCIENCE AND ENGINEERING (IEEE-ICAMSE 2016), 2016, : 423 - 425
  • [46] Unsupervised 3D reconstruction method based on multi-view propagation
    Luo, Jingfeng
    Yuan, Dongli
    Zhang, Lan
    Qu, Yaohong
    Su, Shihong
    [J]. Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University, 2024, 42 (01): : 129 - 137
  • [47] IMPROVED MULTI-VIEW DEPTH ESTIMATION FOR VIEW SYNTHESIS IN 3D VIDEO CODING
    Zhang, Qiuwen
    An, Ping
    Zhang, Yan
    Shen, Liquan
    Zhang, Zhaoyang
    [J]. 2011 3DTV CONFERENCE: THE TRUE VISION - CAPTURE, TRANSMISSION AND DISPLAY OF 3D VIDEO (3DTV-CON), 2011,
  • [48] A View-Level Rate Distortion Model for Multi-View/3D Video
    Roodaki, Hoda
    Iravani, Zahra
    Hashemi, Mahmoud Reza
    Shirmohammadi, Shervin
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 18 (01) : 14 - 24
  • [49] FLAME-Based Multi-view 3D Face Reconstruction
    Zheng, Wenzhuo
    Zhao, Junhao
    Liu, Xiaohong
    Pan, Yongyang
    Gan, Zhenghao
    Han, Haozhe
    Liu, Ning
    [J]. ADVANCES IN COMPUTER GRAPHICS, CGI 2023, PT IV, 2024, 14498 : 327 - 339
  • [50] Multi-view 3D Reconstruction in Underwater Environments: Evaluation and Benchmark
    Iscar, Eduardo
    Skinner, Katherine A.
    Johnson-Roberson, Matthew
    [J]. OCEANS 2017 - ANCHORAGE, 2017,