A Coarse-to-Fine Transformer-Based Network for 3D Reconstruction from Non-Overlapping Multi-View Images

被引:0
|
作者
Shan, Yue [1 ]
Xiao, Jun [1 ]
Liu, Lupeng [1 ]
Wang, Yunbiao [1 ]
Yu, Dongbo [1 ]
Zhang, Wenniu [1 ]
机构
[1] Univ Chinese Acad & Sci, Sch Artificial Intelligence, 19 Yuquan Rd, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
point cloud reconstruction; Transformer; non-overlapping; multi-view; POINT CLOUD RECONSTRUCTION; SHAPE;
D O I
10.3390/rs16050901
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Reconstructing 3D structures from non-overlapping multi-view images is a crucial task in the field of 3D computer vision, since it is difficult to establish feature correspondences and infer depth from overlapping parts of views. Previous methods, whether generating the surface mesh or volume of an object, face challenges in simultaneously ensuring the accuracy of detailed topology and the integrity of the overall structure. In this paper, we introduce a novel coarse-to-fine Transformer-based reconstruction network to generate precise point clouds from multiple input images at sparse and non-overlapping viewpoints. Specifically, we firstly employ a general point cloud generation architecture enhanced by the concept of adaptive centroid constraint for the coarse point cloud corresponding to the object. Subsequently, a Transformer-based refinement module applies deformation to each point. We design an attention-based encoder to encode both image projection features and point cloud geometric features, along with a decoder to calculate deformation residuals. Experiments on ShapeNet demonstrate that our proposed method outperforms other competing methods.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] MVS-T: A Coarse-to-Fine Multi-View Stereo Network with Transformer for Low-Resolution Images 3D Reconstruction
    Jia, Ruiming
    Chen, Xin
    Cui, Jiali
    Hu, Zhenghui
    SENSORS, 2022, 22 (19)
  • [2] 3D-C2FT: Coarse-to-Fine Transformer for Multi-view 3D Reconstruction
    Tiong, Leslie Ching Ow
    Sigmund, Dick
    Teoh, Andrew Beng Jin
    COMPUTER VISION - ACCV 2022, PT I, 2023, 13841 : 211 - 227
  • [3] A Transformer-based Network for Multi-view 3D Mesh Generation
    Shi, Wuzhen
    Liu, Zhijie
    Li, Yingxiang
    Wen, Yang
    Liu, Yutao
    Proceedings - 2023 IEEE SmartWorld, Ubiquitous Intelligence and Computing, Autonomous and Trusted Vehicles, Scalable Computing and Communications, Digital Twin, Privacy Computing and Data Security, Metaverse, SmartWorld/UIC/ATC/ScalCom/DigitalTwin/PCDS/Metaverse 2023, 2023,
  • [4] MULTI-VIEW 3D RECONSTRUCTION FROM VIDEO WITH TRANSFORMER
    Zhong, Yijie
    Sun, Zhengxing
    Sun, Yunhan
    Luo, Shoutong
    Wang, Yi
    Zhang, Wei
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1661 - 1665
  • [5] C2FNet: A Coarse-to-Fine Network for Multi-View 3D Point Cloud Generation
    Lei, Jianjun
    Song, Jiahui
    Peng, Bo
    Li, Wanqing
    Pan, Zhaoqing
    Huang, Qingming
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6707 - 6718
  • [6] Photon-Efficient 3D Reconstruction with A Coarse-to-Fine Neural Network
    Guo, Shangwei
    Lai, Zhengchao
    Li, Jun
    Han, Shaokun
    OPTICS AND LASERS IN ENGINEERING, 2022, 159
  • [7] Multi-View Images 3D Reconstruction based on Spatial Geometric Constraint
    Liu, Haibo
    PROCEEDINGS OF THE 2016 2ND WORKSHOP ON ADVANCED RESEARCH AND TECHNOLOGY IN INDUSTRY APPLICATIONS, 2016, 81 : 1217 - 1220
  • [8] TMSDNet: Transformer with multi-scale dense network for single and multi-view 3D reconstruction
    Zhu, Xiaoqiang
    Yao, Xinsheng
    Zhang, Junjie
    Zhu, Mengyao
    You, Lihua
    Yang, Xiaosong
    Zhang, Jianjun
    Zhao, He
    Zeng, Dan
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2024, 35 (01)
  • [9] An Extension of PatchMatch Stereo for 3D Reconstruction from Multi-View Images
    Hiradate, Mutsuki
    Ito, Koichi
    Aoki, Takafumi
    Watanabe, Takafumi
    Unten, Hiroki
    PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 61 - 65
  • [10] Coarse-to-fine cascaded 3D hand reconstruction based on SSGC and MHSA
    Yang, Wenji
    Xie, Liping
    Qian, Wenbin
    Wu, Canghai
    Yang, Hongyun
    VISUAL COMPUTER, 2024, 41 (1): : 11 - 24