Adaptive region aggregation for multi-view stereo matching using deformable convolutional networks

Citations: 0
Authors
Hu, Han [1]
Su, Liupeng [1]
Mao, Shunfu [1]
Chen, Min [1, 3]
Pan, Guoqiang [2]
Xu, Bo [1]
Zhu, Qing [1]
Affiliations
[1] Southwest Jiaotong Univ, Fac Geosci & Environm Engn, Chengdu, Peoples R China
[2] Chinese Peoples Armed Police Force, Equipment Project Management Ctr, Beijing, Peoples R China
[3] Southwest Jiaotong Univ, Fac Geosci & Environm Engn, Chengdu 611756, Peoples R China
Source
PHOTOGRAMMETRIC RECORD | 2023 / Vol. 38 / Issue 183
Funding
National Natural Science Foundation of China
Keywords
adaptive region aggregation; deformable convolutional network; dense matching; multi-view stereo; RECONSTRUCTION; IMAGES; POINT
DOI
10.1111/phor.12459
Chinese Library Classification number
P9 [Physical Geography]
Discipline classification codes
0705; 070501
Abstract
Deep-learning methods have demonstrated promising performance in multi-view stereo (MVS) applications. However, it remains challenging to apply a geometrical prior on the adaptive matching windows to achieve efficient three-dimensional reconstruction. To address this problem, this paper proposes a learnable adaptive region aggregation method based on deformable convolutional networks (DCNs), which is integrated into the feature extraction workflow of an MVSNet method that uses a coarse-to-fine structure. Following the conventional pipeline of MVSNet, a DCN is used to densely estimate and apply transformations in our feature extractor, which is composed of a deformable feature pyramid network (DFPN). Furthermore, we introduce a dedicated offset regulariser to promote the convergence of the learnable offsets of the DCN. The effectiveness of the proposed DFPN is validated through quantitative and qualitative evaluations on the BlendedMVS and Tanks and Temples benchmark datasets within a cross-dataset evaluation setting.
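As a rough illustration of the idea behind deformable convolution mentioned in the abstract, the sketch below samples each 3x3 kernel tap at a per-pixel offset position using bilinear interpolation, and computes a simple penalty on the offsets. It is a minimal single-channel, pure-Python sketch and not the authors' implementation: the function names (`bilinear`, `deformable_conv3x3`, `offset_l2_penalty`) are hypothetical, the offsets are fixed by hand rather than learned, and the mean squared offset magnitude is only an assumed stand-in, since the abstract does not give the form of the paper's offset regulariser.

```python
import math


def bilinear(img, y, x):
    """Sample img (a list of rows) at real-valued (y, x); zero outside the image."""
    h, w = len(img), len(img[0])
    y0, x0 = math.floor(y), math.floor(x)
    val = 0.0
    for yy, wy in ((y0, 1 - (y - y0)), (y0 + 1, y - y0)):
        for xx, wx in ((x0, 1 - (x - x0)), (x0 + 1, x - x0)):
            if 0 <= yy < h and 0 <= xx < w:
                val += wy * wx * img[yy][xx]
    return val


def deformable_conv3x3(img, kernel, offsets):
    """3x3 deformable convolution on one channel with zero padding.

    offsets[i][j] holds nine (dy, dx) pairs, one per kernel tap, for pixel (i, j).
    With all offsets equal to zero this reduces to an ordinary 3x3 convolution.
    """
    h, w = len(img), len(img[0])
    taps = [(ky, kx) for ky in (-1, 0, 1) for kx in (-1, 0, 1)]
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            acc = 0.0
            for k, (ky, kx) in enumerate(taps):
                dy, dx = offsets[i][j][k]
                acc += kernel[ky + 1][kx + 1] * bilinear(img, i + ky + dy, j + kx + dx)
            out[i][j] = acc
    return out


def offset_l2_penalty(offsets):
    """Mean squared offset magnitude (assumed stand-in for an offset regulariser)."""
    sq = [dy * dy + dx * dx for row in offsets for px in row for dy, dx in px]
    return sum(sq) / len(sq)


# Toy 4x4 image, a 3x3 box filter, and two hand-set offset fields.
img = [[float(i * 4 + j) for j in range(4)] for i in range(4)]
box = [[1.0 / 9] * 3 for _ in range(3)]
zero = [[[(0.0, 0.0)] * 9 for _ in range(4)] for _ in range(4)]
shift = [[[(0.0, 0.5)] * 9 for _ in range(4)] for _ in range(4)]

plain = deformable_conv3x3(img, box, zero)   # ordinary box filter
moved = deformable_conv3x3(img, box, shift)  # window shifted half a pixel to the right
```

In a trained network the offsets would be predicted densely by an additional convolutional layer, and a penalty of this kind would be added to the training loss to keep the sampling window from drifting arbitrarily far.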
Pages: 430-449
Page count: 20