CLN: Cross-Domain Learning Network for 2D Image-Based 3D Shape Retrieval

Cited by: 19
Authors
Nie, Weizhi [1]
Zhao, Yue [1]
Nie, Jie [2]
Liu, An-An [1]
Zhao, Sicheng [3]
Affiliations
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Ocean Univ China, Coll Informat Sci & Engn, Qingdao 266100, Peoples R China
[3] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
Funding
National Natural Science Foundation of China
Keywords
Shape; Three-dimensional displays; Two dimensional displays; Feature extraction; Task analysis; Visualization; Computer architecture; Image processing; information retrieval; content-based retrieval; multimedia computing; MODEL RETRIEVAL; NEURAL-NETWORK; FEATURES;
DOI
10.1109/TCSVT.2021.3070969
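Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract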
Retrieving 3D shapes from 2D images is a challenging research topic because of the significant gap between the two domains. Various approaches have recently been proposed to handle this problem. However, most methods treat cross-domain retrieval as a pure domain adaptation problem, which focuses on alignment but ignores the visual relevance between 2D images and their corresponding 3D shapes. To fundamentally decrease the divergence between the domains, we propose a novel cross-domain learning network (CLN) for the 2D image-based 3D shape retrieval task. First, we estimate pose information from the 2D image to guide the view rendering of 3D shapes, which increases the visual correlation of the cross-domain data to eliminate the divergence between them. Second, we introduce a novel joint learning network that considers both domain-specific characteristics and cross-domain interactions for data alignment, which further compensates for the gap between the domains by controlling intra- and inter-class distances. After the metric learning process, discriminative descriptors of images and shapes are generated for the cross-domain retrieval task. To evaluate the effectiveness and robustness of the proposed method, we conduct extensive experiments on the MI3DOR, SHREC'13, and SHREC'14 datasets. The experimental results demonstrate the superiority of the proposed method, with significant improvements over state-of-the-art methods.
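The abstract does not state which loss or distance CLN uses, so the following is only a minimal sketch of the general idea of controlling intra- and inter-class distances and then retrieving by descriptor distance. It assumes a triplet-style margin loss and cosine distance; the function names, the margin value, the shape identifiers, and the toy two-dimensional descriptors are all hypothetical and are not taken from the paper.

```python
import math


def cosine_distance(a, b):
    """Return 1 - cosine similarity of two equal-length descriptor vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def triplet_margin_loss(anchor, positive, negative, margin=0.2):
    """Hinge loss that is zero only when the same-class shape is closer to the
    image descriptor than the different-class shape by at least `margin`.
    (Assumed loss form; the paper's actual objective may differ.)"""
    d_pos = cosine_distance(anchor, positive)   # intra-class distance
    d_neg = cosine_distance(anchor, negative)   # inter-class distance
    return max(0.0, d_pos - d_neg + margin)


def rank_shapes(image_descriptor, shape_descriptors):
    """Return shape ids ordered from nearest to farthest from the query image."""
    return sorted(
        shape_descriptors,
        key=lambda sid: cosine_distance(image_descriptor, shape_descriptors[sid]),
    )


# Hypothetical toy descriptors: one query image and three 3D shapes.
query = [1.0, 0.0]
shapes = {
    "table_07": [0.0, 1.0],
    "chair_02": [0.6, 0.8],
    "chair_01": [0.9, 0.1],
}

ranking = rank_shapes(query, shapes)
well_separated = triplet_margin_loss(query, shapes["chair_01"], shapes["table_07"])
violating = triplet_margin_loss(query, shapes["chair_02"], shapes["chair_01"])
```

In this toy setup the ranking places the shapes whose descriptors point in a direction closest to the query first, and the loss is zero for a triplet that already respects the margin and positive for one that violates it.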
Pages: 992 - 1005
Page count: 14
Related papers
50 in total
  • [1] Cross-Domain Image-Based 3D Shape Retrieval by View Sequence Learning
    Lee, Tang
    Lin, Yen-Liang
    Chiang, HungYueh
    Chiu, Ming-Wei
    Hsu, Winston H.
    Huang, Polly
    2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, : 258 - 266
  • [2] Cross-domain Prototype Contrastive loss for Few-shot 2D Image-Based 3D Model Retrieval
    Zhou, Yaqian
    Liu, Yu
    Song, Dan
    Li, Jiayu
    Li, Xuanya
    Liu, An-An
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2897 - 2902
  • [3] Consistent Domain Structure Learning and Domain Alignment for 2D Image-Based 3D Objects Retrieval
    Su, Yuting
    Li, Yuqian
    Song, Dan
    Nie, Weizhi
    Li, Wenhui
    Liu, An-An
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 883 - 889
  • [4] Collaborative Distribution Alignment for 2D image-based 3D shape retrieval
    Hu, Nian
    Zhou, Heyu
    Liu, An-An
    Huang, Xiangdong
    Zhang, Shenyuan
    Jin, Guoqing
    Guo, Junbo
    Li, Xuanya
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 83
  • [5] 3D shape knowledge graph for cross-domain 3D shape retrieval
    Chang, Rihao
    Ma, Yongtao
    Hao, Tong
    Wang, Weijie
    Nie, Weizhi
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2024,
  • [6] Prototype-based semantic consistency learning for unsupervised 2D image-based 3D shape retrieval
    Liu, An-An
    Zhang, Yuwei
    Zhang, Chenyu
    Li, Wenhui
    Lv, Bo
    Lei, Lei
    Li, Xuanya
    MULTIMEDIA SYSTEMS, 2023, 29 (04) : 1995 - 2007
  • [7] Self-Supervised Auxiliary Domain Alignment for Unsupervised 2D Image-Based 3D Shape Retrieval
    Liu, An-An
    Zhang, Chenyu
    Li, Wenhui
    Gao, Xingyu
    Sun, Zhengya
    Li, Xuanya
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (12) : 8809 - 8821
  • [8] Hierarchical Instance Feature Alignment for 2D Image-Based 3D Shape Retrieval
    Zhou, Heyu
    Nie, Weizhi
    Li, Wenhui
    Song, Dan
    Liu, An-An
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 839 - 845
  • [9] 3D Pose Estimation Based on Reinforce Learning for 2D Image-Based 3D Model Retrieval
    Nie, Wei-Zhi
    Jia, Wen-Wu
    Li, Wen-Hui
    Liu, An-An
    Zhao, Si-Cheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 (23) : 1021 - 1034