Learning Transferable and Discriminative Representations for 2D Image-Based 3D Model Retrieval

被引:0
|
作者
Zhou, Yaqian [1 ,2 ]
Liu, Yu [1 ]
Zhou, Heyu [3 ]
Cheng, Zhiyong [4 ]
Li, Xuanya [5 ]
Liu, An-An [3 ]
机构
[1] Tianjin Univ, Sch Microelect, Tianjin 300072, Peoples R China
[2] Hefei Comprehens Natl Sci Ctr, Inst Artificial Intelligence, Hefei 230088, Peoples R China
[3] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[4] Qilu Univ Technol, Shandong Acad Sci, Shandong Artificial Intelligence Inst, Jinan 250014, Peoples R China
[5] Baidu Inc, Beijing 100105, Peoples R China
基金
中国国家自然科学基金;
关键词
Three-dimensional displays; Solid modeling; Adaptation models; Feature extraction; Error analysis; Data models; Generative adversarial networks; 3D model retrieval; unsupervised domain adaptation; multi-view; KERNEL;
D O I
10.1109/TCSVT.2022.3168967
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Existing research on the 2D image-based 3D model retrieval task focuses on learning transferable representations directly to narrow the domain discrepancy. However, it is not easy to achieve in practice due to the significant variations across two domains. In addition, some methods design a domain discriminator to distinguish the feature arising from source or target domains for transferable feature representations learning, which will lead to an unexpected deterioration of the feature discriminability. To settle these problems, we propose jointly learning transferable and discriminative representations for 2D image-based 3D model retrieval. Specifically, we extract features from the 2D images and 3D models (described as multiple views) by CNN. Considering the difficulty of directly narrowing the discrepancy of two domains, we are prone to connect 2D image and 3D model domains to an intermediate domain, where the domain gap aims to be eliminated. However, the feature transferability does not denote well discriminability. Based on the batch spectral penalization (BSP) theory, the feature transferability is dominated by feature vectors with higher singular values, while the feature discriminability depends on more eigenvectors with lower singular values to convey rich discriminative structures. Therefore, we penalize the largest singular values so that the feature vectors with lower singular values are appropriately enhanced, thereby strengthening feature discriminability. A series of experiments on two challenging datasets, MI3DOR and MI3DOR-2, indicate that our method can significantly improve performance.
引用
收藏
页码:7147 / 7159
页数:13
相关论文
共 50 条
  • [1] 3D Pose Estimation Based on Reinforce Learning for 2D Image-Based 3D Model Retrieval
    Nie, Wei-Zhi
    Jia, Wen-Wu
    Li, Wen-Hui
    Liu, An-An
    Zhao, Si-Cheng
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 1021 - 1034
  • [2] Wasserstein distance feature alignment learning for 2D image-based 3D model retrieval*
    Zhou, Yaqian
    Liu, Yu
    Zhou, Heyu
    Li, Wenhui
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 79
  • [3] Unsupervised self-training correction learning for 2D image-based 3D model retrieval
    Zhou, Yaqian
    Liu, Yu
    Xiao, Jun
    Liu, Min
    Li, Xuanya
    Liu, An-An
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (04)
  • [4] Image-based 3D model retrieval using manifold learning
    Pan-pan MU
    San-yuan ZHANG
    Yin ZHANG
    Xiu-zi YE
    Xiang PAN
    [J]. Frontiers of Information Technology & Electronic Engineering, 2018, 19 (11) : 1397 - 1408
  • [5] Image-based 3D model retrieval using manifold learning
    Mu, Pan-pan
    Zhang, San-yuan
    Zhang, Yin
    Ye, Xiu-zi
    Pan, Xiang
    [J]. FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2018, 19 (11) : 1397 - 1408
  • [6] Image-based 3D model retrieval using manifold learning
    Pan-pan Mu
    San-yuan Zhang
    Yin Zhang
    Xiu-zi Ye
    Xiang Pan
    [J]. Frontiers of Information Technology & Electronic Engineering, 2018, 19 : 1397 - 1408
  • [7] Adaptive semantic transfer network for unsupervised 2D image-based 3D model retrieval
    Song, Dan
    Yang, Yuanxiang
    Li, Wenhui
    Shao, Zhuang
    Nie, Weizhi
    Li, Xuanya
    Liu, An-An
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 238
  • [8] Collaborative Distribution Alignment for 2D image-based 3D shape retrieval
    Hu, Nian
    Zhou, Heyu
    Liu, An-An
    Huang, Xiangdong
    Zhang, Shenyuan
    Jin, Guoqing
    Guo, Junbo
    Li, Xuanya
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 83
  • [9] Vulnerability of Feature Extractors in 2D Image-Based 3D Object Retrieval
    Liu, An-An
    Zhou, He-Yu
    Li, Xuanya
    Wang, Lanjun
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 5065 - 5076
  • [10] CLN: Cross-Domain Learning Network for 2D Image-Based 3D Shape Retrieval
    Nie, Weizhi
    Zhao, Yue
    Nie, Jie
    Liu, An-An
    Zhao, Sicheng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (03) : 992 - 1005