Unsupervised Cross-Media Graph Convolutional Network for 2D Image-Based 3D Model Retrieval

被引:5
|
作者
Liang, Qi [1 ,2 ]
Li, Qiang [1 ]
Nie, Weizhi [3 ]
Liu, An-An [2 ,3 ]
机构
[1] Tianjin Univ, Sch Microelect, Tianjin 300072, Peoples R China
[2] Hefei Comprehens Natl Sci Ctr, Inst Artifificial Intelligence, Hefei 230088, Peoples R China
[3] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
基金
中国国家自然科学基金;
关键词
3D model; graph embedding; image based; unsupervised;
D O I
10.1109/TMM.2022.3160616
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rapid development of 3D construction technology, 3D models have been implemented in many applications. In particular, the fields of virtual and augmented reality have created a considerable demand for rapid access to large sets of 3D models in recent years. An effective method for addressing the demand is to search 3D models based on 2D images because 2D images can be easily captured by smartphones or other lightweight vision sensors. In this paper, we propose a novel unsupervised cross-media graph convolutional network (UCM-GCN) for 3D model retrieval based on 2D images. Here, we render views from 3D models to construct a graph model based on 3D model structural information. Then, we utilize the 2D image's visual information to bridge the gap between cross-modality data. Then, the proposed UCM-GCN is utilized to update the feature vector of the 2D image and the 3D model. Here, we introduce correlation loss to mitigate the distribution discrepancy across different modalities, which can fully consider the structural and visual similarities between the 2D image and 3D model to embed the final different modalities into the same feature space. To demonstrate the performance of our approach, we conducted a series of experiments on the MI3DOR dataset, which is utilized in SHREC19. We also compared it with other similar methods on the 3D-FUTURE dataset. The experimental results demonstrate the superiority of our proposed method over state-of-the-art methods.
引用
收藏
页码:3443 / 3455
页数:13
相关论文
共 50 条
  • [1] Adaptive semantic transfer network for unsupervised 2D image-based 3D model retrieval
    Song, Dan
    Yang, Yuanxiang
    Li, Wenhui
    Shao, Zhuang
    Nie, Weizhi
    Li, Xuanya
    Liu, An-An
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 238
  • [2] Unsupervised self-training correction learning for 2D image-based 3D model retrieval
    Zhou, Yaqian
    Liu, Yu
    Xiao, Jun
    Liu, Min
    Li, Xuanya
    Liu, An-An
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (04)
  • [3] Instance-prototype similarity consistency for unsupervised 2D image-based 3D model retrieval
    Li, Wenhui
    Zhang, Yuwei
    Wang, Fan
    Li, Xuanya
    Duan, Yulong
    Liu, An-An
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (04)
  • [4] CLN: Cross-Domain Learning Network for 2D Image-Based 3D Shape Retrieval
    Nie, Weizhi
    Zhao, Yue
    Nie, Jie
    Liu, An-An
    Zhao, Sicheng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (03) : 992 - 1005
  • [5] M-GCN: Multi-Branch Graph Convolution Network for 2D Image-based on 3D Model Retrieval
    Nie, Wei-Zhi
    Ren, Min-Jie
    Liu, An-An
    Mao, Zhendong
    Nie, Jie
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 1962 - 1976
  • [6] M-GCN: Multi-Branch Graph Convolution Network for 2D Image-based on 3D Model Retrieval
    Nie, Wei-Zhi
    Ren, Min-Jie
    Liu, An-An
    Mao, Zhendong
    Nie, Jie
    [J]. Liu, An-An (anan0422@gmail.com); Nie, Jie (niejie@ouc.edu.cn), 1962, Institute of Electrical and Electronics Engineers Inc. (23): : 1962 - 1976
  • [7] 3D Pose Estimation Based on Reinforce Learning for 2D Image-Based 3D Model Retrieval
    Nie, Wei-Zhi
    Jia, Wen-Wu
    Li, Wen-Hui
    Liu, An-An
    Zhao, Si-Cheng
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 (23) : 1021 - 1034
  • [8] Learning Transferable and Discriminative Representations for 2D Image-Based 3D Model Retrieval
    Zhou, Yaqian
    Liu, Yu
    Zhou, Heyu
    Cheng, Zhiyong
    Li, Xuanya
    Liu, An-An
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 7147 - 7159
  • [9] Prototype-based semantic consistency learning for unsupervised 2D image-based 3D shape retrieval
    Liu, An-An
    Zhang, Yuwei
    Zhang, Chenyu
    Li, Wenhui
    Lv, Bo
    Lei, Lei
    Li, Xuanya
    [J]. MULTIMEDIA SYSTEMS, 2023, 29 (04) : 1995 - 2007
  • [10] Prototype-based semantic consistency learning for unsupervised 2D image-based 3D shape retrieval
    An-An Liu
    Yuwei Zhang
    Chenyu Zhang
    Wenhui Li
    Bo Lv
    Lei Lei
    Xuanya Li
    [J]. Multimedia Systems, 2023, 29 : 1995 - 2007