Self-Supervised Auxiliary Domain Alignment for Unsupervised 2D Image-Based 3D Shape Retrieval

被引:9
|
作者
Liu, An-An [1 ,2 ]
Zhang, Chenyu [1 ]
Li, Wenhui [1 ]
Gao, Xingyu [3 ]
Sun, Zhengya [4 ]
Li, Xuanya [5 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Hefei Comprehens Natl Sci Ctr, Inst Artificial Intelligence, Hefei 230088, Peoples R China
[3] Chinese Acad Sci, Inst Microelect, Beijing 100045, Peoples R China
[4] Chinese Acad Sci, Inst Automat, Beijing 100045, Peoples R China
[5] Baidu Inc, Beijing 100085, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Shape; Three-dimensional displays; Task analysis; Representation learning; Semantics; Feature extraction; Visualization; Unsupervised 3D shape retrieval; cross-domain representation; domain adaptation;
D O I
10.1109/TCSVT.2022.3191761
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Unsupervised 2D image-based 3D shape retrieval aims to match the similar 3D unlabeled shapes when given a 2D labeled sample. Although a lot of methods have made a certain degree of progress, the performance of this task is still restricted due to the lack of target labels resulting in tremendous domain gap. In this paper, we aim to explore the discriminative representation of the unlabeled target 3D shapes and facilitate the procedure of domain adaptation by taking full advantage of multi-view information. To achieve the above goals, we propose an effective self-supervised auxiliary domain alignment (SADA) for unsupervised 2D image-based 3D shape retrieval. SADA mainly contains multi-view guided self-supervised feature learning and two auxiliary domain alignments, including intermediate domain alignment and multi-domain alignment. Firstly, we group multiple views of each 3D shape into two sub-target domains based on the view similarities and regard each other as the constraint to optimize the feature learning in an unsupervised manner. To reduce the difficulty of directly aligning the domain discrepancy, we combine the source labeled samples and target samples (pseudo labels) with the same category to generate an intermediate domain, which translates the source-target alignment into source-intermediate and intermediate-target alignments. Moreover, to explore the inner characteristics of target 3D shapes and provide more clues for better adaptation, multi-domain alignment is proposed to convert the source and single target domain alignment to the source and multiple target domain (one target domain and two sub-target domains) alignments. The adversarial training and semantic alignment are employed to fully excavate the relations between source domain and multiple target domains. Experiments on two challenging datasets show that the proposed method achieves competing performance in the unsupervised 2D image-based 3D shape retrieval task.
引用
收藏
页码:8809 / 8821
页数:13
相关论文
共 50 条
  • [11] CLN: Cross-Domain Learning Network for 2D Image-Based 3D Shape Retrieval
    Nie, Weizhi
    Zhao, Yue
    Nie, Jie
    Liu, An-An
    Zhao, Sicheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (03) : 992 - 1005
  • [12] Adaptive semantic transfer network for unsupervised 2D image-based 3D model retrieval
    Song, Dan
    Yang, Yuanxiang
    Li, Wenhui
    Shao, Zhuang
    Nie, Weizhi
    Li, Xuanya
    Liu, An-An
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 238
  • [13] Wasserstein distance feature alignment learning for 2D image-based 3D model retrieval*
    Zhou, Yaqian
    Liu, Yu
    Zhou, Heyu
    Li, Wenhui
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 79
  • [14] Instance-prototype similarity consistency for unsupervised 2D image-based 3D model retrieval
    Li, Wenhui
    Zhang, Yuwei
    Wang, Fan
    Li, Xuanya
    Duan, Yulong
    Liu, An-An
    INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (04)
  • [15] Joint Heterogeneous Feature Learning and Distribution Alignment for 2D Image-Based 3D Object Retrieval
    Su, Yuting
    Li, Yuqian
    Nie, Weizhi
    Song, Dan
    Liu, An-An
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (10) : 3765 - 3776
  • [16] Dual-level Embedding Alignment Network for 2D Image-Based 3D Object Retrieval
    Zhou, Heyu
    Liu, An-An
    Nie, Weizhi
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1667 - 1675
  • [17] Unsupervised Cross-Media Graph Convolutional Network for 2D Image-Based 3D Model Retrieval
    Liang, Qi
    Li, Qiang
    Nie, Weizhi
    Liu, An-An
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 3443 - 3455
  • [18] Vulnerability of Feature Extractors in 2D Image-Based 3D Object Retrieval
    Liu, An-An
    Zhou, He-Yu
    Li, Xuanya
    Wang, Lanjun
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 5065 - 5076
  • [19] 3D Pose Estimation Based on Reinforce Learning for 2D Image-Based 3D Model Retrieval
    Nie, Wei-Zhi
    Jia, Wen-Wu
    Li, Wen-Hui
    Liu, An-An
    Zhao, Si-Cheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 (23) : 1021 - 1034
  • [20] Domain-specific modeling and semantic alignment for image-based 3D model retrieval
    Song, Dan
    Jiang, Xue-Jing
    Zhang, Yue
    Zhang, Fang-Lue
    Jin, Yao
    Zhang, Yun
    COMPUTERS & GRAPHICS-UK, 2023, 115 : 25 - 34