Self-Supervised Auxiliary Domain Alignment for Unsupervised 2D Image-Based 3D Shape Retrieval

被引:9
|
作者
Liu, An-An [1 ,2 ]
Zhang, Chenyu [1 ]
Li, Wenhui [1 ]
Gao, Xingyu [3 ]
Sun, Zhengya [4 ]
Li, Xuanya [5 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Hefei Comprehens Natl Sci Ctr, Inst Artificial Intelligence, Hefei 230088, Peoples R China
[3] Chinese Acad Sci, Inst Microelect, Beijing 100045, Peoples R China
[4] Chinese Acad Sci, Inst Automat, Beijing 100045, Peoples R China
[5] Baidu Inc, Beijing 100085, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Shape; Three-dimensional displays; Task analysis; Representation learning; Semantics; Feature extraction; Visualization; Unsupervised 3D shape retrieval; cross-domain representation; domain adaptation;
D O I
10.1109/TCSVT.2022.3191761
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Unsupervised 2D image-based 3D shape retrieval aims to match the similar 3D unlabeled shapes when given a 2D labeled sample. Although a lot of methods have made a certain degree of progress, the performance of this task is still restricted due to the lack of target labels resulting in tremendous domain gap. In this paper, we aim to explore the discriminative representation of the unlabeled target 3D shapes and facilitate the procedure of domain adaptation by taking full advantage of multi-view information. To achieve the above goals, we propose an effective self-supervised auxiliary domain alignment (SADA) for unsupervised 2D image-based 3D shape retrieval. SADA mainly contains multi-view guided self-supervised feature learning and two auxiliary domain alignments, including intermediate domain alignment and multi-domain alignment. Firstly, we group multiple views of each 3D shape into two sub-target domains based on the view similarities and regard each other as the constraint to optimize the feature learning in an unsupervised manner. To reduce the difficulty of directly aligning the domain discrepancy, we combine the source labeled samples and target samples (pseudo labels) with the same category to generate an intermediate domain, which translates the source-target alignment into source-intermediate and intermediate-target alignments. Moreover, to explore the inner characteristics of target 3D shapes and provide more clues for better adaptation, multi-domain alignment is proposed to convert the source and single target domain alignment to the source and multiple target domain (one target domain and two sub-target domains) alignments. The adversarial training and semantic alignment are employed to fully excavate the relations between source domain and multiple target domains. Experiments on two challenging datasets show that the proposed method achieves competing performance in the unsupervised 2D image-based 3D shape retrieval task.
引用
收藏
页码:8809 / 8821
页数:13
相关论文
共 50 条
  • [1] Self-supervised Image-based 3D Model Retrieval
    Song, Dan
    Zhang, Chu-Meng
    Zhao, Xiao-Qian
    Wang, Teng
    Nie, Wei-Zhi
    Li, Xuan-Ya
    Liu, An-An
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (02)
  • [2] Collaborative Distribution Alignment for 2D image-based 3D shape retrieval
    Hu, Nian
    Zhou, Heyu
    Liu, An-An
    Huang, Xiangdong
    Zhang, Shenyuan
    Jin, Guoqing
    Guo, Junbo
    Li, Xuanya
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 83
  • [3] Hierarchical Instance Feature Alignment for 2D Image-Based 3D Shape Retrieval
    Zhou, Heyu
    Nie, Weizhi
    Li, Wenhui
    Song, Dan
    Liu, An-An
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 839 - 845
  • [4] Consistent Domain Structure Learning and Domain Alignment for 2D Image-Based 3D Objects Retrieval
    Su, Yuting
    Li, Yuqian
    Song, Dan
    Nie, Weizhi
    Li, Wenhui
    Liu, An-An
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 883 - 889
  • [5] Self-Supervised 2D Image to 3D Shape Translation with Disentangled Representations
    Kaya, Berk
    Timofte, Radu
    2020 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2020), 2020, : 1039 - 1048
  • [6] Prototype-based semantic consistency learning for unsupervised 2D image-based 3D shape retrieval
    Liu, An-An
    Zhang, Yuwei
    Zhang, Chenyu
    Li, Wenhui
    Lv, Bo
    Lei, Lei
    Li, Xuanya
    MULTIMEDIA SYSTEMS, 2023, 29 (04) : 1995 - 2007
  • [7] Prototype-based semantic consistency learning for unsupervised 2D image-based 3D shape retrieval
    An-An Liu
    Yuwei Zhang
    Chenyu Zhang
    Wenhui Li
    Bo Lv
    Lei Lei
    Xuanya Li
    Multimedia Systems, 2023, 29 : 1995 - 2007
  • [8] Semantic Consistency Guided Instance Feature Alignment for 2D Image-Based 3D Shape Retrieval
    Zhou, Heyu
    Nie, Weizhi
    Song, Dan
    Hu, Nian
    Li, Xuanya
    Liu, An-An
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 925 - 933
  • [9] Joint Intermediate Domain Generation and Distribution Alignment for 2D Image-Based 3D Objects Retrieval
    Su, Yuting
    Li, Yuqian
    Song, Dan
    Liu, Anan
    Nie, Jie
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 2127 - 2138
  • [10] Unsupervised self-training correction learning for 2D image-based 3D model retrieval
    Zhou, Yaqian
    Liu, Yu
    Xiao, Jun
    Liu, Min
    Li, Xuanya
    Liu, An-An
    INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (04)