Multi-View 3D Object Retrieval With Deep Embedding Network

被引:85
|
作者
Guo, Haiyun [1 ,2 ]
Wang, Jinqiao [1 ,2 ]
Gao, Yue [3 ]
Li, Jianqiang [4 ]
Lu, Hanqing [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100190, Peoples R China
[3] Tsinghua Univ, Sch Software, Tsinghua Natl Lab Informat Sci & Technol TNList, Key Lab Informat Syst Secur,Minist Educ, Beijing 100084, Peoples R China
[4] Beijing Univ Technol, Sch Software Engn, Beijing Engn Res Ctr IoT Software & Syst, Beijing 100083, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolutional neural network; multi-view 3D object retrieval; triplet loss; MODEL; DISTANCE; SIMILARITY;
D O I
10.1109/TIP.2016.2609814
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In multi-view 3D object retrieval, each object is characterized by a group of 2D images captured from different views. Rather than using hand-crafted features, in this paper, we take advantage of the strong discriminative power of convolutional neural network to learn an effective 3D object representation tailored for this retrieval task. Specifically, we propose a deep embedding network jointly supervised by classification loss and triplet loss to map the high-dimensional image space into a low-dimensional feature space, where the Euclidean distance of features directly corresponds to the semantic similarity of images. By effectively reducing the intra-class variations while increasing the inter-class ones of the input images, the network guarantees that similar images are closer than dissimilar ones in the learned feature space. Besides, we investigate the effectiveness of deep features extracted from different layers of the embedding network extensively and find that an efficient 3D object representation should be a tradeoff between global semantic information and discriminative local characteristics. Then, with the set of deep features extracted from different views, we can generate a comprehensive description for each 3D object and formulate the multi-view 3D object retrieval as a set-to-set matching problem. Extensive experiments on SHREC'15 data set demonstrate the superiority of our proposed method over the previous state-of-the-art approaches with over 12% performance improvement.
引用
收藏
页码:5526 / 5537
页数:12
相关论文
共 50 条
  • [1] Comparative Study of Multi-View 3D Object Retrieval with Autoencoder & Deep Embedding Network
    Aktar, Sakifa
    Al Mamun, Md
    Hossain, Md Ali
    [J]. 2018 21ST INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2018,
  • [2] Multi-View Hierarchical Fusion Network for 3D Object Retrieval and Classification
    Liu, An-An
    Hu, Nian
    Song, Dan
    Guo, Fu-Bin
    Zhou, He-Yu
    Hao, Tong
    [J]. IEEE ACCESS, 2019, 7 : 153021 - 153030
  • [3] An Improved Multi-View Convolutional Neural Network for 3D Object Retrieval
    He, Xinwei
    Bai, Song
    Chu, Jiajia
    Bai, Xiang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 7917 - 7930
  • [4] A Compact Multi-View Descriptor for 3D Object Retrieval
    Daras, Petros
    Axenopoulos, Apostolos
    [J]. CBMI: 2009 INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING, 2009, : 115 - 119
  • [5] Aggregated Deep Convolutional Neural Networks for Multi-View 3D Object Retrieval
    Alzu'bi, Ahmad
    Abuarqoub, Abdelrahman
    Al-Hmouz, Ahmed
    [J]. 2019 11TH INTERNATIONAL CONGRESS ON ULTRA MODERN TELECOMMUNICATIONS AND CONTROL SYSTEMS AND WORKSHOPS (ICUMT), 2019,
  • [6] Multi-view and multivariate gaussian descriptor for 3D object retrieval
    Gao, Zan
    Xue, Kai-Xin
    Zhang, Hua
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (01) : 555 - 572
  • [7] Multi-view and multivariate gaussian descriptor for 3D object retrieval
    Zan Gao
    Kai-Xin Xue
    Hua Zhang
    [J]. Multimedia Tools and Applications, 2019, 78 : 555 - 572
  • [8] CAPE: Camera View Position Embedding for Multi-View 3D Object Detection
    Xiong, Kaixin
    Gong, Shi
    Ye, Xiaoqing
    Tan, Xiao
    Wan, Ji
    Ding, Errui
    Wang, Jingdong
    Bai, Xiang
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21570 - 21579
  • [9] Multi-View Saliency Guided Deep Neural Network for 3-D Object Retrieval and Classification
    Zhou, He-Yu
    Liu, An-An
    Nie, Wei-Zhi
    Nie, Jie
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (06) : 1496 - 1506
  • [10] Multi-view Moments Embedding Network for 3D Shape Recognition
    Xiao, Jun
    Zhang, Yuanxing
    Zhao, Pengyu
    Xiao, Kecheng
    Bian, Kaigui
    Zhang, Chunli
    Yan, Wei
    [J]. PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2257 - 2260