DRCNN: Dynamic Routing Convolutional Neural Network for Multi-View 3D Object Recognition

被引:0
|
作者
Sun, Kai [1 ]
Zhang, Jiangshe [1 ]
Liu, Junmin [1 ]
Yu, Ruixuan [1 ]
Song, Zengjie [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Math & Stat, Xian 215123, Peoples R China
基金
中国国家自然科学基金;
关键词
Three-dimensional displays; Routing; Heuristic algorithms; Object recognition; Fuses; Task analysis; Shape; 3D object recognition; view-based methods; dynamic routing layer; dynamic routing convolutional neural network; MODEL;
D O I
10.1109/TIP.2020.3039378
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D object recognition is one of the most important tasks in 3D data processing, and has been extensively studied recently. Researchers have proposed various 3D recognition methods based on deep learning, among which a class of view-based approaches is a typical one. However, in the view-based methods, the commonly used view pooling layer to fuse multi-view features causes a loss of visual information. To alleviate this problem, in this paper, we construct a novel layer called Dynamic Routing Layer (DRL) by modifying the dynamic routing algorithm of capsule network, to more effectively fuse the features of each view. Concretely, in DRL, we use rearrangement and affine transformation to convert features, then leverage the modified dynamic routing algorithm to adaptively choose the converted features, instead of ignoring all but the most active feature in view pooling layer. We also illustrate that the view pooling layer is a special case of our DRL. In addition, based on DRL, we further present a Dynamic Routing Convolutional Neural Network (DRCNN) for multi-view 3D object recognition. Our experiments on three 3D benchmark datasets show that our proposed DRCNN outperforms many state-of-the-arts, which demonstrates the efficacy of our method.
引用
收藏
页码:868 / 877
页数:10
相关论文
共 50 条
  • [1] Drcnn: Dynamic routing convolutional neural network for multi-view 3d object recognition
    Sun, Kai
    Zhang, Jiangshe
    Liu, Junmin
    Yu, Ruixuan
    Song, Zengjie
    [J]. IEEE Transactions on Image Processing, 2021, 30 : 868 - 877
  • [2] An Improved Multi-View Convolutional Neural Network for 3D Object Retrieval
    He, Xinwei
    Bai, Song
    Chu, Jiajia
    Bai, Xiang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 7917 - 7930
  • [3] Hierarchical Graph Attention Based Multi-View Convolutional Neural Network for 3D Object Recognition
    Zeng, Hui
    Zhao, Tianmeng
    Cheng, Ruting
    Wang, Fuzhou
    Liu, Jiwei
    [J]. IEEE ACCESS, 2021, 9 (09): : 33323 - 33335
  • [4] 3D object recognition based on pairwise Multi-view Convolutional Neural Networks
    Gao, Z.
    Wang, D. Y.
    Xue, Y. B.
    Xu, G. P.
    Zhang, H.
    Wang, Y. L.
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 56 : 305 - 315
  • [5] 3D Point Cloud Object Detection with Multi-View Convolutional Neural Network
    Pang, Guan
    Neumann, Ulrich
    [J]. 2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 585 - 590
  • [6] Multi-view convolutional vision transformer for 3D object recognition
    Li, Jie
    Liu, Zhao
    Li, Li
    Lin, Junqin
    Yao, Jian
    Tu, Jingmin
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 95
  • [7] 3D Point Cloud Recognition Based on a Multi-View Convolutional Neural Network
    Zhang, Le
    Sun, Jian
    Zheng, Qiang
    [J]. SENSORS, 2018, 18 (11)
  • [8] Multi-view Convolutional Neural Networks for 3D Shape Recognition
    Su, Hang
    Maji, Subhransu
    Kalogerakis, Evangelos
    Learned-Miller, Erik
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 945 - 953
  • [9] Multi-view dual attention network for 3D object recognition
    Wenju Wang
    Yu Cai
    Tao Wang
    [J]. Neural Computing and Applications, 2022, 34 : 3201 - 3212
  • [10] Multi-view dual attention network for 3D object recognition
    Wang, Wenju
    Cai, Yu
    Wang, Tao
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (04): : 3201 - 3212