Walk in Views: Multi-view Path Aggregation Graph Network for 3D Shape Analysis

被引:2
|
作者
Xu, Lixiang [1 ,2 ]
Cui, Qingzhe [1 ]
Xu, Wei [1 ]
Chen, Enhong [2 ]
Tong, He [3 ]
Tang, Yuanyan [4 ]
机构
[1] Hefei Univ, Coll Artificial Intelligence & Big Data, Hefei 230027, Anhui, Peoples R China
[2] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230027, Anhui, Peoples R China
[3] Chinese Peoples Liberat Army Aviat Inst, Dept Basic, Beijing 101123, Peoples R China
[4] FST Univ Macau, Zhuhai UM Sci & Technol Res Inst, Macau 999078, Macao, Peoples R China
基金
中国国家自然科学基金;
关键词
3D shape analysis; Path aggregation; Graph networks; Vision transformer; Multi-view fusion;
D O I
10.1016/j.inffus.2023.102131
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The graph-based multi-view methods have achieved state-of-the-art results in 3D shape analysis tasks by taking advantage of graph convolutional networks (GCN) to process discrete data. However, the homogeneity of the traditional GCN aggregation operator leads to a problem in aggregating neighborhood information, i.e., if several views have the same neighbors, the same node embeddings will be generated, resulting in feature redundancy. To address this problem, we propose a Multi-view Path Aggregation Graph Network (MVPNet) for 3D shape analysis, which aims to extract a particular path from a graph composed of multiple views and aggregate it into an effective 3D shape descriptor. Specifically, we first extract a path in the graph through dynamic walking, and update the path status while searching for new nodes during the walking. Then we embed the position information of the nodes in the order of the nodes in the path. Finally, we propose to aggregate the features of a path employing a Path Transformer that is capable of handling ordered sequences. A path contains richer semantic and structural information than a traditional subgraph. To demonstrate the effectiveness of our proposed method, we conduct extensive experiments on three benchmark datasets, namely ModelNet, ShapeNetCore55 and MCB, and these experiments prove that the method outperforms the current methods in 3D shape classification and retrieval tasks.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Multi-view 3D object retrieval leveraging the aggregation of view and instance attentive features
    Lin, Dongyun
    Li, Yiqun
    Cheng, Yi
    Prasad, Shitala
    Nwe, Tin Lay
    Dong, Sheng
    Guo, Aiyuan
    KNOWLEDGE-BASED SYSTEMS, 2022, 247
  • [32] A multi-view recurrent neural network for 3D mesh segmentation
    Le, Truc
    Bui, Giang
    Duan, Ye
    COMPUTERS & GRAPHICS-UK, 2017, 66 : 103 - 112
  • [33] Multi-View 3D Object Retrieval With Deep Embedding Network
    Guo, Haiyun
    Wang, Jinqiao
    Gao, Yue
    Li, Jianqiang
    Lu, Hanqing
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (12) : 5526 - 5537
  • [34] Multi-View 3D Object Detection Network for Autonomous Driving
    Chen, Xiaozhi
    Ma, Huimin
    Wan, Ji
    Li, Bo
    Xia, Tian
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6526 - 6534
  • [35] Multi-view dual attention network for 3D object recognition
    Wenju Wang
    Yu Cai
    Tao Wang
    Neural Computing and Applications, 2022, 34 : 3201 - 3212
  • [36] Multi-view dual attention network for 3D object recognition
    Wang, Wenju
    Cai, Yu
    Wang, Tao
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (04): : 3201 - 3212
  • [37] Multi-view Harmonized Bilinear Network for 3D Object Recognition
    Yu, Tan
    Meng, Jingjing
    Yuan, Junsong
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 186 - 194
  • [38] Multi-view facial landmark detection by using a 3D shape model
    Cech, Jan
    Franc, Vojtech
    Uricar, Michal
    Matas, Jiri
    IMAGE AND VISION COMPUTING, 2016, 47 : 60 - 70
  • [39] Multi-view Shape Generation for a 3D Human-like Body
    Yu, Hang
    Cheang, Chilam
    Fu, Yanwei
    Xue, Xiangyang
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (01)
  • [40] Minimum variance estimation of 3D face shape from multi-view
    Zhang, Zhenqiu
    Hu, Yuxiao
    Yu, Tianli
    Huang, Thomas
    PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION - PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE, 2006, : 547 - +