Walk in Views: Multi-view Path Aggregation Graph Network for 3D Shape Analysis

被引:2
|
作者
Xu, Lixiang [1 ,2 ]
Cui, Qingzhe [1 ]
Xu, Wei [1 ]
Chen, Enhong [2 ]
Tong, He [3 ]
Tang, Yuanyan [4 ]
机构
[1] Hefei Univ, Coll Artificial Intelligence & Big Data, Hefei 230027, Anhui, Peoples R China
[2] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230027, Anhui, Peoples R China
[3] Chinese Peoples Liberat Army Aviat Inst, Dept Basic, Beijing 101123, Peoples R China
[4] FST Univ Macau, Zhuhai UM Sci & Technol Res Inst, Macau 999078, Macao, Peoples R China
基金
中国国家自然科学基金;
关键词
3D shape analysis; Path aggregation; Graph networks; Vision transformer; Multi-view fusion;
D O I
10.1016/j.inffus.2023.102131
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The graph-based multi-view methods have achieved state-of-the-art results in 3D shape analysis tasks by taking advantage of graph convolutional networks (GCN) to process discrete data. However, the homogeneity of the traditional GCN aggregation operator leads to a problem in aggregating neighborhood information, i.e., if several views have the same neighbors, the same node embeddings will be generated, resulting in feature redundancy. To address this problem, we propose a Multi-view Path Aggregation Graph Network (MVPNet) for 3D shape analysis, which aims to extract a particular path from a graph composed of multiple views and aggregate it into an effective 3D shape descriptor. Specifically, we first extract a path in the graph through dynamic walking, and update the path status while searching for new nodes during the walking. Then we embed the position information of the nodes in the order of the nodes in the path. Finally, we propose to aggregate the features of a path employing a Path Transformer that is capable of handling ordered sequences. A path contains richer semantic and structural information than a traditional subgraph. To demonstrate the effectiveness of our proposed method, we conduct extensive experiments on three benchmark datasets, namely ModelNet, ShapeNetCore55 and MCB, and these experiments prove that the method outperforms the current methods in 3D shape classification and retrieval tasks.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Learning View-Based Graph Convolutional Network for Multi-View 3D Shape Analysis
    Wei, Xin
    Yu, Ruixuan
    Sun, Jian
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 7525 - 7541
  • [2] Dynamic View Aggregation for Multi-View 3D Shape Recognition
    Zhou, Yuan
    Sun, Zhongqi
    Huo, Shuwei
    Kung, Sun-Yuan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9163 - 9174
  • [3] Emphasizing 3D Properties in Recurrent Multi-View Aggregation for 3D Shape Retrieval
    Xu, Cheng
    Leng, Biao
    Zhang, Cheng
    Zhou, Xiaochen
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7428 - 7435
  • [4] MVPN: Multi-View Prototype Network for 3D Shape Recognition
    Wu, Zizhao
    Yang, Ping
    Wang, Yigang
    IEEE ACCESS, 2019, 7 : 130363 - 130372
  • [5] MVTN: Multi-View Transformation Network for 3D Shape Recognition
    Hamdi, Abdullah
    Giancola, Silvio
    Ghanem, Bernard
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 1 - 11
  • [6] Multi-view Moments Embedding Network for 3D Shape Recognition
    Xiao, Jun
    Zhang, Yuanxing
    Zhao, Pengyu
    Xiao, Kecheng
    Bian, Kaigui
    Zhang, Chunli
    Yan, Wei
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2257 - 2260
  • [7] Principal views selection based on growing graph convolution network for multi-view 3D model recognition
    Qi Liang
    Qiang Li
    Weizhi Nie
    Yuting Su
    Applied Intelligence, 2023, 53 : 5320 - 5336
  • [8] Principal views selection based on growing graph convolution network for multi-view 3D model recognition
    Liang, Qi
    Li, Qiang
    Nie, Weizhi
    Su, Yuting
    APPLIED INTELLIGENCE, 2023, 53 (05) : 5320 - 5336
  • [9] Multi-view 3D shape style transformation
    Xiuping Liu
    Hua Huang
    Weiming Wang
    Jun Zhou
    The Visual Computer, 2022, 38 : 669 - 684
  • [10] Multi-view 3D shape style transformation
    Liu, Xiuping
    Huang, Hua
    Wang, Weiming
    Zhou, Jun
    VISUAL COMPUTER, 2022, 38 (02): : 669 - 684