3D mesh transformer: A hierarchical neural network with local shape tokens

被引:3
|
作者
Chen, Yu [1 ]
Zhao, Jieyu [1 ]
Huang, Lingfeng [1 ]
Chen, Hao [1 ]
机构
[1] Ningbo Univ, Fac Elect Engn & Comp Sci, Ningbo 315000, Peoples R China
基金
中国国家自然科学基金;
关键词
self-attention networks; 3D mesh Transformer; polynomial fitting; surface subdivision; multilayer Transformer;
D O I
10.1016/j.neucom.2022.09.138
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Self-attention networks have revolutionized Natural Language Processing (NLP) and are making impres-sive strides in image analysis tasks such as image classification and object detection. Inspired by this suc-cess, we specifically design a novel self-attention mechanism between local shapes and build a shape Transformer. We split the 3D mesh model into shape patches, which we call shape tokens, and provide polynomial fitting representations of these patches as input to the shape Transformer. The shape token encodes local geometric information and resembles the token (word) status in NLP. The simplification of the mesh model provides a hierarchical multiresolution structure, which allows us to realize the fea-ture learning of a multilayer Transformer. We set high-level features formed by the shape Transformer as visual tokens and propose a vector-type self-attention mechanism to construct a 3D visual Transformer. Finally, we realized a hierarchical network structure based on local shape tokens and high-level visual tokens. Experiments show that our fusion network of 3D shape Transformer with explicit local shape con-text augmentation and 3D visual Transformer with multi-level structural feature learning achieves excel-lent performance on shape classification and part segmentation tasks.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:328 / 340
页数:13
相关论文
共 50 条
  • [21] Feature-preserved convolutional neural network for 3D mesh recognition
    Liang, Yaqian
    He, Fazhi
    Zeng, Xiantao
    Yu, Baosheng
    APPLIED SOFT COMPUTING, 2022, 128
  • [22] A multi-view recurrent neural network for 3D mesh segmentation
    Le, Truc
    Bui, Giang
    Duan, Ye
    COMPUTERS & GRAPHICS-UK, 2017, 66 : 103 - 112
  • [23] Hierarchical segmentation algorithm for 3D mesh surfaces
    Yan, Jing-Qi
    Shi, Peng-Fei
    Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2002, 36 (04): : 494 - 497
  • [24] Hierarchical representation and coding of 3D mesh geometry
    Celasun, Isil
    Eroeksuez, Serkan
    Siddiqui, Rizwan A.
    Tekalp, A. Murat
    2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, : 1893 - +
  • [25] Topology driven 3D mesh hierarchical segmentation
    Tierny, Julien
    Vandeborre, Jean-Philippe
    Daoudi, Mohamed
    IEEE INTERNATIONAL CONFERENCE ON SHAPE MODELING AND APPLICATIONS 2007, PROCEEDINGS, 2007, : 215 - +
  • [26] THE PERCEPTION OF LOCAL 3D SHAPE
    PHILLIPS, F
    TODD, JT
    NORMAN, JF
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 1994, 35 (04) : 1627 - 1627
  • [27] 3D Human Mesh Reconstruction by Learning to Sample Joint Adaptive Tokens for Transformers
    Xue, Youze
    Chen, Jiansheng
    Zhang, Yudong
    Yu, Cheng
    Ma, Huimin
    Ma, Hongbing
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 6765 - 6773
  • [28] Salient Local 3D Features for 3D Shape Retrieval
    Godil, Afzal
    Wagan, Asim Imdad
    THREE-DIMENSIONAL IMAGING, INTERACTION, AND MEASUREMENT, 2011, 7864
  • [29] SVHAN: Sequential View Based Hierarchical Attention Network for 3D Shape Recognition
    Zhao, Yue
    Nie, Weizhi
    Liu, An-An
    Gao, Zan
    Su, Yuting
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2130 - 2138
  • [30] GLCNet: Global-Local Complementary Network for 3D Shape Recognition
    Wang, Xiaofeng
    Cui, Qingzhe
    Xu, Lixiang
    Liu, Haifeng
    He, Lixin
    Luo, Bin
    Chen, Sibao
    Tang, Yuanyan
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,