3D mesh transformer: A hierarchical neural network with local shape tokens

被引:3
|
作者
Chen, Yu [1 ]
Zhao, Jieyu [1 ]
Huang, Lingfeng [1 ]
Chen, Hao [1 ]
机构
[1] Ningbo Univ, Fac Elect Engn & Comp Sci, Ningbo 315000, Peoples R China
基金
中国国家自然科学基金;
关键词
self-attention networks; 3D mesh Transformer; polynomial fitting; surface subdivision; multilayer Transformer;
D O I
10.1016/j.neucom.2022.09.138
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Self-attention networks have revolutionized Natural Language Processing (NLP) and are making impres-sive strides in image analysis tasks such as image classification and object detection. Inspired by this suc-cess, we specifically design a novel self-attention mechanism between local shapes and build a shape Transformer. We split the 3D mesh model into shape patches, which we call shape tokens, and provide polynomial fitting representations of these patches as input to the shape Transformer. The shape token encodes local geometric information and resembles the token (word) status in NLP. The simplification of the mesh model provides a hierarchical multiresolution structure, which allows us to realize the fea-ture learning of a multilayer Transformer. We set high-level features formed by the shape Transformer as visual tokens and propose a vector-type self-attention mechanism to construct a 3D visual Transformer. Finally, we realized a hierarchical network structure based on local shape tokens and high-level visual tokens. Experiments show that our fusion network of 3D shape Transformer with explicit local shape con-text augmentation and 3D visual Transformer with multi-level structural feature learning achieves excel-lent performance on shape classification and part segmentation tasks.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:328 / 340
页数:13
相关论文
共 50 条
  • [41] Multi-mesh and adaptivity in 3D shape optimization
    Younsi, R
    KnopfLenoir, C
    Selman, A
    COMPUTERS & STRUCTURES, 1996, 61 (06) : 1125 - 1133
  • [42] ROBUST 3D MESH HASHING BASED ON SHAPE FEATURES
    Lee, Suk-Hwan
    Lee, Eung-Joo
    Kwon, Ki-Ryong
    2010 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2010), 2010, : 1040 - 1043
  • [43] Shape-based retrieval of 3D mesh models
    Zaharia, T
    Prêteux, F
    IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, 2002, : 437 - 440
  • [44] Residual Transformer Network for 3D Objects Classification
    Meng, Shan
    Liang, Daoyuan
    Li, Yumei
    IWCMC 2021: 2021 17TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2021, : 1175 - 1179
  • [45] 3D Mesh Model Classification with a Capsule Network
    Zheng, Yang
    Zhao, Jieyu
    Chen, Yu
    Tang, Chen
    Yu, Shushi
    ALGORITHMS, 2021, 14 (03)
  • [46] TUPocket: 3D Convolutional Neural Network Combined with Transformer for Ligand Binding Site Detection
    Meng, Yangtao
    Yan, Tianhao
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT III, ICIC 2024, 2024, 14864 : 419 - 430
  • [47] IMAGE-BASED 3D MESH DENOISING THROUGH A BLOCK MATCHING 3D CONVOLUTIONAL NEURAL NETWORK FILTERING APPROACH
    Arvanitis, Gerasimos
    Lalos, Aris S.
    Moustakas, Konstantinos
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [48] 3D mesh simplification for effective network transmission
    Ko, MC
    Choy, YC
    HSNMC 2002: 5TH IEEE INTERNATIONAL CONFERENCE ON HIGH SPEED NETWORKS AND MULTIMEDIA COMMUNICATIONS, 2002, : 284 - 288
  • [49] LIMAN: Local Information-Based Multiattention Network for 3D Shape Recognition
    Nie, Weizhi
    Ke, Yuqi
    Zhao, Yue
    Liang, Qi
    Su, Yuting
    IEEE MULTIMEDIA, 2022, 29 (01) : 65 - 73
  • [50] 3D displacement measurement using a single-camera and mesh deformation neural network
    Shao, Yanda
    Li, Ling
    Li, Jun
    Li, Qilin
    An, Senjian
    Hao, Hong
    ENGINEERING STRUCTURES, 2024, 318