3D mesh transformer: A hierarchical neural network with local shape tokens

被引:3
|
作者
Chen, Yu [1 ]
Zhao, Jieyu [1 ]
Huang, Lingfeng [1 ]
Chen, Hao [1 ]
机构
[1] Ningbo Univ, Fac Elect Engn & Comp Sci, Ningbo 315000, Peoples R China
基金
中国国家自然科学基金;
关键词
self-attention networks; 3D mesh Transformer; polynomial fitting; surface subdivision; multilayer Transformer;
D O I
10.1016/j.neucom.2022.09.138
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Self-attention networks have revolutionized Natural Language Processing (NLP) and are making impres-sive strides in image analysis tasks such as image classification and object detection. Inspired by this suc-cess, we specifically design a novel self-attention mechanism between local shapes and build a shape Transformer. We split the 3D mesh model into shape patches, which we call shape tokens, and provide polynomial fitting representations of these patches as input to the shape Transformer. The shape token encodes local geometric information and resembles the token (word) status in NLP. The simplification of the mesh model provides a hierarchical multiresolution structure, which allows us to realize the fea-ture learning of a multilayer Transformer. We set high-level features formed by the shape Transformer as visual tokens and propose a vector-type self-attention mechanism to construct a 3D visual Transformer. Finally, we realized a hierarchical network structure based on local shape tokens and high-level visual tokens. Experiments show that our fusion network of 3D shape Transformer with explicit local shape con-text augmentation and 3D visual Transformer with multi-level structural feature learning achieves excel-lent performance on shape classification and part segmentation tasks.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:328 / 340
页数:13
相关论文
共 50 条
  • [31] Mesh Generation from Dense 3D Scattered Data Using Neural Network
    ZHANG Wei
    College of Mechanical and Electronic Engineering
    JinhuaCollege of Profession and Technology
    Computer Aided Drafting,Design and Manufacturing, 2004, Design and Manufacturing.2004 (01) : 30 - 35
  • [32] 3D visual saliency and convolutional neural network for blind mesh quality assessment
    Ilyass Abouelaziz
    Aladine Chetouani
    Mohammed El Hassouni
    Longin Jan Latecki
    Hocine Cherifi
    Neural Computing and Applications, 2020, 32 : 16589 - 16603
  • [33] 3D visual saliency and convolutional neural network for blind mesh quality assessment
    Abouelaziz, Ilyass
    Chetouani, Aladine
    El Hassouni, Mohammed
    Latecki, Longin Jan
    Cherifi, Hocine
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (21): : 16589 - 16603
  • [34] ExMeshCNN: An Explainable Convolutional Neural Network Architecture for 3D Shape Analysis
    Kim, Seonggyeom
    Chae, Dong-Kyu
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 795 - 803
  • [35] PolyNet: Polynomial Neural Network for 3D Shape Recognition with PolyShape Representation
    Yavartanoo, Mohsen
    Hung, Shih-Hsuan
    Neshatavar, Reyhaneh
    Zhang, Yue
    Lee, Kyoung Mu
    2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 1014 - 1023
  • [36] 3D shape detection based on a Bezier neural network of alight line
    Rodríguez, JAM
    Ciseña, MR
    Rodriguez-Vera, R
    Eighth International Symposium on Laser Metrology: MACRO-, MICRO-, AND NANO-TECHNOLOGIES APPLIED IN SCIENCE, ENGINEERING, AND INDUSTRY, 2005, 5776 : 630 - 639
  • [37] Hierarchical 3D Diffusion Wavelet Shape Priors
    Essafi, Salma
    Langs, Georg
    Paragios, Nikos
    2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 1717 - 1724
  • [38] An evaluation of local shape descriptors for 3D shape retrieval
    Tang, Sarah
    Godil, Afzal
    THREE-DIMENSIONAL IMAGE PROCESSING (3DIP) AND APPLICATIONS II, 2012, 8290
  • [39] WalkFormer: 3D mesh analysis via transformer on random walk
    Guo, Qing
    He, Fazhi
    Fan, Bo
    Song, Yupeng
    Dai, Jicheng
    Fan, Linkun
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (07): : 3499 - 3511
  • [40] WalkFormer: 3D mesh analysis via transformer on random walk
    Qing Guo
    Fazhi He
    Bo Fan
    Yupeng Song
    Jicheng Dai
    Linkun Fan
    Neural Computing and Applications, 2024, 36 : 3499 - 3511