3D mesh transformer: A hierarchical neural network with local shape tokens

被引：3

作者：

Chen, Yu ^{[1
]}

Zhao, Jieyu ^{[1
]}

Huang, Lingfeng ^{[1
]}

Chen, Hao ^{[1
]}

机构：

[1] Ningbo Univ, Fac Elect Engn & Comp Sci, Ningbo 315000, Peoples R China

来源：

NEUROCOMPUTING | 2022年 / 514卷

基金：

中国国家自然科学基金;

关键词：

self-attention networks; 3D mesh Transformer; polynomial fitting; surface subdivision; multilayer Transformer;

D O I：

10.1016/j.neucom.2022.09.138

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Self-attention networks have revolutionized Natural Language Processing (NLP) and are making impres-sive strides in image analysis tasks such as image classification and object detection. Inspired by this suc-cess, we specifically design a novel self-attention mechanism between local shapes and build a shape Transformer. We split the 3D mesh model into shape patches, which we call shape tokens, and provide polynomial fitting representations of these patches as input to the shape Transformer. The shape token encodes local geometric information and resembles the token (word) status in NLP. The simplification of the mesh model provides a hierarchical multiresolution structure, which allows us to realize the fea-ture learning of a multilayer Transformer. We set high-level features formed by the shape Transformer as visual tokens and propose a vector-type self-attention mechanism to construct a 3D visual Transformer. Finally, we realized a hierarchical network structure based on local shape tokens and high-level visual tokens. Experiments show that our fusion network of 3D shape Transformer with explicit local shape con-text augmentation and 3D visual Transformer with multi-level structural feature learning achieves excel-lent performance on shape classification and part segmentation tasks.(c) 2022 Elsevier B.V. All rights reserved.

引用

页码：328 / 340

页数：13

共 50 条

[1] MeshNet: Mesh Neural Network for 3D Shape Representation
Feng, Yutong
Feng, Yifan
You, Haoxuan
Zhao, Xibin
Gao, Yue
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8279 - 8286
[2] Deep Neural Network for 3D Shape Classification Based on Mesh Feature
Gao, Mengran
Ruan, Ningjun
Shi, Junpeng
Zhou, Wanli
SENSORS, 2022, 22 (18)
[3] HMTN: Hierarchical Multi-scale Transformer Network for 3D Shape Recognition
Zhao, Yue
Nie, Weizhi
Gao, Zan
Liu, An-an
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022,
[4] Feature preserving 3D mesh denoising with a Dense Local Graph Neural Network
Tang, Wenming
Gong, Yuanhao
Qiu, Guoping
COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 233
[5] 3D MESH STEGANALYSIS USING LOCAL SHAPE FEATURES
Li, Zhenyu
Bors, Adrian G.
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 2144 - 2148
[6] Video2mesh: 3D human pose and shape recovery by a temporal convolutional transformer network
Chao, Xianjin
Ge, Zhipeng
Leung, Howard
IET COMPUTER VISION, 2023, 17 (04) : 379 - 388
[7] A 3D shape classifier with neural network supervision
Liu, Zhenbao
Mitani, Jun
Fukui, Yukio
Nishihara, Seiichi
INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2010, 38 (1-3) : 134 - 143
[8] Deformable Mesh Transformer for 3D Human Mesh Recovery
Yoshiyasu, Yusuke
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17006 - 17015
[9] Laplacian Mesh Transformer: Dual Attention and Topology Aware Network for 3D Mesh Classification and Segmentation
Li, Xiao-Juan
Yang, Jie
Zhang, Fang-Lue
COMPUTER VISION, ECCV 2022, PT XXIX, 2022, 13689 : 541 - 560
[10] Local Transformer Network on 3D Point Cloud Semantic Segmentation
Wang, Zijun
Wang, Yun
An, Lifeng
Liu, Jian
Liu, Haiyang
INFORMATION, 2022, 13 (04)

← 1 2 3 4 5 →