DESIGN AND RESEARCH OF A MULTI-VIEW GRAPH DEEP LEARNING 3D MODEL RETRIEVAL SYSTEM BASED ON FUSION VISION-TRANSFORMER

被引：0

作者：

Liang, Rong ^{[1
]}

Li, Fangping ^{[1
]}

机构：

[1] Taiyuan Univ, Dept Art & Design, 7 Fendong St,Tanghuai Ind Pk, Taiyuan 030000, Peoples R China

来源：

INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL | 2024年 / 20卷 / 06期

关键词：

Vision-Transformer; Multi-perspective graph convolutional neural network; 3D; Perspective image; Image entropy;

D O I：

10.24507/ijicic.20.06.1775

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The development of computer vision has made three-dimensional models play a crucial role in the field of image processing. However, compared to 2D models, 3D models have more features, making it difficult to extract features and mine correlation information between features. Based on this, this study is based on a multi-perspective graph convolutional neural network, which uses an image entropy weight pooling layer to improve the original view pooling layer. It assigns a weight based on image entropy to each perspective image, and then performs view pooling operations. The Vision-Transformer module is embedded into a multi-perspective graph convolutional neural network to mine information associations between multi-view graphs. The results show that the multi- perspective graph convolutional neural network model fused with Vision-Transformer is more concentrated in classifying features of the same category in the view graph, and there is a significant distance difference between different features. The multi-perspective graph convolutional neural network model fused with Vision-Transformer achieves accuracy of 89.0%, 92.0%, 94.0%, and mean average precision values of 80.0%, 85.0%, and 88.0% when the number of view images is 6, 10, and 14. This study improves the retrieval accuracy of 3D models and has certain reference value in the field of computer vision.

引用

页码：1775 / 1788

页数：14

共 50 条

[31] Review of multi-view 3D object recognition methods based on deep learning
Qi, Shaohua
Ning, Xin
Yang, Guowei
Zhang, Liping
Long, Peng
Cai, Weiwei
Li, Weijun
DISPLAYS, 2021, 69
[32] Efficient Hierarchical Multi-view Fusion Transformer for 3D Human Pose Estimation
Zhou, Kangkang
Zhang, Lijun
Lu, Feng
Zhou, Xiang-Dong
Shi, Yu
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 7512 - 7520
[33] Triangular Patch Based Texture fusion for Multi-view 3D Face Model
Yang, Shan-min
Lin, Yi
Zhang, Jian-wei
TENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2018), 2019, 11069
[34] Overview of Multi-View 3D Reconstruction Techniques in Deep Learning
Wang, Wenju
Tang, Bang
Gu, Zehua
Wang, Sen
Computer Engineering and Applications, 2025, 61 (06) : 22 - 35
[35] NON-RIGID 3D SHAPE RETRIEVAL BASED ON MULTI-VIEW METRIC LEARNING
Li, Haohao
Wang, Shengfa
Li, Nannan
Su, Zhixun
Liu, Ximin
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2019, : 441 - 446
[36] Aggregated Deep Convolutional Neural Networks for Multi-View 3D Object Retrieval
Alzu'bi, Ahmad
Abuarqoub, Abdelrahman
Al-Hmouz, Ahmed
2019 11TH INTERNATIONAL CONGRESS ON ULTRA MODERN TELECOMMUNICATIONS AND CONTROL SYSTEMS AND WORKSHOPS (ICUMT), 2019,
[37] A real sense 3D face reconstruction system based on multi-view stereo vision
Li, Ke
Zeng, Dong
Zhang, Jun
Lin, Rui
Gao, Luobin
Liao, Xiaoli
Journal of Information and Computational Science, 2015, 12 (10): : 3739 - 3753
[38] A Transformer-based Network for Multi-view 3D Mesh Generation
Shi, Wuzhen
Liu, Zhijie
Li, Yingxiang
Wen, Yang
Liu, Yutao
Proceedings - 2023 IEEE SmartWorld, Ubiquitous Intelligence and Computing, Autonomous and Trusted Vehicles, Scalable Computing and Communications, Digital Twin, Privacy Computing and Data Security, Metaverse, SmartWorld/UIC/ATC/ScalCom/DigitalTwin/PCDS/Metaverse 2023, 2023,
[39] View-Based 3D Model Retrieval via Multi-graph Matching
Nie, Weizhi
Liu, Anan
Hao, Yahui
Su, Yuting
NEURAL PROCESSING LETTERS, 2018, 48 (03) : 1395 - 1404
[40] View-Based 3D Model Retrieval via Multi-graph Matching
Weizhi Nie
Anan Liu
Yahui Hao
Yuting Su
Neural Processing Letters, 2018, 48 : 1395 - 1404

← 1 2 3 4 5 →