DESIGN AND RESEARCH OF A MULTI-VIEW GRAPH DEEP LEARNING 3D MODEL RETRIEVAL SYSTEM BASED ON FUSION VISION-TRANSFORMER

被引:0
|
作者
Liang, Rong [1 ]
Li, Fangping [1 ]
机构
[1] Taiyuan Univ, Dept Art & Design, 7 Fendong St,Tanghuai Ind Pk, Taiyuan 030000, Peoples R China
关键词
Vision-Transformer; Multi-perspective graph convolutional neural network; 3D; Perspective image; Image entropy;
D O I
10.24507/ijicic.20.06.1775
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The development of computer vision has made three-dimensional models play a crucial role in the field of image processing. However, compared to 2D models, 3D models have more features, making it difficult to extract features and mine correlation information between features. Based on this, this study is based on a multi-perspective graph convolutional neural network, which uses an image entropy weight pooling layer to improve the original view pooling layer. It assigns a weight based on image entropy to each perspective image, and then performs view pooling operations. The Vision-Transformer module is embedded into a multi-perspective graph convolutional neural network to mine information associations between multi-view graphs. The results show that the multi- perspective graph convolutional neural network model fused with Vision-Transformer is more concentrated in classifying features of the same category in the view graph, and there is a significant distance difference between different features. The multi-perspective graph convolutional neural network model fused with Vision-Transformer achieves accuracy of 89.0%, 92.0%, 94.0%, and mean average precision values of 80.0%, 85.0%, and 88.0% when the number of view images is 6, 10, and 14. This study improves the retrieval accuracy of 3D models and has certain reference value in the field of computer vision.
引用
收藏
页码:1775 / 1788
页数:14
相关论文
共 50 条
  • [31] Review of multi-view 3D object recognition methods based on deep learning
    Qi, Shaohua
    Ning, Xin
    Yang, Guowei
    Zhang, Liping
    Long, Peng
    Cai, Weiwei
    Li, Weijun
    DISPLAYS, 2021, 69
  • [32] Efficient Hierarchical Multi-view Fusion Transformer for 3D Human Pose Estimation
    Zhou, Kangkang
    Zhang, Lijun
    Lu, Feng
    Zhou, Xiang-Dong
    Shi, Yu
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 7512 - 7520
  • [33] Triangular Patch Based Texture fusion for Multi-view 3D Face Model
    Yang, Shan-min
    Lin, Yi
    Zhang, Jian-wei
    TENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2018), 2019, 11069
  • [34] Overview of Multi-View 3D Reconstruction Techniques in Deep Learning
    Wang, Wenju
    Tang, Bang
    Gu, Zehua
    Wang, Sen
    Computer Engineering and Applications, 2025, 61 (06) : 22 - 35
  • [35] NON-RIGID 3D SHAPE RETRIEVAL BASED ON MULTI-VIEW METRIC LEARNING
    Li, Haohao
    Wang, Shengfa
    Li, Nannan
    Su, Zhixun
    Liu, Ximin
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2019, : 441 - 446
  • [36] Aggregated Deep Convolutional Neural Networks for Multi-View 3D Object Retrieval
    Alzu'bi, Ahmad
    Abuarqoub, Abdelrahman
    Al-Hmouz, Ahmed
    2019 11TH INTERNATIONAL CONGRESS ON ULTRA MODERN TELECOMMUNICATIONS AND CONTROL SYSTEMS AND WORKSHOPS (ICUMT), 2019,
  • [37] A real sense 3D face reconstruction system based on multi-view stereo vision
    Li, Ke
    Zeng, Dong
    Zhang, Jun
    Lin, Rui
    Gao, Luobin
    Liao, Xiaoli
    Journal of Information and Computational Science, 2015, 12 (10): : 3739 - 3753
  • [38] A Transformer-based Network for Multi-view 3D Mesh Generation
    Shi, Wuzhen
    Liu, Zhijie
    Li, Yingxiang
    Wen, Yang
    Liu, Yutao
    Proceedings - 2023 IEEE SmartWorld, Ubiquitous Intelligence and Computing, Autonomous and Trusted Vehicles, Scalable Computing and Communications, Digital Twin, Privacy Computing and Data Security, Metaverse, SmartWorld/UIC/ATC/ScalCom/DigitalTwin/PCDS/Metaverse 2023, 2023,
  • [39] View-Based 3D Model Retrieval via Multi-graph Matching
    Nie, Weizhi
    Liu, Anan
    Hao, Yahui
    Su, Yuting
    NEURAL PROCESSING LETTERS, 2018, 48 (03) : 1395 - 1404
  • [40] View-Based 3D Model Retrieval via Multi-graph Matching
    Weizhi Nie
    Anan Liu
    Yahui Hao
    Yuting Su
    Neural Processing Letters, 2018, 48 : 1395 - 1404