3D shape recognition based on multi-modal information fusion

被引:4
|
作者
Liang, Qi [1 ]
Xiao, Mengmeng [1 ]
Song, Dan [1 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin, Peoples R China
基金
中国国家自然科学基金;
关键词
3D shape; Classification; Multi-view; Multi-modal;
D O I
10.1007/s11042-019-08552-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The classification and retrieval of 3D models have been widely used in the field of multimedia and computer vision. With the rapid development of computer graphics, different algorithms corresponding to different representations of 3D models have achieved the best performance. The advances in deep learning also encourage various deep models for 3D feature representation. For multi-view, point cloud, and PANORAMA-view, different models have shown significant performance on 3D shape classification. However, There's not a way to consider utilizing the fusion information of multi-modal for 3D shape classification. In our opinion, We propose a novel multi-modal information fusion method for 3D shape classification, which can fully utilize the advantage of different modal to predict the label of class. More specifically, the proposed can effectively fuse more modal information. it is easy to utilize in other similar applications. We have evaluated our framework on the popular dataset ModelNet40 for the classification task on 3D shape. Series experimental results and comparisons with state-of-the-art methods demonstrate the validity of our approach.
引用
收藏
页码:16173 / 16184
页数:12
相关论文
共 50 条
  • [31] On Multi-modal Fusion for Freehand Gesture Recognition
    Schak, Monika
    Gepperth, Alexander
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT I, 2020, 12396 : 862 - 873
  • [32] Visual Sorting Method Based on Multi-Modal Information Fusion
    Han, Song
    Liu, Xiaoping
    Wang, Gang
    APPLIED SCIENCES-BASEL, 2022, 12 (06):
  • [33] News video classification based on multi-modal information fusion
    Lie, WN
    Su, CK
    2005 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), VOLS 1-5, 2005, : 1021 - 1024
  • [34] Multi-Modal Fusion Technology Based on Vehicle Information: A Survey
    Zhang, Xinyu
    Gong, Yan
    Lu, Jianli
    Wu, Jiayi
    Li, Zhiwei
    Jin, Dafeng
    Li, Jun
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (06): : 3605 - 3619
  • [35] Multi-modal CrossViT using 3D spatial information for visual localization
    Junekoo Kang
    Mark Mpabulungi
    Hyunki Hong
    Multimedia Tools and Applications, 2025, 84 (5) : 2059 - 2083
  • [36] Generating Adversarial Point Clouds on Multi-modal Fusion Based 3D Object Detection Model
    Wang, Huiying
    Shen, Huixin
    Zhang, Boyang
    Wen, Yu
    Meng, Dan
    INFORMATION AND COMMUNICATIONS SECURITY (ICICS 2021), PT I, 2021, 12918 : 187 - 203
  • [37] Smartphone-Based 3D Indoor Pedestrian Positioning through Multi-Modal Data Fusion
    Zhao, Hongyu
    Cheng, Wanli
    Yang, Ning
    Qiu, Sen
    Wang, Zhelong
    Wang, Jianjun
    SENSORS, 2019, 19 (20)
  • [38] MV-LFN: Multi-view based local information fusion network for 3D shape recognition
    Zhang, Jing
    Zhou, Dangdang
    Zhao, Yue
    Nie, Weizhi
    Su, Yuting
    VISUAL INFORMATICS, 2021, 5 (03) : 114 - 119
  • [39] DmifNet: 3D Shape Reconstruction based on Dynamic Multi Branch Information Fusion
    Li, Lei
    Wu, Suping
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7219 - 7225
  • [40] A survey of approaches and challenges in 3D and multi-modal 3D+2D face recognition
    Bowyer, KW
    Chang, K
    Flynn, P
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2006, 101 (01) : 1 - 15