3D shape recognition based on multi-modal information fusion

被引:4
|
作者
Liang, Qi [1 ]
Xiao, Mengmeng [1 ]
Song, Dan [1 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin, Peoples R China
基金
中国国家自然科学基金;
关键词
3D shape; Classification; Multi-view; Multi-modal;
D O I
10.1007/s11042-019-08552-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The classification and retrieval of 3D models have been widely used in the field of multimedia and computer vision. With the rapid development of computer graphics, different algorithms corresponding to different representations of 3D models have achieved the best performance. The advances in deep learning also encourage various deep models for 3D feature representation. For multi-view, point cloud, and PANORAMA-view, different models have shown significant performance on 3D shape classification. However, There's not a way to consider utilizing the fusion information of multi-modal for 3D shape classification. In our opinion, We propose a novel multi-modal information fusion method for 3D shape classification, which can fully utilize the advantage of different modal to predict the label of class. More specifically, the proposed can effectively fuse more modal information. it is easy to utilize in other similar applications. We have evaluated our framework on the popular dataset ModelNet40 for the classification task on 3D shape. Series experimental results and comparisons with state-of-the-art methods demonstrate the validity of our approach.
引用
收藏
页码:16173 / 16184
页数:12
相关论文
共 50 条
  • [1] 3D shape recognition based on multi-modal information fusion
    Qi Liang
    Mengmeng Xiao
    Dan Song
    Multimedia Tools and Applications, 2021, 80 : 16173 - 16184
  • [2] MMJN: Multi-Modal Joint Networks for 3D Shape Recognition
    Nie, Weizhi
    Liang, Qi
    Liu, An-An
    Mao, Zhendong
    Li, Yangyang
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 908 - 916
  • [3] FuseNet: a multi-modal feature fusion network for 3D shape classification
    Zhao, Xin
    Chen, Yinhuang
    Yang, Chengzhuan
    Fang, Lincong
    VISUAL COMPUTER, 2025, 41 (04): : 2973 - 2985
  • [4] Multi-modal information fusion for LiDAR-based 3D object detection framework
    Ruixin Ma
    Yong Yin
    Jing Chen
    Rihao Chang
    Multimedia Tools and Applications, 2024, 83 : 7995 - 8012
  • [5] Multi-modal information fusion for LiDAR-based 3D object detection framework
    Ma, Ruixin
    Yin, Yong
    Chen, Jing
    Chang, Rihao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (03) : 7995 - 8012
  • [6] A Fuzzy Interval Valued Fusion Technique for Multi-Modal 3D Face Recognition
    Ramalingam, Soodamani
    Maheswari, Uma
    2016 IEEE INTERNATIONAL CARNAHAN CONFERENCE ON SECURITY TECHNOLOGY (ICCST), 2016, : 225 - 232
  • [7] Research on 3D Object Detection Method Based on Multi-Modal Fusion
    Tian, Feng
    Zong, Neili
    Liu, Fang
    Lu, Yuanyuan
    Liu, Chao
    Jiang, Wenwen
    Zhao, Ling
    Han, Yuxiang
    Computer Engineering and Applications, 2024, 60 (13) : 113 - 123
  • [8] Multi-modal fusion network guided by prior knowledge for 3D CAD model recognition
    Li, Qiang
    Xu, Zibo
    Bai, Shaojin
    Nie, Weizhi
    Liu, Anan
    NEUROCOMPUTING, 2024, 590
  • [9] Adaptive information fusion network for multi-modal personality recognition
    Bao, Yongtang
    Liu, Xiang
    Qi, Yue
    Liu, Ruijun
    Li, Haojie
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2024, 35 (03)
  • [10] Multi-Modal Fusion Based on Depth Adaptive Mechanism for 3D Object Detection
    Liu, Zhanwen
    Cheng, Juanru
    Fan, Jin
    Lin, Shan
    Wang, Yang
    Zhao, Xiangmo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 707 - 717