3D shape recognition based on multi-modal information fusion

Cited by: 4

Authors:

Liang, Qi [1]

Xiao, Mengmeng [1]

Song, Dan [1]

Affiliations:

[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin, Peoples R China

Source:

MULTIMEDIA TOOLS AND APPLICATIONS | 2021 / Vol. 80 / Issue 11

Funding:

National Natural Science Foundation of China;

Keywords:

3D shape; Classification; Multi-view; Multi-modal;

DOI:

10.1007/s11042-019-08552-7

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

The classification and retrieval of 3D models are widely used in multimedia and computer vision. With the rapid development of computer graphics, different algorithms, each tied to a particular representation of 3D models, have achieved leading performance. Advances in deep learning have also encouraged a variety of deep models for 3D feature representation. For multi-view, point cloud, and PANORAMA-view representations, different models have shown strong performance on 3D shape classification. However, there is not yet a method that uses fused multi-modal information for 3D shape classification. We propose a novel multi-modal information fusion method for 3D shape classification that exploits the advantages of the different modalities to predict the class label. More specifically, the proposed method can effectively fuse additional modal information, and it is easy to apply in other similar applications. We evaluated our framework on the popular ModelNet40 dataset for the 3D shape classification task. A series of experimental results and comparisons with state-of-the-art methods demonstrate the validity of our approach.


Pages: 16173-16184

Page count: 12
