Open-set 3D model retrieval algorithm based on multi-modal fusion

被引：0

作者：

Mao, Fuxin ^{[1
]}

Yang, Xu ^{[1
]}

Cheng, Jiaqiang ^{[2
]}

Peng, Tao ^{[3
]}

机构：

[1] Engineering Training Center, Tianjin University of Technology and Education, Tianjin,300222, China

[2] Tianjin Huada Technology Limited Company, Tianjin,300131, China

[3] College of Automobile and Transportation, Tianjin University of Technology and Education, Tianjin,300222, China

来源：

Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science) | 2024年 / 58卷 / 01期

关键词：

3D modeling - Semantics - Three dimensional computer graphics;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

An open domain 3D model retrieval algorithm was proposed in order to meet the requirement of management and retrieval of massive new model data under the open domain. The semantic consistency of multimodal information can be effectively used. The category information among unknown samples was explored with the help of unsupervised algorithm. Then the unknown class information was introduced into the parameter optimization process of the network model. The network model has better characterization and retrieval performance in the open domain condition. A hierarchical multi-modal information fusion model based on a Transformer structure was proposed, which could effectively remove the redundant information among the modalities and obtain a more robust model representation vector. Experiments were conducted on the dataset ModelNet40, and the experiments were compared with other typical algorithms. The proposed method outperformed all comparative methods in terms of mAP metrics, which verified the effectiveness of the method in terms of retrieval performance improvement. © 2024 Zhejiang University. All rights reserved.

引用

页码：61 / 70

共 50 条

[1] Hypergraph-Based Multi-Modal Representation for Open-Set 3D Object Retrieval
Feng, Yifan
Ji, Shuyi
Liu, Yu-Shen
Du, Shaoyi
Dai, Qionghai
Gao, Yue
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (04) : 2206 - 2223
[2] 3D shape recognition based on multi-modal information fusion
Qi Liang
Mengmeng Xiao
Dan Song
Multimedia Tools and Applications, 2021, 80 : 16173 - 16184
[3] 3D shape recognition based on multi-modal information fusion
Liang, Qi
Xiao, Mengmeng
Song, Dan
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (11) : 16173 - 16184
[4] SHREC'22 track: Open-Set 3D Object Retrieval
Feng, Yifan
Gao, Yue
Zhao, Xibin
Guo, Yandong
Bagewadi, Nihar
Bui, Nhat-Tan
Dao, Hieu
Gangisetty, Shankar
Guan, Ripeng
Han, Xie
Hua, Cong
Hunakunti, Chidambar
Jiang, Yu
Jiao, Shichao
Ke, Yuqi
Kuang, Liqun
Liu, Anan
Nguyen, Dinh-Huan
Nguyen, Hai-Dang
Nie, Weizhi
Pham, Bang-Dang
Raikar, Karthik
Tang, Qingmei
Tran, Minh-Triet
Wan, Jialong
Yan, Chenggang
You, Haoxuan
Zhu, Difei
COMPUTERS & GRAPHICS-UK, 2022, 107 : 231 - 240
[5] Multi-Modal Clique-Graph Matching for View-Based 3D Model Retrieval
Liu, An-An
Nie, Wei-Zhi
Gao, Yue
Su, Yu-Ting
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (05) : 2103 - 2116
[6] Research on 3D Object Detection Method Based on Multi-Modal Fusion
Tian, Feng
Zong, Neili
Liu, Fang
Lu, Yuanyuan
Liu, Chao
Jiang, Wenwen
Zhao, Ling
Han, Yuxiang
Computer Engineering and Applications, 2024, 60 (13) : 113 - 123
[7] Generating Adversarial Point Clouds on Multi-modal Fusion Based 3D Object Detection Model
Wang, Huiying
Shen, Huixin
Zhang, Boyang
Wen, Yu
Meng, Dan
INFORMATION AND COMMUNICATIONS SECURITY (ICICS 2021), PT I, 2021, 12918 : 187 - 203
[8] Open-set 3D Object Detection
Cen, Jun
Yun, Peng
Cai, Junhao
Wang, Michael Yu
Liu, Ming
2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 869 - 878
[9] Triadic Elastic Structure Representation for Open-Set Incremental 3D Object Retrieval
Xu, Yang
Feng, Yifan
Bie, Lin
PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 20 - 28
[10] Multi-Modal Fusion Based on Depth Adaptive Mechanism for 3D Object Detection
Liu, Zhanwen
Cheng, Juanru
Fan, Jin
Lin, Shan
Wang, Yang
Zhao, Xiangmo
IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 707 - 717

← 1 2 3 4 5 →