Open-set 3D model retrieval algorithm based on multi-modal fusion

被引:0
|
作者
Mao, Fuxin [1 ]
Yang, Xu [1 ]
Cheng, Jiaqiang [2 ]
Peng, Tao [3 ]
机构
[1] Engineering Training Center, Tianjin University of Technology and Education, Tianjin,300222, China
[2] Tianjin Huada Technology Limited Company, Tianjin,300131, China
[3] College of Automobile and Transportation, Tianjin University of Technology and Education, Tianjin,300222, China
关键词
3D modeling - Semantics - Three dimensional computer graphics;
D O I
暂无
中图分类号
学科分类号
摘要
An open domain 3D model retrieval algorithm was proposed in order to meet the requirement of management and retrieval of massive new model data under the open domain. The semantic consistency of multimodal information can be effectively used. The category information among unknown samples was explored with the help of unsupervised algorithm. Then the unknown class information was introduced into the parameter optimization process of the network model. The network model has better characterization and retrieval performance in the open domain condition. A hierarchical multi-modal information fusion model based on a Transformer structure was proposed, which could effectively remove the redundant information among the modalities and obtain a more robust model representation vector. Experiments were conducted on the dataset ModelNet40, and the experiments were compared with other typical algorithms. The proposed method outperformed all comparative methods in terms of mAP metrics, which verified the effectiveness of the method in terms of retrieval performance improvement. © 2024 Zhejiang University. All rights reserved.
引用
收藏
页码:61 / 70
相关论文
共 50 条
  • [21] Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection
    Li, Xin
    Shi, Botian
    Hou, Yuenan
    Wu, Xingjiao
    Ma, Tianlong
    Li, Yikang
    He, Liang
    COMPUTER VISION, ECCV 2022, PT XXXVIII, 2022, 13698 : 691 - 707
  • [22] Multi-modal feature fusion for 3D object detection in the production workshop
    Hou, Rui
    Chen, Guangzhu
    Han, Yinhe
    Tang, Zaizuo
    Ru, Qingjun
    APPLIED SOFT COMPUTING, 2022, 115
  • [23] FuseNet: a multi-modal feature fusion network for 3D shape classification
    Zhao, Xin
    Chen, Yinhuang
    Yang, Chengzhuan
    Fang, Lincong
    VISUAL COMPUTER, 2025, 41 (04): : 2973 - 2985
  • [24] Deformable Feature Fusion Network for Multi-Modal 3D Object Detection
    Guo, Kun
    Gan, Tong
    Ding, Zhao
    Ling, Qiang
    2024 3RD INTERNATIONAL CONFERENCE ON ROBOTICS, ARTIFICIAL INTELLIGENCE AND INTELLIGENT CONTROL, RAIIC 2024, 2024, : 363 - 367
  • [25] MLF3D: Multi-Level Fusion for Multi-Modal 3D Object Detection
    Jiang, Han
    Wang, Jianbin
    Xiao, Jianru
    Zhao, Yanan
    Chen, Wanqing
    Ren, Yilong
    Yu, Haiyang
    2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 1588 - 1593
  • [26] 3DMGNet: 3D Model Generation Network Based on Multi-Modal Data Constraints and Multi-Level Feature Fusion
    Wang, Ende
    Xue, Lei
    Li, Yong
    Zhang, Zhenxin
    Hou, Xukui
    SENSORS, 2020, 20 (17) : 1 - 16
  • [27] Robust 3D Semantic Segmentation Based on Multi-Phase Multi-Modal Fusion for Intelligent Vehicles
    Ni, Peizhou
    Li, Xu
    Xu, Wang
    Kong, Dong
    Hu, Yue
    Wei, Kun
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 1602 - 1614
  • [28] A Multi-Modal Fusion-Based 3D Multi-Object Tracking Framework With Joint Detection
    Wang, Xiyang
    Fu, Chunyun
    He, Jiawei
    Huang, Mingguang
    Meng, Ting
    Zhang, Siyu
    Zhou, Hangning
    Xu, Ziyao
    Zhang, Chi
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (01): : 532 - 539
  • [29] A 3D model retrieval method based on multi-feature fusion
    Tu H.
    International Journal of Information and Communication Technology, 2019, 15 (02) : 121 - 131
  • [30] Multi-modal fusion for associated news story retrieval
    Younessian, Ehsan
    Rajan, Deepu
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (08) : 2563 - 2585