Open-set 3D model retrieval algorithm based on multi-modal fusion

被引:0
|
作者
Mao, Fuxin [1 ]
Yang, Xu [1 ]
Cheng, Jiaqiang [2 ]
Peng, Tao [3 ]
机构
[1] Engineering Training Center, Tianjin University of Technology and Education, Tianjin,300222, China
[2] Tianjin Huada Technology Limited Company, Tianjin,300131, China
[3] College of Automobile and Transportation, Tianjin University of Technology and Education, Tianjin,300222, China
关键词
3D modeling - Semantics - Three dimensional computer graphics;
D O I
暂无
中图分类号
学科分类号
摘要
An open domain 3D model retrieval algorithm was proposed in order to meet the requirement of management and retrieval of massive new model data under the open domain. The semantic consistency of multimodal information can be effectively used. The category information among unknown samples was explored with the help of unsupervised algorithm. Then the unknown class information was introduced into the parameter optimization process of the network model. The network model has better characterization and retrieval performance in the open domain condition. A hierarchical multi-modal information fusion model based on a Transformer structure was proposed, which could effectively remove the redundant information among the modalities and obtain a more robust model representation vector. Experiments were conducted on the dataset ModelNet40, and the experiments were compared with other typical algorithms. The proposed method outperformed all comparative methods in terms of mAP metrics, which verified the effectiveness of the method in terms of retrieval performance improvement. © 2024 Zhejiang University. All rights reserved.
引用
收藏
页码:61 / 70
相关论文
共 50 条
  • [31] Multi-modal fusion for associated news story retrieval
    Ehsan Younessian
    Deepu Rajan
    Multimedia Tools and Applications, 2015, 74 : 2563 - 2585
  • [32] Smartphone-Based 3D Indoor Pedestrian Positioning through Multi-Modal Data Fusion
    Zhao, Hongyu
    Cheng, Wanli
    Yang, Ning
    Qiu, Sen
    Wang, Zhelong
    Wang, Jianjun
    SENSORS, 2019, 19 (20)
  • [33] Height-Adaptive Deformable Multi-Modal Fusion for 3D Object Detection
    Li, Jiahao
    Chen, Lingshan
    Li, Zhen
    IEEE ACCESS, 2025, 13 : 52385 - 52396
  • [34] Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion
    Zheng, Meng
    Planche, Benjamin
    Gong, Xuan
    Yang, Fan
    Chen, Terrence
    Wu, Ziyan
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT VII, 2022, 13437 : 115 - 125
  • [35] Frustum FusionNet: Amodal 3D Object Detection with Multi-Modal Feature Fusion
    Zuo, Liangyu
    Li, Yaochen
    Han, Mengtao
    Li, Qiao
    Liu, Yuehu
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 2746 - 2751
  • [36] ObjectFusion: Multi-modal 3D Object Detection with Object-Centric Fusion
    Cai, Qi
    Pan, Yingwei
    Yao, Ting
    Ngo, Chong-Wah
    Mei, Tao
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 18021 - 18030
  • [37] A Fuzzy Interval Valued Fusion Technique for Multi-Modal 3D Face Recognition
    Ramalingam, Soodamani
    Maheswari, Uma
    2016 IEEE INTERNATIONAL CARNAHAN CONFERENCE ON SECURITY TECHNOLOGY (ICCST), 2016, : 225 - 232
  • [38] Enhancing 3D object detection through multi-modal fusion for cooperative perception
    Xia, Bin
    Zhou, Jun
    Kong, Fanyu
    You, Yuhe
    Yang, Jiarui
    Lin, Lin
    ALEXANDRIA ENGINEERING JOURNAL, 2024, 104 : 46 - 55
  • [39] Multi-Modal Meta-Transfer Fusion Network for Few-Shot 3D Model Classification
    He-Yu Zhou
    An-An Liu
    Chen-Yu Zhang
    Ping Zhu
    Qian-Yi Zhang
    Mohan Kankanhalli
    International Journal of Computer Vision, 2024, 132 (3) : 673 - 688
  • [40] Multi-Modal Meta-Transfer Fusion Network for Few-Shot 3D Model Classification
    Zhou, He-Yu
    Liu, An-An
    Zhang, Chen-Yu
    Zhu, Ping
    Zhang, Qian-Yi
    Kankanhalli, Mohan
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (03) : 673 - 688