Open-set 3D model retrieval algorithm based on multi-modal fusion

被引:0
|
作者
Mao, Fuxin [1 ]
Yang, Xu [1 ]
Cheng, Jiaqiang [2 ]
Peng, Tao [3 ]
机构
[1] Engineering Training Center, Tianjin University of Technology and Education, Tianjin,300222, China
[2] Tianjin Huada Technology Limited Company, Tianjin,300131, China
[3] College of Automobile and Transportation, Tianjin University of Technology and Education, Tianjin,300222, China
关键词
3D modeling - Semantics - Three dimensional computer graphics;
D O I
暂无
中图分类号
学科分类号
摘要
An open domain 3D model retrieval algorithm was proposed in order to meet the requirement of management and retrieval of massive new model data under the open domain. The semantic consistency of multimodal information can be effectively used. The category information among unknown samples was explored with the help of unsupervised algorithm. Then the unknown class information was introduced into the parameter optimization process of the network model. The network model has better characterization and retrieval performance in the open domain condition. A hierarchical multi-modal information fusion model based on a Transformer structure was proposed, which could effectively remove the redundant information among the modalities and obtain a more robust model representation vector. Experiments were conducted on the dataset ModelNet40, and the experiments were compared with other typical algorithms. The proposed method outperformed all comparative methods in terms of mAP metrics, which verified the effectiveness of the method in terms of retrieval performance improvement. © 2024 Zhejiang University. All rights reserved.
引用
收藏
页码:61 / 70
相关论文
共 50 条
  • [1] Hypergraph-Based Multi-Modal Representation for Open-Set 3D Object Retrieval
    Feng, Yifan
    Ji, Shuyi
    Liu, Yu-Shen
    Du, Shaoyi
    Dai, Qionghai
    Gao, Yue
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (04) : 2206 - 2223
  • [2] 3D shape recognition based on multi-modal information fusion
    Qi Liang
    Mengmeng Xiao
    Dan Song
    Multimedia Tools and Applications, 2021, 80 : 16173 - 16184
  • [3] 3D shape recognition based on multi-modal information fusion
    Liang, Qi
    Xiao, Mengmeng
    Song, Dan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (11) : 16173 - 16184
  • [4] SHREC'22 track: Open-Set 3D Object Retrieval
    Feng, Yifan
    Gao, Yue
    Zhao, Xibin
    Guo, Yandong
    Bagewadi, Nihar
    Bui, Nhat-Tan
    Dao, Hieu
    Gangisetty, Shankar
    Guan, Ripeng
    Han, Xie
    Hua, Cong
    Hunakunti, Chidambar
    Jiang, Yu
    Jiao, Shichao
    Ke, Yuqi
    Kuang, Liqun
    Liu, Anan
    Nguyen, Dinh-Huan
    Nguyen, Hai-Dang
    Nie, Weizhi
    Pham, Bang-Dang
    Raikar, Karthik
    Tang, Qingmei
    Tran, Minh-Triet
    Wan, Jialong
    Yan, Chenggang
    You, Haoxuan
    Zhu, Difei
    COMPUTERS & GRAPHICS-UK, 2022, 107 : 231 - 240
  • [5] Multi-Modal Clique-Graph Matching for View-Based 3D Model Retrieval
    Liu, An-An
    Nie, Wei-Zhi
    Gao, Yue
    Su, Yu-Ting
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (05) : 2103 - 2116
  • [6] Research on 3D Object Detection Method Based on Multi-Modal Fusion
    Tian, Feng
    Zong, Neili
    Liu, Fang
    Lu, Yuanyuan
    Liu, Chao
    Jiang, Wenwen
    Zhao, Ling
    Han, Yuxiang
    Computer Engineering and Applications, 2024, 60 (13) : 113 - 123
  • [7] Generating Adversarial Point Clouds on Multi-modal Fusion Based 3D Object Detection Model
    Wang, Huiying
    Shen, Huixin
    Zhang, Boyang
    Wen, Yu
    Meng, Dan
    INFORMATION AND COMMUNICATIONS SECURITY (ICICS 2021), PT I, 2021, 12918 : 187 - 203
  • [8] Open-set 3D Object Detection
    Cen, Jun
    Yun, Peng
    Cai, Junhao
    Wang, Michael Yu
    Liu, Ming
    2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 869 - 878
  • [9] Triadic Elastic Structure Representation for Open-Set Incremental 3D Object Retrieval
    Xu, Yang
    Feng, Yifan
    Bie, Lin
    PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 20 - 28
  • [10] Multi-Modal Fusion Based on Depth Adaptive Mechanism for 3D Object Detection
    Liu, Zhanwen
    Cheng, Juanru
    Fan, Jin
    Lin, Shan
    Wang, Yang
    Zhao, Xiangmo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 707 - 717