Flexible Dual Multi-Modal Hashing for Incomplete Multi-Modal Retrieval

被引:0
|
作者
Wei, Yuhong [1 ]
An, Junfeng [2 ]
机构
[1] Harbin Inst Technol, Educ Ctr Expt & Innovat, Shenzhen 518055, Peoples R China
[2] Inst Technol, Sch Comp Sci & Technol, Shenzhen 518055, Peoples R China
关键词
Incomplete multi-modal hashing; similarity search; tensor optimization;
D O I
10.1142/S021946782650021X
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Multimodal hashing aims to efficiently integrate multi-source data into a unified discrete Hamming space, facilitating fast similarity searches with minimal query and storage overhead. Traditional multimodal hashing assumes that data from different sources are fully observed, an assumption that fails in real-world scenarios involving large-scale multimodal data, thereby compromising conventional methods. To address these limitations during both training and retrieval, our approach manages dual-stage data missing, occurring in both phases. In this paper, we introduce a novel framework called Flexible Dual Multimodal Hashing (FDMH), which recovers missing data at both stages by jointly leveraging low-dimensional data relations and semantic graph structural relationships in multi-source data, achieving promising performance in incomplete multimodal retrieval. We transform the original features into anchor graphs and use existing modalities to reconstruct the anchor graphs of missing modalities. Based on these anchor graphs, we perform weight-adaptive fusion in the semantic space, supervised by original semantic labels, and apply a tensor nuclear norm to enforce consistency constraints on the projection matrices across different modalities. Furthermore, our method flexibly fuses existing and recovered modalities during retrieval. We validate the effectiveness of our approach through extensive experiments on four large-scale multimodal datasets, demonstrating its robust performance in real-world dual-missing retrieval scenarios.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] Review of Multi-Modal Retrieval in Medicine
    Ding, Guohui
    Zhang, Qi
    Fang, Shichao
    Li, Qing
    Sun, Xiaoyu
    Zhang, Luxia
    Kong, Guilan
    Computer Engineering and Applications, 2023, 59 (01) : 26 - 36
  • [22] A Framework for Enabling Unpaired Multi-Modal Learning for Deep Cross-Modal Hashing Retrieval
    Williams-Lekuona, Mikel
    Cosma, Georgina
    Phillips, Iain
    JOURNAL OF IMAGING, 2022, 8 (12)
  • [23] RetrievalMMT: Retrieval-Constrained Multi-Modal Prompt Learning for Multi-Modal Machine Translation
    Wang, Yan
    Zeng, Yawen
    Liang, Junjie
    Xing, Xiaofen
    Xu, Jin
    Xu, Xiangmin
    PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 860 - 868
  • [24] Multi-modal anchor adaptation learning for multi-modal summarization
    Chen, Zhongfeng
    Lu, Zhenyu
    Rong, Huan
    Zhao, Chuanjun
    Xu, Fan
    NEUROCOMPUTING, 2024, 570
  • [25] Multi-modal and cross-modal for lecture videos retrieval
    Nhu Van Nguyen
    Coustaty, Mickal
    Ogier, Jean-Marc
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 2667 - 2672
  • [26] EDMH: Efficient discrete matrix factorization hashing for multi-modal similarity retrieval
    Yang, Fan
    Ding, Xiaojian
    Ma, Fumin
    Tong, Deyu
    Cao, Jie
    INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (03)
  • [27] Cross-Modal Retrieval Augmentation for Multi-Modal Classification
    Gur, Shir
    Neverova, Natalia
    Stauffer, Chris
    Lim, Ser-Nam
    Kiela, Douwe
    Reiter, Austin
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 111 - 123
  • [28] One for more: Structured Multi-Modal Hashing for multiple multimedia retrieval tasks
    Zheng, Chaoqun
    Li, Fengling
    Zhu, Lei
    Zhang, Zheng
    Lu, Wenpeng
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 233
  • [29] Multi-modal semantic autoencoder for cross-modal retrieval
    Wu, Yiling
    Wang, Shuhui
    Huang, Qingming
    NEUROCOMPUTING, 2019, 331 : 165 - 175
  • [30] Parametric CAD Primitive Retrieval via Multi-Modal Fusion and Deep Hashing
    Xu, Minyang
    Lou, Yunzhong
    Ma, Weijian
    Li, Xueyang
    Zhou, Xiangdong
    PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 1061 - 1069