Modality-specific Adaptive Scaling Method for Cross-modal Retrieval

被引:0
|
作者
Chen, Baitao [1 ]
Ke, Xiao [1 ]
机构
[1] Fuzhou Univ, Fujian Key Lab Network Comp & Intelligent, Informat Proc Coll Comp & Data Sci, Fuzhou, Fujian, Peoples R China
基金
中国国家自然科学基金;
关键词
Cross-modal retrieval (CMR); common representation learning; modality-specific adaptive scaling;
D O I
10.1109/ICICML57342.2022.10009863
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
There are huge differences in data distribution and feature representation of different modalities. How to flexibly and accurately retrieve data from different modalities is a challenging problem. The mainstream common subspace method only focus on the heterogeneity gap between modalities, and use a unified method to jointly learn the common representation of different modalities, which can easily lead to the difficulty of multi-modal unified fitting. In this work, we innovatively propose the concept of multi-modal information density discrepancy, and propose a modality-specific adaptive scaling method incorporating prior knowledge, which can adaptively learn the most suitable network for different modalities. Comprehensive experimental results on three widely used cross-modal retrieval datasets show the proposed MASM achieves the state-of-the-art results and significantly outperforms other existing methods.
引用
收藏
页码:202 / 205
页数:4
相关论文
共 50 条
  • [21] Does cross-modal correspondence modulate modality-specific perceptual processing? Study using timing judgment tasks
    Uno, Kyuto
    Yokosawa, Kazuhiko
    [J]. ATTENTION PERCEPTION & PSYCHOPHYSICS, 2024, 86 (01) : 273 - 284
  • [22] Does cross-modal correspondence modulate modality-specific perceptual processing? Study using timing judgment tasks
    Kyuto Uno
    Kazuhiko Yokosawa
    [J]. Attention, Perception, & Psychophysics, 2024, 86 : 273 - 284
  • [23] Adaptive Adversarial Learning based cross-modal retrieval
    Li, Zhuoyi
    Lu, Huibin
    Fu, Hao
    Wang, Zhongrui
    Gu, Guanghun
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [24] Modality-Dependent Cross-Modal Retrieval Based on Graph Regularization
    Wang, Guanhua
    Ji, Hua
    Kong, Dexin
    Zhang, Na
    [J]. MOBILE INFORMATION SYSTEMS, 2020, 2020
  • [25] Combining Generic and Specific Information for Cross-modal Retrieval
    Thi Quynh Nhi Tran
    Le Borgne, Nerve
    Crucianu, Michel
    [J]. ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 551 - 554
  • [26] When the modality keeps changing: Event-related potentials (ERPS) show general and modality-specific preparation in a cross-modal stroop task
    Low, Kathy A.
    Jackson, Colleen
    Fabiani, Monica
    Gratton, Gabriele
    [J]. PSYCHOPHYSIOLOGY, 2007, 44 : S55 - S55
  • [27] Adaptive Marginalized Semantic Hashing for Unpaired Cross-Modal Retrieval
    Luo, Kaiyi
    Zhang, Chao
    Li, Huaxiong
    Jia, Xiuyi
    Chen, Chunlin
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 9082 - 9095
  • [28] A Cross-Modal Hash Retrieval Method with Fused Triples
    Li, Wenxiao
    Mei, Hongyan
    Li, Yutian
    Yu, Jiayao
    Zhang, Xing
    Xue, Xiaorong
    Wang, Jiahao
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (18):
  • [29] An Orthogonal Subspace Decomposition Method for Cross-Modal Retrieval
    Zeng, Zhixiong
    Xu, Nan
    Mao, Wenji
    Zeng, Daniel
    [J]. IEEE INTELLIGENT SYSTEMS, 2022, 37 (03) : 45 - 53
  • [30] Cross-Modal Retrieval Based on Semantic Filtering and Adaptive Pooling
    Qiao, Nan
    Mao, Junyi
    Xie, Hao
    Wang, Zhiguo
    Yin, Guangqiang
    [J]. PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND NETWORKS, VOL II, CENET 2023, 2024, 1126 : 296 - 310