Modality-specific Adaptive Scaling Method for Cross-modal Retrieval

被引:0
|
作者
Chen, Baitao [1 ]
Ke, Xiao [1 ]
机构
[1] Fuzhou Univ, Fujian Key Lab Network Comp & Intelligent, Informat Proc Coll Comp & Data Sci, Fuzhou, Fujian, Peoples R China
基金
中国国家自然科学基金;
关键词
Cross-modal retrieval (CMR); common representation learning; modality-specific adaptive scaling;
D O I
10.1109/ICICML57342.2022.10009863
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
There are huge differences in data distribution and feature representation of different modalities. How to flexibly and accurately retrieve data from different modalities is a challenging problem. The mainstream common subspace method only focus on the heterogeneity gap between modalities, and use a unified method to jointly learn the common representation of different modalities, which can easily lead to the difficulty of multi-modal unified fitting. In this work, we innovatively propose the concept of multi-modal information density discrepancy, and propose a modality-specific adaptive scaling method incorporating prior knowledge, which can adaptively learn the most suitable network for different modalities. Comprehensive experimental results on three widely used cross-modal retrieval datasets show the proposed MASM achieves the state-of-the-art results and significantly outperforms other existing methods.
引用
收藏
页码:202 / 205
页数:4
相关论文
共 50 条
  • [1] Modality-specific matrix factorization hashing for cross-modal retrieval
    Haixia Xiong
    Weihua Ou
    Zengxian Yan
    Jianping Gou
    Quan Zhou
    Anzhi Wang
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2022, 13 : 5067 - 5081
  • [2] MODALITY-SPECIFIC STRUCTURE PRESERVING HASHING FOR CROSS-MODAL RETRIEVAL
    Liu, Xingbo
    Nie, Xiushan
    Sun, Haoliang
    Cui, Chaoran
    Yin, Yilong
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 1678 - 1682
  • [3] Modality-specific matrix factorization hashing for cross-modal retrieval
    Xiong, Haixia
    Ou, Weihua
    Yan, Zengxian
    Gou, Jianping
    Zhou, Quan
    Wang, Anzhi
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 13 (11) : 5067 - 5081
  • [4] Modality-specific and shared generative adversarial network for cross-modal retrieval
    Wu, Fei
    Jing, Xiao-Yuan
    Wu, Zhiyong
    Ji, Yimu
    Dong, Xiwei
    Luo, Xiaokai
    Huang, Qinghua
    Wang, Ruchuan
    [J]. PATTERN RECOGNITION, 2020, 104
  • [5] Enhancing Cross-Modal Retrieval Based on Modality-Specific and Embedding Spaces
    Yanagi, Rintaro
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    [J]. IEEE ACCESS, 2020, 8 : 96777 - 96786
  • [6] Image-Text Cross-Modal Retrieval via Modality-Specific Feature Learning
    Wang, Jian
    He, Yonghao
    Kang, Cuicui
    Xiang, Shiming
    Pan, Chunhong
    [J]. ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 347 - 354
  • [7] Cross-modal deactivations during modality-specific selective attention
    Mozolic, Jennifer L.
    Joyner, David
    Hugenschmidt, Christina E.
    Peiffer, Ann M.
    Kraft, Robert A.
    Maldjian, Joseph A.
    Laurienti, Paul J.
    [J]. BMC NEUROLOGY, 2008, 8 (1)
  • [8] Cross-modal deactivations during modality-specific selective attention
    Jennifer L Mozolic
    David Joyner
    Christina E Hugenschmidt
    Ann M Peiffer
    Robert A Kraft
    Joseph A Maldjian
    Paul J Laurienti
    [J]. BMC Neurology, 8
  • [9] Modality-Specific Cross-Modal Similarity Measurement With Recurrent Attention Network
    Peng, Yuxin
    Qi, Jinwei
    Yuan, Yuxin
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (11) : 5585 - 5599
  • [10] Modeling cross-modal enhancement and modality-specific suppression in multisensory neurons
    Patton, PE
    Anastasio, TJ
    [J]. NEURAL COMPUTATION, 2003, 15 (04) : 783 - 810