Learning Cross-Modal Retrieval with Noisy Labels

被引:59
|
作者
Hu, Peng [1 ,2 ]
Peng, Xi [1 ]
Zhu, Hongyuan [2 ]
Zhen, Liangli [3 ]
Lin, Jie [2 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
[2] Agcy Sci Technol & Res, Inst Infocomm Res, Singapore, Singapore
[3] Agcy Sci Technol & Res, Inst High Performance Comp, Singapore, Singapore
基金
国家重点研发计划;
关键词
HASHING NETWORK;
D O I
10.1109/CVPR46437.2021.00536
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently; cross-modal retrieval is emerging with the help of deep multimodal learning. However, even for unimodal data, collecting large-scale well-annotated data is expensive and time-consuming, and not to mention the additional challenges from multiple modalities. Although crowd-sourcing annotation, e.g., Amazon's Mechanical Turk, can be utilized to mitigate the labeling cost, but leading to the unavoidable noise in labels for the non-expert annotating. To tackle the challenge, this paper presents a general Multimodal Robust Learning framework (MRL) for learning with multimodal noisy labels to mitigate noisy samples and correlate distinct modalities simultaneously. To be specific, we propose a Robust Clustering loss (RC) to make the deep networks focus on clean samples instead of noisy ones. Besides, a simple yet effective multimodal loss function, called Multimodal Contrastive loss (MC), is proposed to maximize the mutual information between different modalities, thus alleviating the interference of noisy samples and cross-modal discrepancy. Extensive experiments are conducted on four widely-used multimodal datasets to demonstrate the effectiveness of the proposed approach by comparing to 14 state-of-the-art methods.
引用
收藏
页码:5399 / 5409
页数:11
相关论文
共 50 条
  • [1] CROSS-MODAL RETRIEVAL WITH NOISY LABELS
    Mandal, Devraj
    Biswas, Soma
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2326 - 2330
  • [2] Neighborhood Learning from Noisy Labels for Cross-Modal Retrieval
    Li, Runhao
    Weng, Zhenyu
    Zhuang, Huiping
    Chen, Yongming
    Lin, Zhiping
    [J]. 2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,
  • [3] Early-Learning regularized Contrastive Learning for Cross-Modal Retrieval with Noisy Labels
    Xu, Tianyuan
    Liu, Xueliang
    Huang, Zhen
    Guo, Dan
    Hong, Richang
    Wang, Meng
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022,
  • [4] Deep Evidential Learning with Noisy Correspondence for Cross-modal Retrieval
    Qin, Yang
    Peng, Dezhong
    Peng, Xi
    Wang, Xu
    Hu, Peng
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4948 - 4956
  • [5] Robust zero-shot discrete hashing with noisy labels for cross-modal retrieval
    Yong, Kailing
    Shu, Zhenqiu
    Wang, Hongbin
    Yu, Zhengtao
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024,
  • [6] LCN: Label Correction Based on Network Prediction for Cross-Modal Retrieval with Noisy Labels
    Okamura, Daiki
    Harakawa, Ryosuke
    Iwahashi, Masahiro
    [J]. PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 354 - 358
  • [7] Unpaired robust hashing with noisy labels for zero-shot cross-modal retrieval
    Yong, Kailing
    Shu, Zhenqiu
    Yu, Zhengtao
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [8] RONO: Robust Discriminative Learning with Noisy Labels for 2D-3D Cross-Modal Retrieval
    Feng, Yanglin
    Zhu, Hongyuan
    Peng, Dezhong
    Peng, Xi
    Hu, Peng
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11610 - 11619
  • [9] Mutual Quantization for Cross-Modal Search with Noisy Labels
    Yang, Erkun
    Yao, Dongren
    Liu, Tongliang
    Deng, Cheng
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 7541 - 7550
  • [10] HCMSL: Hybrid Cross-modal Similarity Learning for Cross-modal Retrieval
    Zhang, Chengyuan
    Song, Jiayu
    Zhu, Xiaofeng
    Zhu, Lei
    Zhang, Shichao
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (01)