Rethinking Modal-oriented Label Correlations for Multi-modal Multi-label Learning

被引:0
|
作者
Zhang, Yi [1 ]
Shen, Jundong [1 ]
Zhang, Zhecheng [1 ]
Zhang, Lei [1 ]
Wang, Chongjun [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
基金
中国国家自然科学基金;
关键词
multi-modal; multi-label; label correlations; modal-specific; cross-modal;
D O I
10.1109/ijcnn48605.2020.9207362
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-modal multi-label learning provides a fundamental framework for complex objects, which can be represented with multiple modalities and annotated with multiple labels simultaneously. Different modalities can usually provide complementary information, which may lead to improved performance. What's more, exploiting label correlations is crucially important to multi-label learning. However, most existing multi-label learning approaches do not sufficiently consider the complementary information among different modalities. In this paper, we propose a novel end-to-end deep learning framework named Rethinking Modal-oriented Label Correlations (RMLC), which sequentially polish the label prediction with each individual modality. In order to explicitly account for the correlated prediction of multiple labels, RMLC leverages an efficient sequential modal-based exploration to rethink label correlations. The final prediction of each label involves the collaboration between modal-specific prediction and the prediction of other labels based on cross-modal interaction. Comprehensive experiments on benchmark datasets validate the effectiveness and competitiveness of the proposed RMLC approach.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Collaboration based multi-modal multi-label learning
    Zhang, Yi
    Zhu, Yinlong
    Zhang, Zhecheng
    Wang, Chongjung
    [J]. APPLIED INTELLIGENCE, 2022, 52 (12) : 14204 - 14217
  • [2] Collaboration based multi-modal multi-label learning
    Yi Zhang
    Yinlong Zhu
    Zhecheng Zhang
    Chongjung Wang
    [J]. Applied Intelligence, 2022, 52 : 14204 - 14217
  • [3] Partial Modal Conditioned GANs for Multi-modal Multi-label Learning with Arbitrary Modal-Missing
    Zhang, Yi
    Shen, Jundong
    Zhang, Zhecheng
    Wang, Chongjun
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2021), PT II, 2021, 12682 : 413 - 428
  • [4] Multi-modal Multi-label Emotion Detection with Modality and Label Dependence
    Dong Zhang
    Ju, Xincheng
    Li, Junhui
    Li, Shoushan
    Zhu, Qiaoming
    Zhou, Guodong
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 3584 - 3593
  • [5] Common and Discriminative Semantic Pursuit for Multi-Modal Multi-Label Learning
    Zhang, Yi
    Shen, Jundong
    Zhang, Zhecheng
    Wang, Chongjun
    [J]. ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 1666 - 1673
  • [6] Tailor Versatile Multi-Modal Learning for Multi-Label Emotion Recognition
    Zhang, Yi
    Chen, Mingyuan
    Shen, Jundong
    Wang, Chongjun
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 9100 - 9108
  • [7] Multi-modal Contextual Prompt Learning for Multi-label Classification with Partial Labels
    Wang, Rui
    Pan, Zhengxin
    Wu, Fangyu
    Lv, Yifan
    Zhang, Bailing
    [J]. 2024 16TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, ICMLC 2024, 2024, : 517 - 524
  • [8] Multi-Modal Multi-Instance Multi-Label Learning with Graph Convolutional Network
    Hang, Cheng
    Wang, Wei
    Zhan, De-Chuan
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [9] Multi-modal multi-label semantic indexing of images based on hybrid ensemble learning
    Li, Wei
    Sun, Maosong
    Habel, Christopher
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2007, 2007, 4810 : 744 - +
  • [10] Transformer-based Label Set Generation for Multi-modal Multi-label Emotion Detection
    Ju, Xincheng
    Zhang, Dong
    Li, Junhui
    Zhou, Guodong
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 512 - 520