Partial Modal Conditioned GANs for Multi-modal Multi-label Learning with Arbitrary Modal-Missing

被引:4
|
作者
Zhang, Yi [1 ]
Shen, Jundong [1 ]
Zhang, Zhecheng [1 ]
Wang, Chongjun [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Dept Comp Sci & Technol, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-modal multi-label; GAN; Arbitrary modal-missing; Label correlation;
D O I
10.1007/978-3-030-73197-7_28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-modal multi-label (MMML) learning serves an important framework to learn from objects with multiple representations and annotations. Previous MMML approaches assume that all instances are with complete modalities, which usually does not hold for real-world MMML data. Meanwhile, most existing works focus on data generation using GAN, while few of them explore the downstream tasks, such as multi-modal multi-label learning. The major challenge is how to jointly model complex modal correlation and label correlation in a mutually beneficial way, especially under the arbitrary modal-missing pattern. Aim at addressing the aforementioned research challenges, we propose a novel framework named Partial Modal Conditioned Generative Adversarial Networks (PMC-GANs) for MMML learning with arbitrary modal-missing. The proposed model contains a modal completion part and a multi-modal multi-label learning part. Firstly, in order to strike a balance between consistency and complementary across different modalities, PMC-GANs incorporates all available modalities during training and generates high-quality missing modality in an efficient way. After that, PMC-GANs exploits label correlation by leveraging shared information from all modalities and specific information of each individual modality. Empirical studies on 3 MMML datasets clearly show the superior performance of PMC-GANs against other state-of-the-art approaches.
引用
收藏
页码:413 / 428
页数:16
相关论文
共 50 条
  • [1] Rethinking Modal-oriented Label Correlations for Multi-modal Multi-label Learning
    Zhang, Yi
    Shen, Jundong
    Zhang, Zhecheng
    Zhang, Lei
    Wang, Chongjun
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [2] Collaboration based multi-modal multi-label learning
    Zhang, Yi
    Zhu, Yinlong
    Zhang, Zhecheng
    Wang, Chongjung
    [J]. APPLIED INTELLIGENCE, 2022, 52 (12) : 14204 - 14217
  • [3] Collaboration based multi-modal multi-label learning
    Yi Zhang
    Yinlong Zhu
    Zhecheng Zhang
    Chongjung Wang
    [J]. Applied Intelligence, 2022, 52 : 14204 - 14217
  • [4] Multi-modal Contextual Prompt Learning for Multi-label Classification with Partial Labels
    Wang, Rui
    Pan, Zhengxin
    Wu, Fangyu
    Lv, Yifan
    Zhang, Bailing
    [J]. 2024 16TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, ICMLC 2024, 2024, : 517 - 524
  • [5] Common and Discriminative Semantic Pursuit for Multi-Modal Multi-Label Learning
    Zhang, Yi
    Shen, Jundong
    Zhang, Zhecheng
    Wang, Chongjun
    [J]. ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 1666 - 1673
  • [6] Tailor Versatile Multi-Modal Learning for Multi-Label Emotion Recognition
    Zhang, Yi
    Chen, Mingyuan
    Shen, Jundong
    Wang, Chongjun
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 9100 - 9108
  • [7] Multi-modal Multi-label Emotion Detection with Modality and Label Dependence
    Dong Zhang
    Ju, Xincheng
    Li, Junhui
    Li, Shoushan
    Zhu, Qiaoming
    Zhou, Guodong
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 3584 - 3593
  • [8] Multi-Modal Multi-Instance Multi-Label Learning with Graph Convolutional Network
    Hang, Cheng
    Wang, Wei
    Zhan, De-Chuan
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [9] Multi-modal anchor adaptation learning for multi-modal summarization
    Chen, Zhongfeng
    Lu, Zhenyu
    Rong, Huan
    Zhao, Chuanjun
    Xu, Fan
    [J]. NEUROCOMPUTING, 2024, 570
  • [10] Multi-modal multi-label semantic indexing of images based on hybrid ensemble learning
    Li, Wei
    Sun, Maosong
    Habel, Christopher
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2007, 2007, 4810 : 744 - +