Comprehensive Semi-Supervised Multi-Modal Learning

被引:0
|
作者
Yang, Yang [1 ]
Wang, Ke-Tao [1 ]
Zhan, De-Chuan [1 ]
Xiong, Hui [2 ]
Jiang, Yuan [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Jiangsu, Peoples R China
[2] Rutgers State Univ, New Brunswick, NJ USA
基金
国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-modal learning refers to the process of learning a precise model to represent the joint representations of different modalities. Despite its promise for multi-modal learning, the co-regularization method is based on the consistency principle with a sufficient assumption, which usually does not hold for real-world multi-modal data. Indeed, due to the modal insufficiency in real-world applications, there are divergences among heterogeneous modalities. This imposes a critical challenge for multi-modal learning. To this end, in this paper, we propose a novel Comprehensive Multi-Modal Learning (CMML) framework, which can strike a balance between the consistency and divergency modalities by considering the insufficiency in one unified framework. Specifically, we utilize an instance level attention mechanism to weight the sufficiency for each instance on different modalities. Moreover, novel diversity regularization and robust consistency metrics are designed for discovering insufficient modalities. Our empirical studies show the superior performances of CMML on real-world data in terms of various criteria.
引用
收藏
页码:4092 / 4098
页数:7
相关论文
共 50 条
  • [21] Pseudo-Label Calibration Semi-supervised Multi-Modal Entity Alignment
    Wang, Luyao
    Qi, Pengnian
    Bao, Xigang
    Zhou, Chunlai
    Qin, Biao
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 8, 2024, : 9116 - 9124
  • [22] Markov random field based fusion for supervised and semi-supervised multi-modal image classification
    Liang Xie
    Peng Pan
    Yansheng Lu
    Multimedia Tools and Applications, 2015, 74 : 613 - 634
  • [23] SEMI-SUPERVISED GRAPH-BASED DEEP LEARNING FOR MULTI-MODAL PREDICTION OF KNEE OSTEOARTHRITIS INCIDENCE
    Razmjoo, A.
    Liu, F.
    Caliva, F.
    Martinez, A. Morales
    Majumdar, S.
    Pedoia, V.
    OSTEOARTHRITIS AND CARTILAGE, 2020, 28 : S305 - S306
  • [24] Heterogeneous Features Integration via Semi-supervised Multi-modal Deep Networks
    Zhao, Lei
    Hu, Qinghua
    Zhou, Yucan
    NEURAL INFORMATION PROCESSING, ICONIP 2015, PT IV, 2015, 9492 : 11 - 19
  • [25] SMIN: Semi-Supervised Multi-Modal Interaction Network for Conversational Emotion Recognition
    Lian, Zheng
    Liu, Bin
    Tao, Jianhua
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (03) : 2415 - 2429
  • [26] AI-enabled Multi-modal Network Anomaly Association: A Deep Self/Semi-Supervised Learning Approach
    Tang, Yinan
    Zhang, Yabo
    Yin, Zhifeng
    Deng, Jianxi
    Li, Feng
    Cui, Yong
    Zhang, Xiaoxiao
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 4068 - 4073
  • [27] Multi-modal Semi-supervised Evidential Recycle Framework for Alzheimer's Disease Classification
    Feng, Yingjie
    Chen, Wei
    Gu, Xianfeng
    Xu, Xiaoyin
    Zhang, Min
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT I, 2023, 14220 : 130 - 140
  • [28] Local weight coupled network: multi-modal unequal semi-supervised domain adaptation
    Cai, Ziyun
    Song, Jie
    Zhang, Tengfei
    Hu, Changhui
    Jing, Xiao-Yuan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (2) : 4331 - 4357
  • [29] A Multi-Modal Graph-Based Semi-Supervised Pipeline for Predicting Cancer Survival
    Hassanzadeh, Hamid Reza
    Phan, John H.
    Wang, May D.
    2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 184 - 189
  • [30] Local weight coupled network: multi-modal unequal semi-supervised domain adaptation
    Ziyun Cai
    Jie Song
    Tengfei Zhang
    Changhui Hu
    Xiao-Yuan Jing
    Multimedia Tools and Applications, 2024, 83 : 4331 - 4357