Comprehensive Semi-Supervised Multi-Modal Learning

Cited by: 0

Authors:

Yang, Yang [1]

Wang, Ke-Tao [1]

Zhan, De-Chuan [1]

Xiong, Hui [2]

Jiang, Yuan [1]

Affiliations:

[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Jiangsu, Peoples R China

[2] Rutgers State Univ, New Brunswick, NJ USA

Source:

PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2019

Funding:

National Key Research and Development Program of China

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Multi-modal learning refers to the process of learning a precise model to represent the joint representations of different modalities. Despite its promise for multi-modal learning, the co-regularization method is based on the consistency principle with a sufficient assumption, which usually does not hold for real-world multi-modal data. Indeed, due to the modal insufficiency in real-world applications, there are divergences among heterogeneous modalities. This imposes a critical challenge for multi-modal learning. To this end, in this paper, we propose a novel Comprehensive Multi-Modal Learning (CMML) framework, which can strike a balance between the consistency and divergency modalities by considering the insufficiency in one unified framework. Specifically, we utilize an instance level attention mechanism to weight the sufficiency for each instance on different modalities. Moreover, novel diversity regularization and robust consistency metrics are designed for discovering insufficient modalities. Our empirical studies show the superior performances of CMML on real-world data in terms of various criteria.

Pages: 4092-4098

Page count: 7
