Comprehensive Semi-Supervised Multi-Modal Learning

被引：0

作者：

Yang, Yang ^{[1
]}

Wang, Ke-Tao ^{[1
]}

Zhan, De-Chuan ^{[1
]}

Xiong, Hui ^{[2
]}

Jiang, Yuan ^{[1
]}

机构：

[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Jiangsu, Peoples R China

[2] Rutgers State Univ, New Brunswick, NJ USA

来源：

PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2019年

基金：

国家重点研发计划;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Multi-modal learning refers to the process of learning a precise model to represent the joint representations of different modalities. Despite its promise for multi-modal learning, the co-regularization method is based on the consistency principle with a sufficient assumption, which usually does not hold for real-world multi-modal data. Indeed, due to the modal insufficiency in real-world applications, there are divergences among heterogeneous modalities. This imposes a critical challenge for multi-modal learning. To this end, in this paper, we propose a novel Comprehensive Multi-Modal Learning (CMML) framework, which can strike a balance between the consistency and divergency modalities by considering the insufficiency in one unified framework. Specifically, we utilize an instance level attention mechanism to weight the sufficiency for each instance on different modalities. Moreover, novel diversity regularization and robust consistency metrics are designed for discovering insufficient modalities. Our empirical studies show the superior performances of CMML on real-world data in terms of various criteria.

引用

页码：4092 / 4098

页数：7

共 50 条

[41] Inferring the Importance of Product Appearance with Semi-supervised Multi-modal Enhancement: A Step Towards the Screenless Retailing
Gong, Yongshun
Yi, Jinfeng
Chen, Dong-Dong
Zhang, Jian
Zhou, Jiayu
Zhou, Zhi-Hua
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1120 - 1128
[42] Semi-supervised learning on multi-manifold
Chen, Mingxia
Wang, Jing
Journal of Computational Information Systems, 2014, 10 (12): : 5131 - 5138
[43] Enhancing Semi-Supervised Learning with Cross-Modal Knowledge
Zhu, Hui
Lu, Yongchun
Wang, Hongbin
Zhou, Xunyi
Ma, Qin
Liu, Yanhong
Jiang, Ning
Wei, Xin
Zeng, Linchengxi
Zhao, Xiaofang
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4456 - 4465
[44] Supervised Multi-modal Dictionary Learning for Clothing Representation
Zhao, Qilu
Wang, Jiayan
Li, Zongmin
PROCEEDINGS OF THE FIFTEENTH IAPR INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS - MVA2017, 2017, : 51 - 54
[45] Semi-supervised Learning
Adams, Niall
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2009, 172 : 530 - 530
[46] On semi-supervised learning
A. Cholaquidis
R. Fraiman
M. Sued
TEST, 2020, 29 : 914 - 937
[47] On semi-supervised learning
Cholaquidis, A.
Fraiman, R.
Sued, M.
TEST, 2020, 29 (04) : 914 - 937
[48] Self-paced semi-supervised feature selection with application to multi-modal Alzheimer's disease classification
Zhang, Chao
Fan, Wentao
Wang, Bo
Chen, Chunlin
Li, Huaxiong
INFORMATION FUSION, 2024, 107
[49] Self-paced semi-supervised feature selection with application to multi-modal Alzheimer's disease classification
Zhang, Chao
Fan, Wentao
Wang, Bo
Chen, Chunlin
Li, Huaxiong
Information Fusion, 2024, 107
[50] ACTIVE LEARNING FOR SEMI-SUPERVISED MULTI-TASK LEARNING
Li, Hui
Liao, Xuejun
Carin, Lawrence
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 1637 - +

← 1 2 3 4 5 →