Comprehensive Semi-Supervised Multi-Modal Learning

被引:0
|
作者
Yang, Yang [1 ]
Wang, Ke-Tao [1 ]
Zhan, De-Chuan [1 ]
Xiong, Hui [2 ]
Jiang, Yuan [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Jiangsu, Peoples R China
[2] Rutgers State Univ, New Brunswick, NJ USA
基金
国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-modal learning refers to the process of learning a precise model to represent the joint representations of different modalities. Despite its promise for multi-modal learning, the co-regularization method is based on the consistency principle with a sufficient assumption, which usually does not hold for real-world multi-modal data. Indeed, due to the modal insufficiency in real-world applications, there are divergences among heterogeneous modalities. This imposes a critical challenge for multi-modal learning. To this end, in this paper, we propose a novel Comprehensive Multi-Modal Learning (CMML) framework, which can strike a balance between the consistency and divergency modalities by considering the insufficiency in one unified framework. Specifically, we utilize an instance level attention mechanism to weight the sufficiency for each instance on different modalities. Moreover, novel diversity regularization and robust consistency metrics are designed for discovering insufficient modalities. Our empirical studies show the superior performances of CMML on real-world data in terms of various criteria.
引用
收藏
页码:4092 / 4098
页数:7
相关论文
共 50 条
  • [41] Inferring the Importance of Product Appearance with Semi-supervised Multi-modal Enhancement: A Step Towards the Screenless Retailing
    Gong, Yongshun
    Yi, Jinfeng
    Chen, Dong-Dong
    Zhang, Jian
    Zhou, Jiayu
    Zhou, Zhi-Hua
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1120 - 1128
  • [42] Semi-supervised learning on multi-manifold
    Chen, Mingxia
    Wang, Jing
    Journal of Computational Information Systems, 2014, 10 (12): : 5131 - 5138
  • [43] Enhancing Semi-Supervised Learning with Cross-Modal Knowledge
    Zhu, Hui
    Lu, Yongchun
    Wang, Hongbin
    Zhou, Xunyi
    Ma, Qin
    Liu, Yanhong
    Jiang, Ning
    Wei, Xin
    Zeng, Linchengxi
    Zhao, Xiaofang
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4456 - 4465
  • [44] Supervised Multi-modal Dictionary Learning for Clothing Representation
    Zhao, Qilu
    Wang, Jiayan
    Li, Zongmin
    PROCEEDINGS OF THE FIFTEENTH IAPR INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS - MVA2017, 2017, : 51 - 54
  • [45] Semi-supervised Learning
    Adams, Niall
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2009, 172 : 530 - 530
  • [46] On semi-supervised learning
    A. Cholaquidis
    R. Fraiman
    M. Sued
    TEST, 2020, 29 : 914 - 937
  • [47] On semi-supervised learning
    Cholaquidis, A.
    Fraiman, R.
    Sued, M.
    TEST, 2020, 29 (04) : 914 - 937
  • [48] Self-paced semi-supervised feature selection with application to multi-modal Alzheimer's disease classification
    Zhang, Chao
    Fan, Wentao
    Wang, Bo
    Chen, Chunlin
    Li, Huaxiong
    INFORMATION FUSION, 2024, 107
  • [49] Self-paced semi-supervised feature selection with application to multi-modal Alzheimer's disease classification
    Zhang, Chao
    Fan, Wentao
    Wang, Bo
    Chen, Chunlin
    Li, Huaxiong
    Information Fusion, 2024, 107
  • [50] ACTIVE LEARNING FOR SEMI-SUPERVISED MULTI-TASK LEARNING
    Li, Hui
    Liao, Xuejun
    Carin, Lawrence
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 1637 - +