SSLMM: Semi-Supervised Learning with Missing Modalities for Multimodal Sentiment Analysis

被引:0
|
作者
Wang, Yiyu [1 ]
Jian, Haifang [2 ,3 ]
Zhuang, Jian [4 ]
Guo, Huimin [2 ,3 ]
Leng, Yan [1 ]
机构
[1] Shandong Normal Univ, Sch Phys & Elect, Jinan 250358, Peoples R China
[2] Chinese Acad Sci, Lab Solid State Optoelect Informat Technol, Inst Semicond, Beijing 100083, Peoples R China
[3] Chinese Acad Sci, Beijing 100049, Peoples R China
[4] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian 116023, Liaoning, Peoples R China
关键词
Multimodal sentiment analysis; Semi-supervised learning; Missing modalities;
D O I
10.1016/j.inffus.2025.103058
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal Sentiment Analysis (MSA) integrates information from text, audio, and visuals to understand human emotions, but real-world applications face two challenges: (1) expensive annotation costs reduce the effectiveness of fully supervised methods, and (2) missing modality severely impact model robustness. While there are studies addressing these issues separately, few focus on solving both within a single framework. In real-world scenarios, these challenges often occur together, necessitating an algorithm that can handle both. To address this, we propose a Semi-Supervised Learning with Missing Modalities (SSLMM) framework. SSLMM combines self-supervised learning, alternating interaction information, semi-supervised learning, and modality reconstruction to tackle label scarcity and modality missing simultaneously. Firstly, SSLMM captures latent structural information through self-supervised pre-training. It then fine-tunes the model using semi- supervised learning and modality reconstruction to reduce dependence on labeled data and improve robustness to modality missing. The framework uses a graph-based architecture with an iterative message propagation mechanism to alternately propagate intra-modal and inter-modal messages, capturing emotional associations within and across modalities. Experiments on CMU-MOSI, CMU-MOSEI, and CH-SIMS demonstrate that under the condition where the proportion of labeled samples and the missing modality rate are both 0.5, SSLMM achieves binary classification (negative vs. positive) accuracies of 80.2%, 81.7%, and 77.1%, respectively, surpassing existing methods.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Semi-supervised Multi-view Sentiment Analysis
    Lazarova, Gergana
    Koychev, Ivan
    COMPUTATIONAL COLLECTIVE INTELLIGENCE (ICCCI 2015), PT I, 2015, 9329 : 181 - 190
  • [22] Semi-supervised distributed representations of documents for sentiment analysis
    Park, Saerom
    Lee, Jaewook
    Kim, Kyoungok
    NEURAL NETWORKS, 2019, 119 : 139 - 150
  • [23] Semi-Supervised Multi-Modal Learning with Incomplete Modalities
    Yang, Yang
    Zhan, De-Chuan
    Sheng, Xiang-Rong
    Jiang, Yuan
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2998 - 3004
  • [24] Semi-supervised Sentiment Classification Based on Auxiliary Task Learning
    Liu, Huan
    Wang, Jingjing
    Li, Shoushan
    Li, Junhui
    Zhou, Guodong
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2018, PT II, 2018, 11109 : 372 - 382
  • [25] Building an Arabic Sentiment Lexicon Using Semi-supervised Learning
    Mahyoub, Fawaz H. H.
    Siddiqui, Muazzam A.
    Dahab, Mohamed Y.
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2014, 26 (04) : 417 - 424
  • [26] Multi-view Learning for Semi-supervised Sentiment Classification
    Su, Yan
    Li, Shoushan
    Ju, Shengfeng
    Zhou, Guodong
    Li, Xiaojun
    2012 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2012), 2012, : 13 - 16
  • [27] Cooperative Hybrid Semi-Supervised Learning for Text Sentiment Classification
    Li, Yang
    Lv, Ying
    Wang, Suge
    Liang, Jiye
    Li, Juanzi
    Li, Xiaoli
    SYMMETRY-BASEL, 2019, 11 (02):
  • [28] Active deep learning method for semi-supervised sentiment classification
    Zhou, Shusen
    Chen, Qingcai
    Wang, Xiaolong
    NEUROCOMPUTING, 2013, 120 : 536 - 546
  • [29] UniMF: A Unified Multimodal Framework for Multimodal Sentiment Analysis in Missing Modalities and Unaligned Multimodal Sequences
    Huan, Ruohong
    Zhong, Guowei
    Chen, Peng
    Liang, Ronghua
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 5753 - 5768
  • [30] A Theoretical Analysis of Semi-supervised Learning
    Fujii, Takashi
    Ito, Hidetaka
    Miyoshi, Seiji
    NEURAL INFORMATION PROCESSING, ICONIP 2016, PT II, 2016, 9948 : 28 - 36