SSLMM: Semi-Supervised Learning with Missing Modalities for Multimodal Sentiment Analysis

被引：0

作者：

Wang, Yiyu ^{[1
]}

Jian, Haifang ^{[2
,3
]}

Zhuang, Jian ^{[4
]}

Guo, Huimin ^{[2
,3
]}

Leng, Yan ^{[1
]}

机构：

[1] Shandong Normal Univ, Sch Phys & Elect, Jinan 250358, Peoples R China

[2] Chinese Acad Sci, Lab Solid State Optoelect Informat Technol, Inst Semicond, Beijing 100083, Peoples R China

[3] Chinese Acad Sci, Beijing 100049, Peoples R China

[4] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian 116023, Liaoning, Peoples R China

来源：

INFORMATION FUSION | 2025年 / 120卷

关键词：

Multimodal sentiment analysis; Semi-supervised learning; Missing modalities;

D O I：

10.1016/j.inffus.2025.103058

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Multimodal Sentiment Analysis (MSA) integrates information from text, audio, and visuals to understand human emotions, but real-world applications face two challenges: (1) expensive annotation costs reduce the effectiveness of fully supervised methods, and (2) missing modality severely impact model robustness. While there are studies addressing these issues separately, few focus on solving both within a single framework. In real-world scenarios, these challenges often occur together, necessitating an algorithm that can handle both. To address this, we propose a Semi-Supervised Learning with Missing Modalities (SSLMM) framework. SSLMM combines self-supervised learning, alternating interaction information, semi-supervised learning, and modality reconstruction to tackle label scarcity and modality missing simultaneously. Firstly, SSLMM captures latent structural information through self-supervised pre-training. It then fine-tunes the model using semi- supervised learning and modality reconstruction to reduce dependence on labeled data and improve robustness to modality missing. The framework uses a graph-based architecture with an iterative message propagation mechanism to alternately propagate intra-modal and inter-modal messages, capturing emotional associations within and across modalities. Experiments on CMU-MOSI, CMU-MOSEI, and CH-SIMS demonstrate that under the condition where the proportion of labeled samples and the missing modality rate are both 0.5, SSLMM achieves binary classification (negative vs. positive) accuracies of 80.2%, 81.7%, and 77.1%, respectively, surpassing existing methods.

引用

页数：19

共 50 条

[21] Semi-supervised Multi-view Sentiment Analysis
Lazarova, Gergana
Koychev, Ivan
COMPUTATIONAL COLLECTIVE INTELLIGENCE (ICCCI 2015), PT I, 2015, 9329 : 181 - 190
[22] Semi-supervised distributed representations of documents for sentiment analysis
Park, Saerom
Lee, Jaewook
Kim, Kyoungok
NEURAL NETWORKS, 2019, 119 : 139 - 150
[23] Semi-Supervised Multi-Modal Learning with Incomplete Modalities
Yang, Yang
Zhan, De-Chuan
Sheng, Xiang-Rong
Jiang, Yuan
PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2998 - 3004
[24] Semi-supervised Sentiment Classification Based on Auxiliary Task Learning
Liu, Huan
Wang, Jingjing
Li, Shoushan
Li, Junhui
Zhou, Guodong
NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2018, PT II, 2018, 11109 : 372 - 382
[25] Building an Arabic Sentiment Lexicon Using Semi-supervised Learning
Mahyoub, Fawaz H. H.
Siddiqui, Muazzam A.
Dahab, Mohamed Y.
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2014, 26 (04) : 417 - 424
[26] Multi-view Learning for Semi-supervised Sentiment Classification
Su, Yan
Li, Shoushan
Ju, Shengfeng
Zhou, Guodong
Li, Xiaojun
2012 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2012), 2012, : 13 - 16
[27] Cooperative Hybrid Semi-Supervised Learning for Text Sentiment Classification
Li, Yang
Lv, Ying
Wang, Suge
Liang, Jiye
Li, Juanzi
Li, Xiaoli
SYMMETRY-BASEL, 2019, 11 (02):
[28] Active deep learning method for semi-supervised sentiment classification
Zhou, Shusen
Chen, Qingcai
Wang, Xiaolong
NEUROCOMPUTING, 2013, 120 : 536 - 546
[29] UniMF: A Unified Multimodal Framework for Multimodal Sentiment Analysis in Missing Modalities and Unaligned Multimodal Sequences
Huan, Ruohong
Zhong, Guowei
Chen, Peng
Liang, Ronghua
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 5753 - 5768
[30] A Theoretical Analysis of Semi-supervised Learning
Fujii, Takashi
Ito, Hidetaka
Miyoshi, Seiji
NEURAL INFORMATION PROCESSING, ICONIP 2016, PT II, 2016, 9948 : 28 - 36

← 1 2 3 4 5 →