Multimodal Consistency-Based Teacher for Semi-Supervised Multimodal Sentiment Analysis

Cited by: 0
Authors
Yuan, Ziqi [1]
Fang, Jingliang [1, 2]
Xu, Hua [1, 2]
Gao, Kai [3]
Affiliations
[1] Tsinghua Univ, Dept Comp Sci & Technol, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China
[2] Samton Jiangxi Technol Dev Co Ltd, Nanchang 330036, Peoples R China
[3] Hebei Univ Sci & Technol, Sch Informat Sci & Engn, Shijiazhuang 050018, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Task analysis; Sentiment analysis; Visualization; Training; Speech processing; Semisupervised learning; Image classification; Consistency-based semi-supervised learning; multimodal sentiment analysis; pseudo-label filtering;
DOI
10.1109/TASLP.2024.3430543
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Multimodal sentiment analysis is of significant importance in human-computer interaction. Because unlabeled online resources are easy to collect while annotation is costly, it is imperative for researchers to develop semi-supervised methods that leverage unlabeled data to enhance model performance. Existing semi-supervised approaches, particularly those applied to trivial image classification tasks, are not suitable for multimodal regression tasks because they rely on task-specific augmentation and on thresholds designed for classification tasks. To address this limitation, we propose the Multimodal Consistency-based Teacher (MC-Teacher), which incorporates a consistency-based pseudo-labeling technique into semi-supervised multimodal sentiment analysis. In our approach, we first propose the synergistic consistency assumption, which focuses on the consistency among bimodal representations. Building upon this assumption, we develop a learnable filter network that autonomously learns to identify misleading instances, rather than relying on threshold-based methods. This is achieved by leveraging both the implicit discriminant consistency on unlabeled instances and the explicit guidance on training data constructed from labeled instances. Additionally, we design a self-adaptive exponential moving average strategy that decouples the student and teacher networks by means of a heuristic momentum coefficient. Through both quantitative and qualitative experiments on two benchmark datasets, we demonstrate the outstanding performance of the proposed MC-Teacher approach. Furthermore, detailed analysis experiments and case studies are provided for each crucial component to intuitively elucidate its inner mechanism and further validate its effectiveness.
Pages: 3669-3683
Number of pages: 15
Related papers
50 in total
  • [41] A semi-supervised approach to sentiment analysis using revised sentiment strength based on SentiWordNet
    Farhan Hassan Khan
    Usman Qamar
    Saba Bashir
    Knowledge and Information Systems, 2017, 51 : 851 - 872
  • [42] Set-Similarity Joins Based Semi-supervised Sentiment Analysis
    Dong, Xishuang
    Zou, Qibo
    Guan, Yi
    NEURAL INFORMATION PROCESSING, ICONIP 2012, PT I, 2012, 7663 : 176 - 183
  • [43] Multimodal consistency-specificity fusion based on information bottleneck for sentiment analysis
    Liu, Wei
    Cao, Shenchao
    Zhang, Sun
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (02)
  • [44] An autoencoder-based self-supervised learning for multimodal sentiment analysis
    Feng, Wenjun
    Wang, Xin
    Cao, Donglin
    Lin, Dazhen
    INFORMATION SCIENCES, 2024, 675
  • [45] Semi-Supervised Multimodal Deep Learning Model for Polarity Detection in Arguments
    Ange, Tato
    Roger, Nkambou
    Aude, Dufresne
    Claude, Frasson
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [46] Multimodal Semi-supervised Learning Framework for Punctuation Prediction in Conversational Speech
    Sunkara, Monica
    Ronanki, Srikanth
    Bekal, Dhanush
    Bodapati, Sravan
    Kirchhoff, Katrin
    INTERSPEECH 2020, 2020, : 4911 - 4915
  • [47] Kernel semi-supervised graph embedding model for multimodal and mixmodal data
    Qi Zhang
    Rui Li
    Tianguang Chu
    Science China Information Sciences, 2020, 63
  • [48] DOUBLY SEMI-SUPERVISED MULTIMODAL ADVERSARIAL LEARNING FOR CLASSIFICATION, GENERATION AND RETRIEVAL
    Du, Changde
    Du, Changying
    He, Huiguang
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 13 - 18
  • [49] Kernel semi-supervised graph embedding model for multimodal and mixmodal data
    Qi ZHANG
    Rui LI
    Tianguang CHU
    Science China (Information Sciences), 2020, 63 (01) : 247 - 249
  • [50] Semi-supervised dimensional sentiment analysis with variational autoencoder
    Wu, Chuhan
    Wu, Fangzhao
    Wu, Sixing
    Yuan, Zhigang
    Liu, Junxin
    Huang, Yongfeng
    KNOWLEDGE-BASED SYSTEMS, 2019, 165 : 30 - 39