Multimodal Consistency-Based Teacher for Semi-Supervised Multimodal Sentiment Analysis

被引:0
|
作者
Yuan, Ziqi [1 ]
Fang, Jingliang [1 ,2 ]
Xu, Hua [1 ,2 ]
Gao, Kai [3 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China
[2] Samton Jiangxi Technol Dev Co Ltd, Nanchang 330036, Peoples R China
[3] Hebei Univ Sci & Technol, Sch Informat Sci & Engn, Shijiazhuang 050018, Peoples R China
基金
中国国家自然科学基金;
关键词
Task analysis; Sentiment analysis; Visualization; Training; Speech processing; Semisupervised learning; Image classification; Consistency-based semi-supervised learning; multimodal sentiment analysis; pseudo-label filtering;
D O I
10.1109/TASLP.2024.3430543
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Multimodal sentiment analysis holds significant importance within the realm of human-computer interaction. Due to the ease of collecting unlabeled online resources compared to the high costs associated with annotation, it becomes imperative for researchers to develop semi-supervised methods that leverage unlabeled data to enhance model performance. Existing semi-supervised approaches, particularly those applied to trivial image classification tasks, are not suitable for multimodal regression tasks due to their reliance on task-specific augmentation and thresholds designed for classification tasks. To address this limitation, we propose the Multimodal Consistency-based Teacher (MC-Teacher), which incorporates consistency-based pseudo-label technique into semi-supervised multimodal sentiment analysis. In our approach, we first propose synergistic consistency assumption which focus on the consistency among bimodal representation. Building upon this assumption, we develop a learnable filter network that autonomously learns how to identify misleading instances instead of threshold-based methods. This is achieved by leveraging both the implicit discriminant consistency on unlabeled instances and the explicit guidance on constructed training data with labeled instances. Additionally, we design the self-adaptive exponential moving average strategy to decouple the student and teacher networks, utilizing a heuristic momentum coefficient. Through both quantitative and qualitative experiments on two benchmark datasets, we demonstrate the outstanding performances of the proposed MC-Teacher approach. Furthermore, detailed analysis experiments and case studies are provided for each crucial component to intuitively elucidate the inner mechanism and further validate their effectiveness.
引用
收藏
页码:3669 / 3683
页数:15
相关论文
共 50 条
  • [1] SSLMM: Semi-Supervised Learning with Missing Modalities for Multimodal Sentiment Analysis
    Wang, Yiyu
    Jian, Haifang
    Zhuang, Jian
    Guo, Huimin
    Leng, Yan
    INFORMATION FUSION, 2025, 120
  • [2] Consistency-based Semi-supervised Learning for Object Detection
    Jeong, Jisoo
    Lee, Seungeui
    Kim, Jeesoo
    Kwak, Nojun
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [3] Semi-supervised Audio Classification with Consistency-Based Regularization
    Lu, Kangkang
    Foo, Chuan-Sheng
    Teh, Kah Kuan
    Huy Dat Tran
    Chandrasekhar, Vijay Ramaseshan
    INTERSPEECH 2019, 2019, : 3654 - 3658
  • [4] Consistency-based semi-supervised learning for oriented object detection
    Fu, Ronghao
    Chen, Chengcheng
    Yan, Shuang
    Wang, Xianchang
    Chen, Huiling
    KNOWLEDGE-BASED SYSTEMS, 2024, 304
  • [5] Explanation Consistency Training: Facilitating Consistency-Based Semi-Supervised Learning with Interpretability
    Han, Tao
    Tu, Wei-Wei
    Li, Yu-Feng
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 7639 - 7646
  • [6] Semi-supervised hash learning method with consistency-based dimensionality reduction
    Lv, Fang
    Wei, Yuliang
    Han, Xixian
    Wang, Bailing
    ADVANCES IN MECHANICAL ENGINEERING, 2019, 11 (01)
  • [7] Semi-supervised Learning with Multimodal Perturbation
    Su, Lei
    Liao, Hongzhi
    Yu, Zhengtao
    Tang, Jiahua
    ADVANCES IN NEURAL NETWORKS - ISNN 2009, PT 1, PROCEEDINGS, 2009, 5551 : 651 - +
  • [8] Consistency-Based Semi-supervised Evidential Active Learning for Diagnostic Radiograph Classification
    Balaram, Shafa
    Nguyen, Cuong M.
    Kassim, Ashraf
    Krishnaswamy, Pavitra
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT I, 2022, 13431 : 675 - 685
  • [9] SemiMemes: A Semi-supervised Learning Approach for Multimodal Memes Analysis
    Pham Thai Hoang Tung
    Nguyen Tan Viet
    Ngo Tien Anh
    Phan Duy Hung
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2023, 2023, 14162 : 565 - 577
  • [10] Graph-based multimodal semi-supervised image classification
    Xie, Wenxuan
    Lu, Zhiwu
    Peng, Yuxin
    Xiao, Jianguo
    NEUROCOMPUTING, 2014, 138 : 167 - 179