Causal Inference for Modality Debiasing in Multimodal Emotion Recognition

被引:0
|
作者
Kim, Juyeon [1 ]
Hong, Juyoung [1 ]
Choi, Yukyung [1 ]
机构
[1] Sejong Univ, Dept Convergence Engn Intelligent Drone, Seoul 05006, South Korea
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 23期
关键词
emotion recognition; multimodal learning; causal inference;
D O I
10.3390/app142311397
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Multimodal emotion recognition (MER) aims to enhance the understanding of human emotions by integrating visual, auditory, and textual modalities. However, previous MER approaches often depend on a dominant modality rather than considering all modalities, leading to poor generalization. To address this, we propose Causal Inference in Multimodal Emotion Recognition (CausalMER), which leverages counterfactual reasoning and causal graphs to capture relationships between modalities and reduce direct modality effects contributing to bias. This allows CausalMER to make unbiased predictions while being easily applied to existing MER methods in a model-agnostic manner, without requiring any architectural modifications. We evaluate CausalMER on the IEMOCAP and CMU-MOSEI datasets, widely used benchmarks in MER, and compare it with existing methods. On the IEMOCAP dataset with the MulT backbone, CausalMER achieves an average accuracy of 83.4%. On the CMU-MOSEI dataset, the average accuracies with MulT, PMR, and DMD backbones are 50.1%, 48.8%, and 48.8%, respectively. Experimental results demonstrate that CausalMER is robust in missing modality scenarios, as shown by its low standard deviation in performance drop gaps. Additionally, we evaluate modality contributions and show that CausalMER achieves balanced contributions from each modality, effectively mitigating direct biases from individual modalities.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] CausalABSC: Causal Inference for Aspect Debiasing in Aspect-Based Sentiment Classification
    Zhou, Jie
    Lin, Yuanbiao
    Chen, Qin
    Zhang, Qi
    Huang, Xuanjing
    He, Liang
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 830 - 840
  • [22] Debiasing Counterfactual Context With Causal Inference for Multi-Turn Dialogue Reasoning
    Wang, Xu
    Zhang, Hainan
    Zhao, Shuai
    Chen, Hongshen
    Ding, Zhuoye
    Wan, Zhiguo
    Cheng, Bo
    Lan, Yanyan
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1125 - 1132
  • [23] Beyond superficial emotion recognition: Modality-adaptive emotion recognition system
    Kang, Dohee
    Kim, Daeha
    Kang, Donghyun
    Kim, Taein
    Lee, Bowon
    Kim, Deokhwan
    Song, Byung Cheol
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 235
  • [24] Towards Context-Aware Emotion Recognition Debiasing From a Causal Demystification Perspective via De-Confounded Training
    Yang, Dingkang
    Yang, Kun
    Kuang, Haopeng
    Chen, Zhaoyu
    Wang, Yuzheng
    Zhang, Lihua
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 10663 - 10680
  • [25] A high speed inference architecture for multimodal emotion recognition based on sparse cross modal encoder
    Cui, Lin
    Zhang, Yuanbang
    Cui, Yingkai
    Wang, Boyan
    Sun, Xiaodong
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (05)
  • [26] An Emotion-Space Model of Multimodal Emotion Recognition
    Choe, Kyung-Il
    ADVANCED SCIENCE LETTERS, 2018, 24 (01) : 699 - 702
  • [27] INTERACTIVE EMOTION INFERENCE MODEL FOR EMOTION RECOGNITION IN CONVERSATION
    Qian, Y. A. N. J. U. N.
    Zhang, X. U. E. J. I. E.
    Wang, J. I. N.
    JOURNAL OF NONLINEAR AND CONVEX ANALYSIS, 2022, 23 (10) : 2175 - 2193
  • [28] Causal inference in environmental sound recognition
    Traer, James
    Norman-Haignere, Sam, V
    McDermott, Josh H.
    COGNITION, 2021, 214
  • [29] Multimodal Emotion Recognition in Response to Videos
    Soleymani, Mohammad
    Pantic, Maja
    Pun, Thierry
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2012, 3 (02) : 211 - 223
  • [30] Multimodal emotion recognition and expressivity analysis
    Kollias, S
    Karpouzis, K
    2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2, 2005, : 779 - 783