Modeling Hierarchical Uncertainty for Multimodal Emotion Recognition in Conversation

被引:7
|
作者
Chen, Feiyu [1 ,2 ]
Shao, Jie [1 ,2 ]
Zhu, Anjie [1 ]
Ouyang, Deqiang [3 ]
Liu, Xueliang [4 ]
Shen, Heng Tao [1 ,5 ]
机构
[1] Univ Elect Sci & Technol China, Ctr Future Media, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
[2] Sichuan Artificial Intelligence Res Inst, Yibin 644000, Peoples R China
[3] Chongqing Univ, Coll Comp Sci, Chongqing 400044, Peoples R China
[4] Hefei Univ Technol, Sch Comp & Informat, Hefei 230009, Peoples R China
[5] Peng Cheng Lab, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
Uncertainty; Emotion recognition; Predictive models; Context modeling; Reliability; Bayes methods; Adaptation models; Bayesian deep learning; capsule network (CapsNet); conditional layer normalization (CLN); emotion recognition in conversation (ERC); uncertainty;
D O I
10.1109/TCYB.2022.3185119
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Approximating the uncertainty of an emotional AI agent is crucial for improving the reliability of such agents and facilitating human-in-the-loop solutions, especially in critical scenarios. However, none of the existing systems for emotion recognition in conversation (ERC) has attempted to estimate the uncertainty of their predictions. In this article, we present HU-Dialogue, which models hierarchical uncertainty for the ERC task. We perturb contextual attention weight values with source-adaptive noises within each modality, as a regularization scheme to model context-level uncertainty and adapt the Bayesian deep learning method to the capsule-based prediction layer to model modality-level uncertainty. Furthermore, a weight-sharing triplet structure with conditional layer normalization is introduced to detect both invariance and equivariance among modalities for ERC. We provide a detailed empirical analysis for extensive experiments, which shows that our model outperforms previous state-of-the-art methods on three popular multimodal ERC datasets.
引用
收藏
页码:187 / 198
页数:12
相关论文
共 50 条
  • [1] HAAN-ERC: hierarchical adaptive attention network for multimodal emotion recognition in conversation
    Tao Zhang
    Zhenhua Tan
    Xiaoer Wu
    [J]. Neural Computing and Applications, 2023, 35 : 17619 - 17632
  • [2] HAAN-ERC: hierarchical adaptive attention network for multimodal emotion recognition in conversation
    Zhang, Tao
    Tan, Zhenhua
    Wu, Xiaoer
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (24): : 17619 - 17632
  • [3] Multimodal Emotion Recognition in Conversation Based on Hypergraphs
    Li, Jiaze
    Mei, Hongyan
    Jia, Liyun
    Zhang, Xing
    [J]. ELECTRONICS, 2023, 12 (22)
  • [4] A Contextual Attention Network for Multimodal Emotion Recognition in Conversation
    Wang, Tana
    Hou, Yaqing
    Zhou, Dongsheng
    Zhang, Qiang
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [5] Interactive Multimodal Attention Network for Emotion Recognition in Conversation
    Ren, Minjie
    Huang, Xiangdong
    Shi, Xiaoqi
    Nie, Weizhi
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1046 - 1050
  • [6] Unlocking the Power of Multimodal Learning for Emotion Recognition in Conversation
    Wang, Yunxiao
    Liu, Meng
    Li, Zhe
    Hu, Yupeng
    Luo, Xin
    Nie, Liqiang
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5947 - 5955
  • [7] Fusion with Hierarchical Graphs for Multimodal Emotion Recognition
    Tang, Shuyun
    Luo, Zhaojie
    Nan, Guoshun
    Baba, Jun
    Yoshikawa, Yuichiro
    Ishiguro, Hiroshi
    [J]. PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1288 - 1296
  • [8] Multimodal emotion recognition with hierarchical memory networks
    Lai, Helang
    Wu, Keke
    Li, Lingli
    [J]. INTELLIGENT DATA ANALYSIS, 2021, 25 (04) : 1031 - 1045
  • [9] Consistency, Uncertainty or Inconsistency Detection in Multimodal Emotion Recognition
    Fantini, Alessia
    Pilato, Giovanni
    Vitale, Gianpaolo
    [J]. 2023 SEVENTH IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING, IRC 2023, 2023, : 377 - 380
  • [10] Improving multimodal fusion with Main Modal Transformer for emotion recognition in conversation
    Zou, ShiHao
    Huang, Xianying
    Shen, XuDong
    Liu, Hankai
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 258