Disentangled Variational Autoencoder for Emotion Recognition in Conversations

被引:5
|
作者
Yang, Kailai [1 ]
Zhang, Tianlin [1 ]
Ananiadou, Sophia [1 ]
机构
[1] Univ Manchester, Dept Comp Sci, NaCTeM, Manchester M13 9PL, England
基金
英国生物技术与生命科学研究理事会;
关键词
Task analysis; Emotion recognition; Hidden Markov models; Context modeling; Decoding; Oral communication; Gaussian distribution; Emotion recognition in conversations; variational autoencoder; valence-arousal-dominance; disentangled representations; DIALOGUE;
D O I
10.1109/TAFFC.2023.3280038
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In Emotion Recognition in Conversations (ERC), the emotions of target utterances are closely dependent on their context. Therefore, existing works train the model to generate the response of the target utterance, which aims to recognise emotions leveraging contextual information. However, adjacent response generation ignores long-range dependencies and provides limited affective information in many cases. In addition, most ERC models learn a unified distributed representation for each utterance, which lacks interpretability and robustness. To address these issues, we propose a VAD-disentangled Variational AutoEncoder (VAD-VAE), which first introduces a target utterance reconstruction task based on Variational Autoencoder, then disentangles three affect representations Valence-Arousal-Dominance (VAD) from the latent space. We also enhance the disentangled representations by introducing VAD supervision signals from a sentiment lexicon and minimising the mutual information between VAD distributions. Experiments show that VAD-VAE outperforms the state-of-the-art model on two datasets. Further analysis proves the effectiveness of each proposed module and the quality of disentangled VAD representations.
引用
收藏
页码:508 / 518
页数:11
相关论文
共 50 条
  • [21] Robust unsupervised image categorization based on variational autoencoder with disentangled latent representations
    Yang, Lin
    Fan, Wentao
    Bouguila, Nizar
    KNOWLEDGE-BASED SYSTEMS, 2022, 246
  • [22] Semantically Disentangled Variational Autoencoder for Modeling 3D Facial Details
    Ling, Jingwang
    Wang, Zhibo
    Lu, Ming
    Wang, Quan
    Qian, Chen
    Xu, Feng
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2023, 29 (08) : 3630 - 3641
  • [23] Bimodal variational autoencoder for audiovisual speech recognition
    Hadeer M. Sayed
    Hesham E. ElDeeb
    Shereen A. Taie
    Machine Learning, 2023, 112 : 1201 - 1226
  • [24] Bimodal variational autoencoder for audiovisual speech recognition
    Sayed, Hadeer M.
    ElDeeb, Hesham E.
    Taie, Shereen A.
    MACHINE LEARNING, 2023, 112 (04) : 1201 - 1226
  • [25] SAR Target Recognition Based On Variational Autoencoder
    Xu, Yanbing
    Zhang, Gong
    Wang, Ke
    Leung, Henry
    2019 IEEE MTT-S INTERNATIONAL MICROWAVE BIOMEDICAL CONFERENCE (IMBIOC 2019), 2019,
  • [26] Speech Emotion Recognition 'in the wild' Using an Autoencoder
    Dissanayake, Vipula
    Zhang, Haimo
    Billinghurst, Mark
    Nanayakkara, Suranga
    INTERSPEECH 2020, 2020, : 526 - 530
  • [27] Multi-Level Variational Autoencoder: Learning Disentangled Representations from Grouped Observations
    Bouchacourt, Diane
    Tomioka, Ryota
    Nowozin, Sebastian
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 2095 - 2102
  • [28] Self-supervised learning for tool wear monitoring with a disentangled-variational-autoencoder
    von Hahn, Tim
    Mechefske, Chris K.
    INTERNATIONAL JOURNAL OF HYDROMECHATRONICS, 2021, 4 (01) : 69 - 98
  • [29] Information Bottlenecked Variational Autoencoder for Disentangled 3D Facial Expression Modelling
    Sun, Hao
    Pears, Nick
    Gu, Yajie
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2334 - 2343
  • [30] DISENTANGLED REPRESENTATION OF LONGITUDINAL ß-AMYLOID FOR AD VIA SEQUENTIAL GRAPH VARIATIONAL AUTOENCODER WITH SUPERVISION
    Yang, Fan
    Wu, Guorong
    Kim, Won Hwa
    2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022,