Disentangled Variational Autoencoder for Emotion Recognition in Conversations

Cited by: 5

Authors:

Yang, Kailai [1]

Zhang, Tianlin [1]

Ananiadou, Sophia [1]

Affiliations:

[1] Univ Manchester, Dept Comp Sci, NaCTeM, Manchester M13 9PL, England

Source:

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING | 2024 / Vol. 15 / No. 02

Funding:

UK Biotechnology and Biological Sciences Research Council (BBSRC);

Keywords:

Task analysis; Emotion recognition; Hidden Markov models; Context modeling; Decoding; Oral communication; Gaussian distribution; Emotion recognition in conversations; variational autoencoder; valence-arousal-dominance; disentangled representations; DIALOGUE;

DOI:

10.1109/TAFFC.2023.3280038

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In Emotion Recognition in Conversations (ERC), the emotions of target utterances depend closely on their context. Existing works therefore train the model to generate the response to the target utterance, aiming to recognise emotions by leveraging contextual information. However, adjacent response generation ignores long-range dependencies and provides limited affective information in many cases. In addition, most ERC models learn a unified distributed representation for each utterance, which lacks interpretability and robustness. To address these issues, we propose a VAD-disentangled Variational AutoEncoder (VAD-VAE), which first introduces a target utterance reconstruction task based on a Variational Autoencoder, then disentangles three affect representations, Valence, Arousal, and Dominance (VAD), from the latent space. We also enhance the disentangled representations by introducing VAD supervision signals from a sentiment lexicon and minimising the mutual information between VAD distributions. Experiments show that VAD-VAE outperforms the state-of-the-art model on two datasets. Further analysis proves the effectiveness of each proposed module and the quality of the disentangled VAD representations.
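
The abstract names two generic building blocks: a Variational Autoencoder latent space and a split of that space into Valence, Arousal, and Dominance parts. The sketch below is an illustration only and is not taken from the paper. It shows the standard reparameterisation trick, the closed-form KL term against a standard normal prior, and a hypothetical latent layout (one dimension each for V, A, and D, followed by content dimensions). The function names, the layout, and the dimension sizes are all assumptions; the lexicon supervision and the mutual-information penalty mentioned in the abstract are not modelled here.

```python
import math
import random


def reparameterise(mu, log_var, rng):
    """Sample z = mu + sigma * eps with eps ~ N(0, 1), per dimension."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(mu, log_var)]


def kl_to_standard_normal(mu, log_var):
    """KL(N(mu, diag(exp(log_var))) || N(0, I)), summed over dimensions."""
    return 0.5 * sum(math.exp(lv) + m * m - 1.0 - lv
                     for m, lv in zip(mu, log_var))


def split_latent(z, content_dim):
    """Split z into named parts.

    Hypothetical layout (an assumption, not the paper's): indices 0, 1, 2
    hold valence, arousal, dominance; the remaining dims hold content.
    """
    if len(z) != 3 + content_dim:
        raise ValueError("expected 3 affect dims plus content_dim content dims")
    return {
        "valence": z[0],
        "arousal": z[1],
        "dominance": z[2],
        "content": z[3:],
    }


if __name__ == "__main__":
    rng = random.Random(0)
    mu = [0.2, -0.1, 0.4, 0.0, 0.0]          # illustrative encoder outputs
    log_var = [-1.0, -1.0, -1.0, 0.0, 0.0]
    z = reparameterise(mu, log_var, rng)
    parts = split_latent(z, content_dim=2)
    print(sorted(parts))
    print(kl_to_standard_normal(mu, log_var))
```

In a full model, the KL term would be added to a reconstruction loss for the target utterance, and each affect part would receive its own supervision; those pieces depend on details that this record does not give.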


Pages: 508-518

Number of pages: 11
