Disentangled Variational Autoencoder for Emotion Recognition in Conversations

被引：5

作者：

Yang, Kailai ^{[1
]}

Zhang, Tianlin ^{[1
]}

Ananiadou, Sophia ^{[1
]}

机构：

[1] Univ Manchester, Dept Comp Sci, NaCTeM, Manchester M13 9PL, England

来源：

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING | 2024年 / 15卷 / 02期

基金：

英国生物技术与生命科学研究理事会;

关键词：

Task analysis; Emotion recognition; Hidden Markov models; Context modeling; Decoding; Oral communication; Gaussian distribution; Emotion recognition in conversations; variational autoencoder; valence-arousal-dominance; disentangled representations; DIALOGUE;

D O I：

10.1109/TAFFC.2023.3280038

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In Emotion Recognition in Conversations (ERC), the emotions of target utterances are closely dependent on their context. Therefore, existing works train the model to generate the response of the target utterance, which aims to recognise emotions leveraging contextual information. However, adjacent response generation ignores long-range dependencies and provides limited affective information in many cases. In addition, most ERC models learn a unified distributed representation for each utterance, which lacks interpretability and robustness. To address these issues, we propose a VAD-disentangled Variational AutoEncoder (VAD-VAE), which first introduces a target utterance reconstruction task based on Variational Autoencoder, then disentangles three affect representations Valence-Arousal-Dominance (VAD) from the latent space. We also enhance the disentangled representations by introducing VAD supervision signals from a sentiment lexicon and minimising the mutual information between VAD distributions. Experiments show that VAD-VAE outperforms the state-of-the-art model on two datasets. Further analysis proves the effectiveness of each proposed module and the quality of disentangled VAD representations.

引用

页码：508 / 518

页数：11

共 50 条

[1] Contrastively Disentangled Sequential Variational Autoencoder
Bai, Junwen
Wang, Weiran
Gomes, Carla
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[2] Disentangled Variational Autoencoder for Social Recommendation
Zhang, Yongshuai
Huang, Jiajin
Yang, Jian
NEURAL PROCESSING LETTERS, 2024, 56 (03)
[3] A Disentangled Recurrent Variational Autoencoder for Speech Enhancement
Yan, Hegen
Lu, Zhihua
2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1697 - 1702
[4] Facial landmark disentangled network with variational autoencoder
Sen Liang
Zhi-ze Zhou
Yu-dong Guo
Xuan Gao
Ju-yong Zhang
Hu-jun Bao
Applied Mathematics-A Journal of Chinese Universities, 2022, 37 : 290 - 305
[5] Facial landmark disentangled network with variational autoencoder
Liang, Sen
Zhou, Zhi-ze
Guo, Yu-dong
Gao, Xuan
Zhang, Ju-yong
Bao, Hu-jun
APPLIED MATHEMATICS-A JOURNAL OF CHINESE UNIVERSITIES SERIES B, 2022, 37 (02) : 290 - 305
[6] Facial landmark disentangled network with variational autoencoder
LIANG Sen
ZHOU Zhize
GUO Yudong
GAO Xuan
ZHANG Juyong
BAO Hujun
Applied Mathematics:A Journal of Chinese Universities, 2022, 37 (02) : 290 - 305
[7] DualVAE: Dual Disentangled Variational AutoEncoder for Recommendation
Guo, Zhiqiang
Li, Guohui
Li, Jianjun
Wang, Chaoyang
Shi, Si
PROCEEDINGS OF THE 2024 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2024, : 571 - 579
[8] Reconstructing Neutral Face Expressions with Disentangled Variational Autoencoder
Wiem, Grina
Ali, Douik
ADVANCES IN COMPUTER GRAPHICS, CGI 2023, PT II, 2024, 14496 : 83 - 94
[9] Counterfactual Fairness with Disentangled Causal Effect Variational Autoencoder
Kim, Hyemi
Shin, Seungjae
Jang, JoonHo
Song, Kyungwoo
Joo, Weonyoung
Kang, Wanmo
Moon, Il-Chul
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 8128 - 8136
[10] Variational Autoencoder based Latent Factor Decoding of Multichannel EEG for Emotion Recognition
Li, Xiang
Zhao, Zhigang
Song, Dawei
Zhang, Yazhou
Niu, Chunyang
Zhang, Junwei
Huo, Jidong
Li, Jing
2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 684 - 687

← 1 2 3 4 5 →