Cluster-Level Contrastive Learning for Emotion Recognition in Conversations

Cited by: 27
Authors
Yang, Kailai [1, 2]
Zhang, Tianlin [1, 2]
Alhuzali, Hassan [3]
Ananiadou, Sophia [1, 2]
Affiliations
[1] Univ Manchester, NaCTeM, Manchester M13 9PL, England
[2] Univ Manchester, Dept Comp Sci, Manchester M13 9PL, England
[3] Umm Al Qura Univ, Coll Comp & Informat Syst, Mecca 24382, Saudi Arabia
Funding
UK Biotechnology and Biological Sciences Research Council (BBSRC);
Keywords
Emotion recognition; Prototypes; Linguistics; Task analysis; Semantics; Training; Adaptation models; Cluster-level contrastive learning; emotion recognition in conversations; pre-trained knowledge adapters; valence-arousal-dominance; DIALOGUE; FRAMEWORK;
DOI
10.1109/TAFFC.2023.3243463
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A key challenge for Emotion Recognition in Conversations (ERC) is to distinguish semantically similar emotions. Some works utilise Supervised Contrastive Learning (SCL), which uses categorical emotion labels as supervision signals and contrasts in a high-dimensional semantic space. However, categorical labels fail to provide quantitative information between emotions. ERC is also not equally dependent on all embedded features in the semantic space, which makes high-dimensional SCL inefficient. To address these issues, we propose a novel low-dimensional Supervised Cluster-level Contrastive Learning (SCCL) method, which first reduces the high-dimensional SCL space to a three-dimensional affect representation space, Valence-Arousal-Dominance (VAD), and then performs cluster-level contrastive learning to incorporate measurable emotion prototypes. To help model the dialogue and enrich the context, we leverage pre-trained knowledge adapters to infuse linguistic and factual knowledge. Experiments show that our method achieves new state-of-the-art results with 69.81% on IEMOCAP, 65.7% on MELD, and 62.51% on DailyDialog datasets. The analysis also proves that the VAD space is not only suitable for ERC but also interpretable, with VAD prototypes enhancing its performance and stabilising the training of SCCL. In addition, the pre-trained knowledge adapters benefit the performance of the utterance encoder and SCCL. Our code is available at: https://github.com/SteveKGYang/SCCLI
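The cluster-level idea in the abstract can be illustrated with a small sketch. It is not the authors' implementation. It assumes utterance features have already been projected to three VAD coordinates, averages them per emotion label within a batch, and scores each cluster centre against a set of emotion prototypes with a softmax over negative squared distances (an InfoNCE-style stand-in; the paper's exact loss may differ). The names `PROTOTYPES`, `cluster_centers`, and `cluster_contrastive_loss`, the prototype values, and the temperature are all hypothetical.

```python
import math
from collections import defaultdict

# Hypothetical (valence, arousal, dominance) prototypes in [0, 1].
# Placeholder values for illustration only, not taken from any lexicon.
PROTOTYPES = {
    "joy": (0.9, 0.7, 0.7),
    "anger": (0.1, 0.9, 0.6),
    "sadness": (0.1, 0.2, 0.2),
}


def cluster_centers(points, labels):
    """Average the 3-D points that share a label within one batch."""
    groups = defaultdict(list)
    for point, label in zip(points, labels):
        groups[label].append(point)
    return {
        label: tuple(sum(axis) / len(members) for axis in zip(*members))
        for label, members in groups.items()
    }


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def cluster_contrastive_loss(points, labels, prototypes, temperature=0.1):
    """Cross-entropy of each cluster centre against all prototypes.

    The centre of cluster k is pulled towards prototype k and pushed
    away from the other prototypes. Returns the mean over the clusters
    present in the batch.
    """
    centers = cluster_centers(points, labels)
    total = 0.0
    for label, center in centers.items():
        logits = {
            name: -sq_dist(center, proto) / temperature
            for name, proto in prototypes.items()
        }
        # Numerically stable log-sum-exp over all prototypes.
        max_logit = max(logits.values())
        log_norm = max_logit + math.log(
            sum(math.exp(v - max_logit) for v in logits.values())
        )
        total += log_norm - logits[label]
    return total / len(centers)


if __name__ == "__main__":
    batch = [(0.85, 0.65, 0.7), (0.95, 0.75, 0.7), (0.1, 0.25, 0.2)]
    batch_labels = ["joy", "joy", "sadness"]
    print(cluster_contrastive_loss(batch, batch_labels, PROTOTYPES))
```

Because the contrast is computed once per cluster centre in three dimensions rather than once per sample pair in the full embedding space, the loss is cheap to evaluate, and the distances to the prototypes stay directly readable as differences in valence, arousal, and dominance.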
Pages: 3269-3280
Page count: 12
Related papers
50 in total
  • [41] SCCL Text Deep Clustering with Increased Cluster-Level Comparison
    Li J.
    Zhang Z.
    Wang Y.
    Data Analysis and Knowledge Discovery, 2024, 8 (03) : 98 - 109
  • [42] Disentangled Variational Autoencoder for Emotion Recognition in Conversations
    Yang, Kailai
    Zhang, Tianlin
    Ananiadou, Sophia
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, 15 (02) : 508 - 518
  • [43] A Cluster-Level Semi-supervision Model for Interactive Clustering
    Dubey, Avinava
    Bhattacharya, Indrajit
    Godbole, Shantanu
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT I: EUROPEAN CONFERENCE, ECML PKDD 2010, 2010, 6321 : 409 - 424
  • [44] Fusing pairwise modalities for emotion recognition in conversations
    Fan, Chunxiao
    Lin, Jie
    Mao, Rui
    Cambria, Erik
    INFORMATION FUSION, 2024, 106
  • [45] Hypergraph Neural Network for Emotion Recognition in Conversations
    Zheng, Cheng
    Xu, Haojie
    Sun, Xiao
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (02)
  • [46] Exploiting Unsupervised Data for Emotion Recognition in Conversations
    Jiao, Wenxiang
    Lyu, Michael R.
    King, Irwin
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 4839 - 4846
  • [47] C-BGA: Multimodal Speech Emotion Recognition Network Combining Contrastive Learning
    Miao, Borui
    Xu, Yunfeng
    Zhao, Shaojie
    Wang, Jialin
    Computer Engineering and Applications, 60 (16) : 168 - 176
  • [48] Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations
    Wu, Wen
    Zhang, Chao
    Woodland, Philip C.
    INTERSPEECH 2023, 2023, : 3607 - 3611
  • [49] Cross-subject emotion recognition with contrastive learning based on EEG signal correlations
    Hu, Mengting
    Xu, Dan
    He, Kangjian
    Zhao, Kunyuan
    Zhang, Hao
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 104
  • [50] Forensics Filesystem with Cluster-Level Identifiers for Efficient Data Recovery
    Alhussein, Mohammed
    Srinivasan, Avinash
    Wijesekera, Duminda
    2012 INTERNATIONAL CONFERENCE FOR INTERNET TECHNOLOGY AND SECURED TRANSACTIONS, 2012, : 411 - 415