A graph neural network with context filtering and feature correction for conversational emotion recognition

被引:6
|
作者
Gan, Chenquan [1 ,2 ]
Zheng, Jiahao [1 ]
Zhu, Qingyi [2 ]
Kumar, Deepak [3 ,4 ]
Struc, Vitomir [5 ]
机构
[1] Chongqing Univ Posts & Telecommun, Sch Commun & Informat Engn, Chongqing 400065, Peoples R China
[2] Chongqing Univ Posts & Telecommun, Sch Cyber Secur & Informat Law, Chongqing 400065, Peoples R China
[3] Dalian Univ Technol, Sch Artificial Intelligence, Key Lab Intelligent Control & Optimizat Ind Equipm, Minist Educ, Dalian 116024, Peoples R China
[4] Symbiosis Int Univ, Symbiosis Inst Technol, Pune 412115, India
[5] Univ Ljubljana, Fac Elect Engn, Trzaska Cesta 25, SI-1000 Ljubljana, Slovenia
关键词
Conversational emotion recognition; Context filter; Feature correction; Graph network; SENTIMENT; MODEL;
D O I
10.1016/j.ins.2023.120017
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Conversational emotion recognition represents an important machine-learning problem with a wide variety of deployment possibilities. The key challenge in this area is how to properly capture the key conversational aspects that facilitate reliable emotion recognition, including utterance semantics, temporal order, informative contextual cues, speaker interactions as well as other relevant factors. In this paper, we present a novel Graph Neural Network approach for conversational emotion recognition at the utterance level. Our method addresses the outlined challenges and represents conversations in the form of graph structures that naturally encode temporal order, speaker dependencies, and even long-distance context. To efficiently capture the semantic content of the conversations, we leverage the zero-shot feature-extraction capabilities of pre-trained large-scale language models and then integrate two key contributions into the graph neural network to ensure competitive recognition results. The first is a novel context filter that establishes meaningful utterance dependencies for the graph construction procedure and removes low-relevance and uninformative utterances from being used as a source of contextual information for the recognition task. The second contribution is a feature-correction procedure that adjusts the information content in the generated feature representations through a gating mechanism to improve their discriminative power and reduce emotion-prediction errors. We conduct extensive experiments on four commonly used conversational datasets, i.e., IEMOCAP, MELD, Dailydialog, and EmoryNLP, to demonstrate the capabilities of the developed graph neural network with context filtering and error-correction capabilities. The results of the experiments point to highly promising performance, especially when compared to state-of-the-art competitors from the literature.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] Facial Landmark-Based Emotion Recognition via Directed Graph Neural Network
    Quang Tran Ngoc
    Lee, Seunghyun
    Song, Byung Cheol
    ELECTRONICS, 2020, 9 (05)
  • [42] SUNET: Speaker-utterance interaction Graph Neural Network for Emotion Recognition in Conversations
    Song, Rui
    Giunchiglia, Fausto
    Shi, Lida
    Shen, Qiang
    Xu, Hao
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [43] Accurate Emotion Recognition Utilizing Extracted EEG Sources as Graph Neural Network Nodes
    Shiva Asadzadeh
    Tohid Yousefi Rezaii
    Soosan Beheshti
    Saeed Meshgini
    Cognitive Computation, 2023, 15 : 176 - 189
  • [44] Static and Dynamic Speaker Modeling based on Graph Neural Network for Emotion Recognition in Conversation
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2022, : 247 - 253
  • [45] A cross-modal fusion network based on graph feature learning for multimodal emotion recognition
    Cao Xiaopeng
    Zhang Linying
    Chen Qiuxian
    Ning Hailong
    Dong Yizhuo
    The Journal of China Universities of Posts and Telecommunications, 2024, 31 (06) : 16 - 25
  • [46] A Dynamic Emotion Recognition System Based on Convolutional Feature Extraction and Recurrent Neural Network
    Yin, Yida
    Ayoub, Misbah
    Abel, Andrew
    Zhang, Haiyang
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2, 2023, 543 : 134 - 154
  • [47] Objectivity meets subjectivity: A subjective and objective feature fused neural network for emotion recognition
    Zhou, Sijin
    Huang, Dongmin
    Liu, Cheng
    Jiang, Dazhi
    Applied Soft Computing, 2022, 122
  • [48] Objectivity meets subjectivity: A subjective and objective feature fused neural network for emotion recognition
    Zhou, Sijin
    Huang, Dongmin
    Liu, Cheng
    Jiang, Dazhi
    APPLIED SOFT COMPUTING, 2022, 122
  • [49] A Multi-Level Alignment and Cross-Modal Unified Semantic Graph Refinement Network for Conversational Emotion Recognition
    Zhang, Xiaoheng
    Cui, Weigang
    Hu, Bin
    Li, Yang
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, 15 (03) : 1553 - 1566
  • [50] Graph Neural Network-Based Speech Emotion Recognition: A Fusion of Skip Graph Convolutional Networks and Graph Attention Networks
    Wang, Han
    Kim, Deok-Hwan
    ELECTRONICS, 2024, 13 (21)