A graph neural network with context filtering and feature correction for conversational emotion recognition

被引:6
|
作者
Gan, Chenquan [1 ,2 ]
Zheng, Jiahao [1 ]
Zhu, Qingyi [2 ]
Kumar, Deepak [3 ,4 ]
Struc, Vitomir [5 ]
机构
[1] Chongqing Univ Posts & Telecommun, Sch Commun & Informat Engn, Chongqing 400065, Peoples R China
[2] Chongqing Univ Posts & Telecommun, Sch Cyber Secur & Informat Law, Chongqing 400065, Peoples R China
[3] Dalian Univ Technol, Sch Artificial Intelligence, Key Lab Intelligent Control & Optimizat Ind Equipm, Minist Educ, Dalian 116024, Peoples R China
[4] Symbiosis Int Univ, Symbiosis Inst Technol, Pune 412115, India
[5] Univ Ljubljana, Fac Elect Engn, Trzaska Cesta 25, SI-1000 Ljubljana, Slovenia
关键词
Conversational emotion recognition; Context filter; Feature correction; Graph network; SENTIMENT; MODEL;
D O I
10.1016/j.ins.2023.120017
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Conversational emotion recognition represents an important machine-learning problem with a wide variety of deployment possibilities. The key challenge in this area is how to properly capture the key conversational aspects that facilitate reliable emotion recognition, including utterance semantics, temporal order, informative contextual cues, speaker interactions as well as other relevant factors. In this paper, we present a novel Graph Neural Network approach for conversational emotion recognition at the utterance level. Our method addresses the outlined challenges and represents conversations in the form of graph structures that naturally encode temporal order, speaker dependencies, and even long-distance context. To efficiently capture the semantic content of the conversations, we leverage the zero-shot feature-extraction capabilities of pre-trained large-scale language models and then integrate two key contributions into the graph neural network to ensure competitive recognition results. The first is a novel context filter that establishes meaningful utterance dependencies for the graph construction procedure and removes low-relevance and uninformative utterances from being used as a source of contextual information for the recognition task. The second contribution is a feature-correction procedure that adjusts the information content in the generated feature representations through a gating mechanism to improve their discriminative power and reduce emotion-prediction errors. We conduct extensive experiments on four commonly used conversational datasets, i.e., IEMOCAP, MELD, Dailydialog, and EmoryNLP, to demonstrate the capabilities of the developed graph neural network with context filtering and error-correction capabilities. The results of the experiments point to highly promising performance, especially when compared to state-of-the-art competitors from the literature.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Context-Dependent Domain Adversarial Neural Network for Multimodal Emotion Recognition
    Lian, Zheng
    Tao, Jianhua
    Liu, Bin
    Huang, Jian
    Yang, Zhanlei
    Li, Rongjun
    INTERSPEECH 2020, 2020, : 394 - 398
  • [32] Noise Filtering and Feature Enhancement Based Graph Neural Network Method for Fraud Detection
    Li K.-H.
    Huang Z.-H.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2023, 51 (11): : 3053 - 3060
  • [33] Revisiting Disentanglement and Fusion on Modality and Context in Conversational Multimodal Emotion Recognition
    Li, Bobo
    Fei, Hao
    Liao, Lizi
    Zhao, Yu
    Teng, Chong
    Chua, Tat-Seng
    Ji, Donghong
    Li, Fei
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5923 - 5934
  • [34] Topics Guided Multimodal Fusion Network for Conversational Emotion Recognition
    Yuan, Peicong
    Cai, Guoyong
    Chen, Ming
    Tang, Xiaolv
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT III, ICIC 2024, 2024, 14877 : 250 - 262
  • [35] MALN: Multimodal Adversarial Learning Network for Conversational Emotion Recognition
    Ren, Minjie
    Huang, Xiangdong
    Liu, Jing
    Liu, Ming
    Li, Xuanya
    Liu, An-An
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (11) : 6965 - 6980
  • [36] Performance Analysis of Graph Neural Network (GNN) for Manufacturing Feature Recognition Problem
    Betkier, Igor
    Oszczypala, Mateusz
    Pobozniak, Janusz
    Sobieski, Sergiusz
    2023 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE, HPEC, 2023,
  • [37] Graph Isomorphism Network for Speech Emotion Recognition
    Liu, Jiawang
    Wang, Haoxiang
    INTERSPEECH 2021, 2021, : 3405 - 3409
  • [38] Emotion Recognition with Capsule Neural Network
    Loan Trinh Van
    Quang H Nguyen
    Thuy Dao Thi Le
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2022, 41 (03): : 1083 - 1098
  • [39] Efficient Emotion Recognition based on Hybrid Emotion Recognition Neural Network
    Ou, Yang-Yen
    Su, Bo-Hao
    Tseng, Shih-Pang
    Hsu, Liu-Yi-Cheng
    Wang, Jhing-Fa
    Kuan, Ta-Wen
    2018 INTERNATIONAL CONFERENCE ON ORANGE TECHNOLOGIES (ICOT), 2018,
  • [40] Accurate Emotion Recognition Utilizing Extracted EEG Sources as Graph Neural Network Nodes
    Asadzadeh, Shiva
    Rezaii, Tohid Yousefi
    Beheshti, Soosan
    Meshgini, Saeed
    COGNITIVE COMPUTATION, 2023, 15 (01) : 176 - 189