Transformer-Based Potential Emotional Relation Mining Network for Emotion Recognition in Conversation

被引:0
|
作者
Shi, Yunwei [1 ]
Sun, Xiao [2 ,3 ]
机构
[1] Anhui Univ, Hefei, Anhui, Peoples R China
[2] Hefei Univ Technol, Hefei, Anhui, Peoples R China
[3] Hefei Comprehens Natl Sci Ctr, Hefei, Anhui, Peoples R China
关键词
Emotion recognition in conversation; Transformer encoder; Natural language processing;
D O I
10.1007/978-981-99-2401-1_22
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Emotion recognition in conversation (ERC) has attracted much attention due to its widespread applications in the field of human communication analysis. Compared with the vanilla sentiment analysis of the single utterance, the ERC task which aims to judge the emotion labels of utterances in the conversation requiresmodeling both the contextual information and the speaker dependency. However, previous models are limited in exploring the potential emotional relation of the utterances. To address the problem, we propose a novel transformer-based potential emotion relation mining network (TPERMN) to better explore the potential emotional relation and integrate the emotional clues. First, we utilize the global gated recurrence unit to extract the situation-level emotion vector. Then, different speakerGRU are assigned to different speakers to capture the intra-speaker dependency of the utterances and obtain the speaker-level emotion vector. Second, a potential relation mining transformer called PERformer is devised to extract the potential emotional relation and integrate emotional clues for the situation-level and speaker-level emotion vector. In PERformer, we combine graph attention and multi-head attention mechanism to explore the deep semantic information and potential emotional relation. And an emotion augment block is designed to enhance and complement the inherent characteristics. After multi-layer accumulation, the updated representation is obtained for emotion classification. Detailed experiments on two public ERC datasets demonstrate our model outperforms the state-of-the-art models.
引用
收藏
页码:238 / 251
页数:14
相关论文
共 50 条
  • [1] Robust Multimodal Emotion Recognition from Conversation with Transformer-Based Crossmodality Fusion
    Xie, Baijun
    Sidulova, Mariia
    Park, Chung Hyuk
    SENSORS, 2021, 21 (14)
  • [2] A transformer-based network for speech recognition
    Tang L.
    International Journal of Speech Technology, 2023, 26 (02) : 531 - 539
  • [3] TDFNet: Transformer-Based Deep-Scale Fusion Network for Multimodal Emotion Recognition
    Zhao, Zhengdao
    Wang, Yuhua
    Shen, Guang
    Xu, Yuezhu
    Zhang, Jiayuan
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 3771 - 3782
  • [4] Exploring Wearable Emotion Recognition with Transformer-Based Continual Learning
    Rizza, Federica
    Bellitto, Giovanni
    Calcagno, Salvatore
    Palazzo, Simone
    ARTIFICIAL INTELLIGENCE IN PANCREATIC DISEASE DETECTION AND DIAGNOSIS, AND PERSONALIZED INCREMENTAL LEARNING IN MEDICINE, AIPAD 2024, PILM 2024, 2025, 15197 : 86 - 101
  • [5] ERTNet: an interpretable transformer-based framework for EEG emotion recognition
    Liu, Ruixiang
    Chao, Yihu
    Ma, Xuerui
    Sha, Xianzheng
    Sun, Limin
    Li, Shuo
    Chang, Shijie
    FRONTIERS IN NEUROSCIENCE, 2024, 18
  • [6] The MERSA Dataset and a Transformer-Based Approach for Speech Emotion Recognition
    Zhang, Enshi
    Trujillo, Rafael
    Poellabauer, Christian
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 13960 - 13970
  • [7] Transformer-Based Self-Supervised Learning for Emotion Recognition
    Vazquez-Rodriguez, Juan
    Lefebvre, Gregoire
    Cumin, Julien
    Crowley, James L.
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2605 - 2612
  • [8] TNTC: TWO-STREAM NETWORK WITH TRANSFORMER-BASED COMPLEMENTARITY FOR GAIT-BASED EMOTION RECOGNITION
    Hu, Chuanfei
    Sheng, Weijie
    Dong, Bo
    Li, Xinde
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3229 - 3233
  • [9] A Transformer-Based Network for Dynamic Hand Gesture Recognition
    D'Eusanio, Andrea
    Simoni, Alessandro
    Pini, Stefano
    Borghi, Guido
    Vezzani, Roberto
    Cucchiara, Rita
    2020 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2020), 2020, : 623 - 632
  • [10] A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis
    Delbrouck, Jean-Benoit
    Tits, Noe
    Brousmiche, Mathilde
    Dupont, Stephane
    PROCEEDINGS OF THE SECOND GRAND CHALLENGE AND WORKSHOP ON MULTIMODAL LANGUAGE (CHALLENGE-HML), VOL 1, 2020, : 1 - 7