Emotion recognition in conversations with emotion shift detection based on multi-task learning

被引:13
|
作者
Gao, Qingqing [1 ]
Cao, Biwei [1 ]
Guan, Xin [1 ]
Gu, Tianyun [2 ]
Bao, Xing [3 ,4 ]
Wu, Junyan [4 ]
Liu, Bo [2 ,5 ]
Cao, Jiuxin [1 ,5 ]
机构
[1] Southeast Univ, Sch Cyber Sci & Engn, Nanjing, Peoples R China
[2] Southeast Univ, Sch Comp Sci & Engn, Nanjing, Peoples R China
[3] Univ Sci & Technol China, Hefei, Peoples R China
[4] Aerospace Informat Res Inst, Suzhou, Peoples R China
[5] Purple Mt Labs, Nanjing, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Emotion analysis; Emotion shift detection; Emotion recognition in conversations; Multi -task learning;
D O I
10.1016/j.knosys.2022.108861
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Emotion recognition in conversations (ERC) has attracted increasing attention from the research community for its wide applications. For ERC, the main challenge is how to effectively utilize the conversational context based on the adequate analysis of each utterance. Current research ignores to use the emotion shift information when modeling the conversational context which tends to result in recognition performance inadequacy. We believe that employing emotion shift as explicit guidance would help to further improve the performance of ERC. Therefore, we propose a multi-task learning model ESD-ERC, which comprises the auxiliary task of Emotion Shift Detection (ESD) and the main task of ERC. The model exploits a shared BERT-based encoder to extract the unified emotion semantic representations, obtains the emotion shift representations through ESD based on Bi-directional Long Short-Term Memory Neural Network and feeds the emotion semantic representations concatenated with the emotion shift representations into the context level Transformer with positional encoding for ERC. The comparative experiment results show that our model outperforms the state-of-the-art models on two different datasets, validating our idea about emotion shift. In addition, we verify the effectiveness of each component of ESD-ERC by ablation experiments and explain the significance of ESD by case study. (C)& nbsp;2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Speech Emotion Recognition based on Multi-Task Learning
    Zhao, Huijuan
    Han Zhijie
    Wang, Ruchuan
    [J]. 2019 IEEE 5TH INTL CONFERENCE ON BIG DATA SECURITY ON CLOUD (BIGDATASECURITY) / IEEE INTL CONFERENCE ON HIGH PERFORMANCE AND SMART COMPUTING (HPSC) / IEEE INTL CONFERENCE ON INTELLIGENT DATA AND SECURITY (IDS), 2019, : 186 - 188
  • [2] Multi-task Learning for Speech Emotion and Emotion Intensity Recognition
    Yue, Pengcheng
    Qu, Leyuan
    Zheng, Shukai
    Li, Taihao
    [J]. PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1232 - 1237
  • [3] Speech Emotion Recognition with Multi-task Learning
    Cai, Xingyu
    Yuan, Jiahong
    Zheng, Renjie
    Huang, Liang
    Church, Kenneth
    [J]. INTERSPEECH 2021, 2021, : 4508 - 4512
  • [4] Meta Multi-task Learning for Speech Emotion Recognition
    Cai, Ruichu
    Guo, Kaibin
    Xu, Boyan
    Yang, Xiaoyan
    Zhang, Zhenjie
    [J]. INTERSPEECH 2020, 2020, : 3336 - 3340
  • [5] Emotion Recognition With Sequential Multi-task Learning Technique
    Phan Tran Dac Thinh
    Hoang Manh Hung
    Yang, Hyung-Jeong
    Kim, Soo-Hyung
    Lee, Guee-Sang
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 3586 - 3589
  • [6] Inconsistency-Based Multi-Task Cooperative Learning for Emotion Recognition
    Xu, Yifan
    Cui, Yuqi
    Jiang, Xue
    Yin, Yingjie
    Ding, Jingting
    Li, Liang
    Wu, Dongrui
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2022, 13 (04) : 2017 - 2027
  • [7] Logistic Regression Based Multi-task, Multi-kernel Learning for Emotion Recognition
    He, Xinrun
    Huang, Jian
    Zeng, Zhigang
    [J]. 2021 6TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2021), 2021, : 572 - 577
  • [8] Coarse-to-Fine Speech Emotion Recognition Based on Multi-Task Learning
    Zhao Huijuan
    Ye Ning
    Wang Ruchuan
    [J]. Journal of Signal Processing Systems, 2021, 93 : 299 - 308
  • [9] Coarse-to-Fine Speech Emotion Recognition Based on Multi-Task Learning
    Zhao, Huijuan
    Ye, Ning
    Wang, Ruchuan
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2021, 93 (2-3): : 299 - 308
  • [10] Multi-Task Learning Framework for Extracting Emotion Cause Span and Entailment in Conversations
    Bhat, Ashwani
    Modi, Ashutosh
    [J]. TRANSFER LEARNING FOR NATURAL LANGUAGE PROCESSING WORKSHOP, VOL 203, 2022, 203 : 33 - 51