Speech emotion recognition based on emotion perception

被引:0
|
作者
Liu, Gang [1 ]
Cai, Shifang [1 ]
Wang, Ce [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing, Peoples R China
关键词
Speech emotion recognition; Emotion perception; Implicit emotional attribute; Multi-task learning; ATTENTION;
D O I
10.1186/s13636-023-00289-4
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech emotion recognition (SER) is a hot topic in speech signal processing. With the advanced development of the cheap computing power and proliferation of research in data-driven methods, deep learning approaches are prominent solutions to SER nowadays. SER is a challenging task due to the scarcity of datasets and the lack of emotion perception. Most existing networks of SER are based on computer vision and natural language processing, so the applicability for extracting emotion is not strong. Drawing on the research results of brain science on emotion computing and inspired by the emotional perceptive process of the human brain, we propose an approach based on emotional perception, which designs a human-like implicit emotional attribute classification and introduces implicit emotional information through multi-task learning. Preliminary experiments show that the unweighted accuracy (UA) of the proposed method has increased by 2.44%, and weighted accuracy (WA) 3.18% (both absolute values) on the Interactive Emotional Dyadic Motion Capture (IEMOCAP) dataset, which verifies the effectiveness of our method.
引用
下载
收藏
页数:7
相关论文
共 50 条
  • [21] Speech Emotion Recognition Based on Sparse Representation
    Yan, Jingjie
    Wang, Xiaolan
    Gu, Weiyi
    Ma, Lili
    ARCHIVES OF ACOUSTICS, 2013, 38 (04) : 465 - 470
  • [22] Speech Emotion Recognition Based on PCA and CHMM
    Ke, Xianxin
    Cao, Bin
    Bai, Jiaojiao
    Yu, Qichao
    Yang, Dezhi
    PROCEEDINGS OF 2019 IEEE 8TH JOINT INTERNATIONAL INFORMATION TECHNOLOGY AND ARTIFICIAL INTELLIGENCE CONFERENCE (ITAIC 2019), 2019, : 667 - 671
  • [23] Speech Emotion Recognition Based on Learning Automata in
    Motamed, Sara
    Setayeshi, Saeed
    Farhoudi, Zeinab
    Ahmadi, Ali
    JOURNAL OF MATHEMATICS AND COMPUTER SCIENCE-JMCS, 2014, 12 (03): : 173 - 185
  • [24] Speech emotion recognition in web based service
    Huang, Huiqin
    Luo, Qi
    Zhu, Aiqin
    2007 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLS 1 AND 2: VOL 1: COMMUNICATION THEORY AND SYSTEMS; VOL 2: SIGNAL PROCESSING, COMPUTATIONAL INTELLIGENCE, CIRCUITS AND SYSTEMS, 2007, : 804 - +
  • [25] Speech emotion recognition based on HMM and SVM
    Lin, YL
    Wei, G
    PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, 2005, : 4898 - 4901
  • [26] Speech Emotion Recognition Based on Arabic Features
    Meddeb, Mohamed
    Karray, Hichem
    Alimi, Adel M.
    2015 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2015, : 46 - 51
  • [27] Speech Emotion Recognition Based on Feature Fusion
    Shen, Qi
    Chen, Guanggen
    Chang, Lin
    PROCEEDINGS OF THE 2017 2ND INTERNATIONAL CONFERENCE ON MATERIALS SCIENCE, MACHINERY AND ENERGY ENGINEERING (MSMEE 2017), 2017, 123 : 1071 - 1074
  • [28] Speech Emotion Recognition Based on Modified ReliefF
    Li, Guo-Min
    Liu, Na
    Zhang, Jun-Ao
    SENSORS, 2022, 22 (21)
  • [29] Speech Emotion Recognition Based on Henan Dialect
    Cheng, Zichen
    Li, Yan
    Jiu, Mengfei
    Ge, Jiangwei
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, VOL. 1, 2022, 878 : 498 - 505
  • [30] Speech Emotion Recognition based on Multi-Label Emotion Existence Model
    Ando, Atsushi
    Masumura, Ryo
    Kamiyama, Havana
    Kobashikawa, Satoshi
    Aono, Yushi
    INTERSPEECH 2019, 2019, : 2818 - 2822