Gender-Driven Emotion Recognition Through Speech Signals for Ambient Intelligence Applications

被引:57
|
作者
Bisio, Igor [1 ]
Delfino, Alessandro [1 ]
Lavagetto, Fabio [1 ]
Marchese, Mario [1 ]
Sciarrone, Andrea [1 ]
机构
[1] Univ Genoa, Dept Elect Elect Telecommun Engn & Naval Architec, I-16145 Genoa, Italy
关键词
Human-computer intelligent interaction; gender recognition; emotion recognition; pitch estimation; support vector machine; FEATURES; AUDIO;
D O I
10.1109/TETC.2013.2274797
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a system that allows recognizing a person's emotional state starting from audio signal registrations. The provided solution is aimed at improving the interaction among humans and computers, thus allowing effective human-computer intelligent interaction. The system is able to recognize six emotions (anger, boredom, disgust, fear, happiness, and sadness) and the neutral state. This set of emotional states is widely used for emotion recognition purposes. It also distinguishes a single emotion versus all the other possible ones, as proven in the proposed numerical results. The system is composed of two subsystems: 1) gender recognition (GR) and 2) emotion recognition (ER). The experimental analysis shows the performance in terms of accuracy of the proposed ER system. The results highlight that the a priori knowledge of the speaker's gender allows a performance increase. The obtained results show also that the features selection adoption assures a satisfying recognition rate and allows reducing the employed features. Future developments of the proposed solution may include the implementation of this system over mobile devices such as smartphones.
引用
收藏
页码:244 / 257
页数:14
相关论文
共 50 条
  • [31] Multi-modal emotion recognition using EEG and speech signals
    Wang, Qian
    Wang, Mou
    Yang, Yan
    Zhang, Xiaolei
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 149
  • [32] When Old Meets New: Emotion Recognition from Speech Signals
    Arano, Keith April
    Gloor, Peter
    Orsenigo, Carlotta
    Vercellis, Carlo
    [J]. COGNITIVE COMPUTATION, 2021, 13 (03) : 771 - 783
  • [33] Emotion Recognition Based on EMD-Wavelet Analysis of Speech Signals
    Shahnaz, C.
    Sultanas, S.
    Fattah, S. A.
    Rafi, R. H. M.
    Ahmmed, I.
    Zhu, W. -P.
    Ahmad, M. O.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2015, : 307 - 310
  • [34] MULTI-HEAD ATTENTION FOR SPEECH EMOTION RECOGNITION WITH AUXILIARY LEARNING OF GENDER RECOGNITION
    Nediyanchath, Anish
    Paramasivam, Periyasamy
    Yenigalla, Promod
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7179 - 7183
  • [35] Speaker and gender dependencies in within/cross linguistic Speech Emotion Recognition
    Chakhtouna A.
    Sekkate S.
    Adib A.
    [J]. International Journal of Speech Technology, 2023, 26 (03) : 609 - 625
  • [36] Gender-Aware CNN-BLSTM for Speech Emotion Recognition
    Zhang, Linjuan
    Wang, Longbiao
    Dang, Jianwu
    Guo, Lili
    Yu, Qiang
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT I, 2018, 11139 : 782 - 790
  • [37] Speech Emotion Recognition Based on Attention MCNN Combined With Gender Information
    Hu, Zhangfang
    LingHu, Kehuan
    Yu, Hongling
    Liao, Chenzhuo
    [J]. IEEE ACCESS, 2023, 11 : 50285 - 50294
  • [38] IMPROVING COMPUTER ASSISTED SPEECH THERAPY THROUGH SPEECH BASED EMOTION RECOGNITION
    Schipor, Ovidiu Andrei
    [J]. LET'S BUILD THE FUTURE THROUGH LEARNING INNOVATION!, VOL. 1, 2014, : 101 - 104
  • [39] Enhancing Emotion Recognition from Speech through Feature Selection
    Kostoulas, Theodoros
    Ganchev, Todor
    Lazaridis, Alexandros
    Fakotakis, Nikos
    [J]. TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 338 - 344
  • [40] DEEPFAKE SPEECH DETECTION THROUGH EMOTION RECOGNITION: A SEMANTIC APPROACH
    Conti, Emanuele
    Salvi, Davide
    Borrelli, Clara
    Hosler, Brian
    Bestagini, Paolo
    Antonacci, Fabio
    Sarti, Augusto
    Stamm, Matthew C.
    Tubaro, Stefano
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8962 - 8966