Emotion recognition from speech: a review

被引:177
|
作者
Koolagudi, Shashidhar G. [1 ]
Rao, K. Sreenivasa [1 ]
机构
[1] Indian Inst Technol Kharagpur, Sch Informat Technol, Kharagpur 721302, W Bengal, India
关键词
Emotion recognition; Simulated emotional speech corpus; Elicited speech corpus; Natural speech corpus; Excitation source features; System features; Prosodic features; Classification models;
D O I
10.1007/s10772-011-9125-1
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Emotion recognition from speech has emerged as an important research area in the recent past. In this regard, review of existing work on emotional speech processing is useful for carrying out further research. In this paper, the recent literature on speech emotion recognition has been presented considering the issues related to emotional speech corpora, different types of speech features and models used for recognition of emotions from speech. Thirty two representative speech databases are reviewed in this work from point of view of their language, number of speakers, number of emotions, and purpose of collection. The issues related to emotional speech databases used in emotional speech recognition are also briefly discussed. Literature on different features used in the task of emotion recognition from speech is presented. The importance of choosing different classification models has been discussed along with the review. The important issues to be considered for further emotion recognition research in general and in specific to the Indian context have been highlighted where ever necessary.
引用
收藏
页码:99 / 117
页数:19
相关论文
共 50 条
  • [41] Speech Emotion Recognition Systems: A Comprehensive Review on Different Methodologies
    Audre Arlene Anthony
    Chandreshekar Mohan Patil
    [J]. Wireless Personal Communications, 2023, 130 : 515 - 525
  • [42] Analyzing the influence of different speech data corpora and speech features on speech emotion recognition: A review
    Rathi, Tarun
    Tripathy, Manoj
    [J]. SPEECH COMMUNICATION, 2024, 162
  • [43] Evaluating intonational features for emotion recognition from speech
    Zervas, Panagiotis
    Mporas, Iosif
    Fakotakis, Nikos
    Kokkinakis, George
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2007, 16 (06) : 1001 - 1014
  • [44] EMOTION RECOGNITION FROM SPEECH: PUTTING ASR IN THE LOOP
    Schuller, Bjoern
    Batliner, Anton
    Steidl, Stefan
    Seppi, Dino
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4585 - +
  • [45] Emotion recognition and acoustic analysis from speech signal
    Park, CH
    Sim, KB
    [J]. PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 2594 - 2598
  • [46] Emotion Recognition from Speech for an Interactive Robot Agent
    Anjum, Madiha
    [J]. 2019 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII), 2019, : 363 - 368
  • [47] Emotion recognition and evaluation from Mandarin speech signals
    Pao, Tsanglong
    Chen, Yute
    Yeh, Junheng
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2008, 4 (07): : 1695 - 1709
  • [48] Amplitude Modulation Features for Emotion Recognition from Speech
    Alam, Md Jahangir
    Attabi, Yazid
    Dumouchel, Pierre
    Kenny, Patrick
    O'Shaughnessy, D.
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2419 - 2423
  • [49] Emotion Recognition and Spoof Detection from Whispered Speech
    Sivan, Dawn
    Gopakumar, C.
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC), 2017, : 1091 - 1095
  • [50] Improving Automatic Emotion Recognition from Speech Signals
    Bozkurt, Elif
    Erzin, Engin
    Erdem, Cigdem Eroglu
    Erdem, A. Tanju
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 312 - +