A Review on Emotion Recognition using Speech

被引:0
|
作者
Basu, Saikat [1 ,2 ]
Chakraborty, Jaybrata [3 ]
Bag, Arnab [4 ]
Aftabuddin, Md. [5 ]
机构
[1] Indian Inst Technol, Sch Med Sci & Technol, IEEE, Kharagpur, W Bengal, India
[2] Maulana Abul Kalam Azad Univ Technol, Dept Comp Sci & Engn, Kolkata, W Bengal, India
[3] Maulana Abul Kalam Azad Univ Technol, Dept Informat Technol, Kolkata, W Bengal, India
[4] Indian Inst Technol, Dept Elect & Elect Commun Engn, Kharagpur, W Bengal, India
[5] Maulana Abul Kalam Azad Univ Technol, Kolkata, W Bengal, India
关键词
Affect Detection; Corpora; Features; MFCC (Mel Frequency Cepstral Coefficient); LPCC (Linear Prediction Cepstral Coefficients); LPC (Linear Prediction Coefficients); Classifier; Neural Network; GMM (Gaussian Mixture Model); HMM (Hidden Markov Model); KNN (K; Nearest Neighbors); MLP (Multi Layer Perceptron); RNN (Recurrent Neural Network); Back Propagation; FEATURES;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Emotion recognition or affect detection from speech is an old and challenging problem in the field of artificial intelligence. Many significant research works have been done on emotion recognition. In this paper, the recent works on affect detection using speech and different issues related to affect detection has been presented. The primary challenges of emotion recognition are choosing the emotion recognition corpora ( speech database), identification of different features related to speech and an appropriate choice of a classification model. Different types of methods to collect emotional speech data and issues related to them are covered by this presentation along with the previous works review. Literature survey on different features used for recognizing emotion from human speech has been discussed. The significance of various classification models has been presented along with some recent research works review. A detailed description of a prime feature extraction technique named Mel Frequency Cepstral Coefficient (MFCC) and brief description of the working principle of some classification models are also discussed here. In this paper terms like affect detection and emotion recognition are used interchangeably.
引用
收藏
页码:109 / 114
页数:6
相关论文
共 50 条
  • [1] Speech Emotion Recognition Using Deep Learning Techniques: A Review
    Khalil, Ruhul Amin
    Jones, Edward
    Babar, Mohammad Inayatullah
    Jan, Tariqullah
    Zafar, Mohammad Haseeb
    Alhussain, Thamer
    [J]. IEEE ACCESS, 2019, 7 : 117327 - 117345
  • [2] Speech emotion recognition using machine learning - A systematic review
    Madanian, Samaneh
    Chen, Talen
    Adeleye, Olayinka
    Templeton, John Michael
    Poellabauer, Christian
    Parry, Dave
    Schneidere, Sandra L.
    [J]. INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 20
  • [3] An ongoing review of speech emotion recognition
    de Lope, Javier
    Grana, Manuel
    [J]. NEUROCOMPUTING, 2023, 528 : 1 - 11
  • [4] Dimensional Speech Emotion Recognition Review
    Li, Hai-Feng
    Chen, Jing
    Ma, Lin
    Bo, Hong-Jian
    Xu, Cong
    Li, Hong-Wei
    [J]. Ruan Jian Xue Bao/Journal of Software, 2020, 31 (08): : 2465 - 2491
  • [5] Emotion recognition from speech: a review
    Koolagudi, Shashidhar G.
    Rao, K. Sreenivasa
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2012, 15 (02) : 99 - 117
  • [6] Emotion Recognition using Imperfect Speech Recognition
    Metze, Florian
    Batliner, Anton
    Eyben, Florian
    Polzehl, Tim
    Schuller, Bjoern
    Steidl, Stefan
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 478 - +
  • [7] Speech Emotion Recognition using DWT
    Lalitha, S.
    Mudupu, Anoop
    Nandyala, Bala Visali
    Munagala, Renuka
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2015, : 20 - 23
  • [8] Speech Emotion Recognition Using CNN
    Huang, Zhengwei
    Dong, Ming
    Mao, Qirong
    Zhan, Yongzhao
    [J]. PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 801 - 804
  • [9] A Review on Speech Emotion Recognition Using Deep Learning and Attention Mechanism
    Lieskovska, Eva
    Jakubec, Maros
    Jarina, Roman
    Chmulik, Michal
    [J]. ELECTRONICS, 2021, 10 (10)
  • [10] A Comprehensive Review of Speech Emotion Recognition Systems
    Wani, Taiba Majid
    Gunawan, Teddy Surya
    Qadri, Syed Asif Ahmad
    Kartiwi, Mira
    Ambikairajah, Eliathamby
    [J]. IEEE ACCESS, 2021, 9 : 47795 - 47814