A Review on Emotion Recognition using Speech

被引:0
|
作者
Basu, Saikat [1 ,2 ]
Chakraborty, Jaybrata [3 ]
Bag, Arnab [4 ]
Aftabuddin, Md. [5 ]
机构
[1] Indian Inst Technol, Sch Med Sci & Technol, IEEE, Kharagpur, W Bengal, India
[2] Maulana Abul Kalam Azad Univ Technol, Dept Comp Sci & Engn, Kolkata, W Bengal, India
[3] Maulana Abul Kalam Azad Univ Technol, Dept Informat Technol, Kolkata, W Bengal, India
[4] Indian Inst Technol, Dept Elect & Elect Commun Engn, Kharagpur, W Bengal, India
[5] Maulana Abul Kalam Azad Univ Technol, Kolkata, W Bengal, India
关键词
Affect Detection; Corpora; Features; MFCC (Mel Frequency Cepstral Coefficient); LPCC (Linear Prediction Cepstral Coefficients); LPC (Linear Prediction Coefficients); Classifier; Neural Network; GMM (Gaussian Mixture Model); HMM (Hidden Markov Model); KNN (K; Nearest Neighbors); MLP (Multi Layer Perceptron); RNN (Recurrent Neural Network); Back Propagation; FEATURES;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Emotion recognition or affect detection from speech is an old and challenging problem in the field of artificial intelligence. Many significant research works have been done on emotion recognition. In this paper, the recent works on affect detection using speech and different issues related to affect detection has been presented. The primary challenges of emotion recognition are choosing the emotion recognition corpora ( speech database), identification of different features related to speech and an appropriate choice of a classification model. Different types of methods to collect emotional speech data and issues related to them are covered by this presentation along with the previous works review. Literature survey on different features used for recognizing emotion from human speech has been discussed. The significance of various classification models has been presented along with some recent research works review. A detailed description of a prime feature extraction technique named Mel Frequency Cepstral Coefficient (MFCC) and brief description of the working principle of some classification models are also discussed here. In this paper terms like affect detection and emotion recognition are used interchangeably.
引用
收藏
页码:109 / 114
页数:6
相关论文
共 50 条
  • [41] Deep Multimodal Emotion Recognition on Human Speech: A Review
    Koromilas, Panagiotis
    Giannakopoulos, Theodoros
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (17):
  • [42] Urdu Speech Emotion Recognition: A Systematic Literature Review
    Taj, Soonh
    Mujtaba, Ghulam
    Daudpota, Sher Muhammad
    Mughal, Muhammad Hussain
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (07)
  • [43] A systematic literature review of speech emotion recognition approaches
    Singh, Youddha Beer
    Goel, Shivani
    [J]. NEUROCOMPUTING, 2022, 492 : 245 - 263
  • [44] Deep Learning Techniques for Speech Emotion Recognition : A Review
    Pandey, Sandeep Kumar
    Shekhawat, H. S.
    Prasanna, S. R. M.
    [J]. 2019 29TH INTERNATIONAL CONFERENCE RADIOELEKTRONIKA (RADIOELEKTRONIKA), 2019, : 197 - 202
  • [45] Databases, features and classifiers for speech emotion recognition: a review
    Swain, Monorama
    Routray, Aurobinda
    Kabisatpathy, P.
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2018, 21 (01) : 93 - 120
  • [46] Speech Emotion Recognition
    Lalitha, S.
    Madhavan, Abhishek
    Bhushan, Bharath
    Saketh, Srinivas
    [J]. 2014 INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRONICS, COMPUTERS AND COMMUNICATIONS (ICAECC), 2014,
  • [47] An Emotion Estimation from Human Speech Using Speech Recognition and Speech Synthesize
    Kurematsu, Masaki
    Ohashi, Marina
    Kinosita, Orimi
    Hakura, Jun
    Fujita, Hamido
    [J]. NEW TRENDS IN SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES, 2008, 182 : 278 - 289
  • [48] Speech emotion recognition using the novel PEmoNet (Parallel Emotion Network)
    Bhangale, Kishor B.
    Kothandaraman, Mohanaprasad
    [J]. APPLIED ACOUSTICS, 2023, 212
  • [49] A review on emotion recognition from dialect speech using feature optimization and classification techniques
    Thimmaiah, Sunil
    Vinay, N. A.
    Ravikumar, M. G.
    Prasad, S. R.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (29) : 73793 - 73793
  • [50] Detecting Human Emotion via Speech Recognition by Using Speech Spectrogram
    Prasomphan, Sathit
    [J]. PROCEEDINGS OF THE 2015 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (IEEE DSAA 2015), 2015, : 113 - 122