Automatic Speech Emotion Recognition: A Survey

被引:0
|
作者
Chandrasekar, Purnima [1 ]
Chapaneri, Santosh [1 ]
Jayaswal, Deepak [1 ]
机构
[1] Univ Mumbai, Dept Elect & Telecommun Engn, St Francis Inst Technol, Bombay, Maharashtra, India
关键词
feature extraction; dimensionality reduction; feature classification;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The area of Automatic Speech Emotion Recognition (ASER) has garnered a lot of interest among researchers. The framework of ASER typically includes three steps viz, speech feature extraction, dimensionality reduction and feature classification. At the base of this framework lies the design and recording of the database of emotional states through which the most popular set of emotions-happiness, sadness, anger, fear, disgust, boredom (which are typically called as 'archetypal emotions') and neutral among others have been obtained. This paper surveys the extent of work done in this field especially highlighting the three steps of the ASER framework Starting with the different languages that have been explored till date for creating the databases, this paper attempts to categorize the features that have been typically extracted, enlist the dimensionality reduction techniques that have been chosen and discuss the pros and cons, if any, of the feature classifiers that have been modelled.
引用
收藏
页码:341 / 346
页数:6
相关论文
共 50 条
  • [41] Automatic Speech Recognition Using Limited Vocabulary: A Survey
    Fendji, Jean Louis K. E.
    Tala, Diane C. M.
    Yenke, Blaise O.
    Atemkeng, Marcellin
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
  • [42] Discriminative training of HMMs for automatic speech recognition: A survey
    Jiang, Hui
    [J]. COMPUTER SPEECH AND LANGUAGE, 2010, 24 (04): : 589 - 608
  • [43] Survey on speech emotion recognition: Features, classification schemes, and databases
    El Ayadi, Moataz
    Kamel, Mohamed S.
    Karray, Fakhri
    [J]. PATTERN RECOGNITION, 2011, 44 (03) : 572 - 587
  • [44] Automatic Emotion Recognition Based on Body Movement Analysis A Survey
    Zacharatos, Haris
    Gatzoulis, Christos
    Chrysanthou, Yiorgos L.
    [J]. IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2014, 34 (06) : 35 - 45
  • [45] A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
    Esmaileyan, Z.
    Marvi, H.
    [J]. INTERNATIONAL JOURNAL OF ENGINEERING, 2014, 27 (01): : 79 - 89
  • [46] Pertinent feature selection techniques for automatic emotion recognition in stressed speech
    Tiwari P.
    Darji A.D.
    [J]. International Journal of Speech Technology, 2022, 25 (02) : 511 - 526
  • [47] User Identity Protection in Automatic Emotion Recognition through Disguised Speech
    Haider, Fasih
    Albert, Pierre
    Luz, Saturnino
    [J]. AI, 2021, 2 (04) : 636 - 649
  • [48] UNSUPERVISED LEARNING APPROACH TO FEATURE ANALYSIS FOR AUTOMATIC SPEECH EMOTION RECOGNITION
    Eskimez, Sefik Emre
    Duan, Zhiyao
    Heinzelman, Wendi
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5099 - 5103
  • [49] Transformer-CNN Automatic Hyperparameter Tuning for Speech Emotion Recognition
    Gumelar, Agustinus Bimo
    Yuniarno, Eko Mulyanto
    Adi, Derry Pramono
    Setiawan, Rudi
    Sugiarto, Indar
    Purnomo, Mauridhi Hery
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGING SYSTEMS AND TECHNIQUES (IST 2022), 2022,
  • [50] Speech Emotion Recognition
    Lalitha, S.
    Madhavan, Abhishek
    Bhushan, Bharath
    Saketh, Srinivas
    [J]. 2014 INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRONICS, COMPUTERS AND COMMUNICATIONS (ICAECC), 2014,