Speech Emotion Recognition Based on Minimal Voice Quality Features

Citations: 0
Author
Jacob, Agnes [1]
Affiliation
[1] Govt Engn Coll, Appl Elect & Instrumentat Dept, West Hill, Kozhikode, Kerala, India
Keywords
Jitter; Statistical Analysis; Shimmer; Speech Emotion Recognition
DOI
Not available
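Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification code
0808; 0809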
Abstract
This paper presents the results of investigations into speech emotion recognition (SER) in English and Hindi based on micro-perturbations in pitch, called jitter, and very small variations in intensity, called shimmer. Jitter and shimmer are proposed as minimal, reliable, and effective features for SER because such minute variations in pitch and intensity are difficult to produce artificially, without actually experiencing the emotions. Identifying such a minimal feature set could save time and effort, which is significant in the present SER scenario, where the performance of emotion recognition systems relies on hundreds of features whose collection is time-consuming. The investigations were conducted on a database of induced emotional speech from female speakers, developed exclusively for this purpose. In total, 2765 wave files in English and 2240 wave files in Hindi were statistically analyzed. Multiple classifiers were used to validate the classification results. Maximum overall accuracies of 64.8% for English SER and 83.3% for Hindi SER were obtained with an ANN classifier when classifying seven different emotions.
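The abstract describes jitter as cycle-to-cycle perturbation in pitch and shimmer as cycle-to-cycle variation in intensity, but it does not state which formulas were used. As an illustration only, the sketch below assumes the widely used "local" definitions (mean absolute difference between consecutive cycles, divided by the mean value). The function names and the sample values are hypothetical and are not taken from the paper.

```python
from statistics import mean


def local_perturbation(values):
    """Mean absolute difference between consecutive values, relative to their mean.

    Assumption: this is the common "local" perturbation measure; the paper's
    abstract does not say which variant of jitter or shimmer it used.
    """
    if len(values) < 2:
        raise ValueError("need at least two glottal cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    return mean(diffs) / mean(values)


def jitter_local(periods):
    """Local jitter from a sequence of consecutive pitch periods (seconds)."""
    return local_perturbation(periods)


def shimmer_local(amplitudes):
    """Local shimmer from the peak amplitudes of the same consecutive cycles."""
    return local_perturbation(amplitudes)


# Made-up values for four consecutive cycles, for illustration only.
periods = [0.0100, 0.0102, 0.0099, 0.0101]   # seconds per cycle
amplitudes = [0.80, 0.82, 0.79, 0.81]        # linear peak amplitude

print(f"jitter (local):  {jitter_local(periods):.4%}")
print(f"shimmer (local): {shimmer_local(amplitudes):.4%}")
```

In practice the periods and amplitudes would come from a pitch-marking step applied to each wave file; that step is outside this sketch. The resulting two numbers per utterance would form the kind of minimal feature vector the abstract argues for.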
Pages: 886-890
Page count: 5
Related papers
50 in total
  • [1] Voice Quality Features for Speech Emotion Recognition
    Idris, Inshirah
    Salam, Md Sah Hj
    [J]. JOURNAL OF INFORMATION ASSURANCE AND SECURITY, 2015, 10 (04): 183 - 191
  • [2] Emotion Recognition in Chinese Natural Speech by Combining Prosody and Voice Quality Features
    Zhang, Shiqing
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2008, PT 2, PROCEEDINGS, 2008, 5264 : 457 - 464
  • [3] Speech Emotion Recognition Based on Voice Fundamental Frequency
    Dimitrova-Grekow, Teodora
    Klis, Aneta
    Igras-Cybulska, Magdalena
    [J]. ARCHIVES OF ACOUSTICS, 2019, 44 (02) : 277 - 286
  • [4] Speech Emotion Recognition Based on Arabic Features
    Meddeb, Mohamed
    Karray, Hichem
    Alimi, Adel M.
    [J]. 2015 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2015, : 46 - 51
  • [5] The relevance of voice quality features in speaker independent emotion recognition
    Lugger, Marko
    Yang, Bin
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 17 - +
  • [6] Investigating voice features for Speech emotion recognition based on four kinds of machine learning methods
    Chen, Haiyan
    Liu, Zheng
    Kang, Xin
    Nishide, Shun
    Ren, Fuji
    [J]. PROCEEDINGS OF 2019 6TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2019, : 195 - 199
  • [7] Voice Emotion Recognition Based on Color Histogram Features
    da Rocha, Marcelo Marques
    Conci, Aura
    Muchaluat Saade, Debora Christina
    [J]. 2023 IEEE 36TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, CBMS, 2023, : 341 - 347
  • [8] Informative Speech Features based on Emotion Classes and Gender in Explainable Speech Emotion Recognition
    Yildirim, Huseyin Ediz
    Iren, Deniz
    [J]. 2023 11TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS, ACIIW, 2023,
  • [9] RECOGNITION OF EMOTION IN SPEECH USING VARIOGRAM BASED FEATURES
    Esmaileyan, Zeynab
    Marvi, Hosein
    [J]. MALAYSIAN JOURNAL OF COMPUTER SCIENCE, 2014, 27 (03) : 156 - 170
  • [10] SPEECH EMOTION CLASSIFICATION USING SVM AND MLP ON PROSODIC AND VOICE QUALITY FEATURES
    Idris, Inshirah
    Salam, Md Sah Hj
    Sunar, Mohd Shahrizal
    [J]. JURNAL TEKNOLOGI, 2016, 78 (2-2): 27 - 33