Human Emotional States Classification Based upon Changes in Speech Production Features in Vowel Regions

被引:0
|
作者
Mohanta, Abhijit [1 ]
Mittal, Vinay Kumar [1 ]
机构
[1] Indian Inst Informat Technol Chittoor, Sri City, AP, India
关键词
F0; Formants; SVM; LP spectrum; LP residual; LINEAR PREDICTION; TUTORIAL;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Speech signal, that is produced by the human speech production system, carries emotions that the humans can perceive easily. In this paper, we aim to classify the four basic emotions, namely happy, anger, fear and neutral, by analyzing changes in the speech production features in the vowel regions of speech signal. A Telugu emotional speech database is used. Changes in two production features, the instantaneous fundamental frequency (F0) as the source feature, and first three Formants (F1, F2, and F3) as the filter (system) features, are examined in each case. These features are derived from the speech signal using the signal processing methods, i.e., linear prediction (LP) residual and LP spectrum, respectively. The features are examined in the speech segments of five Telugu vowels that have corresponding English vowels, i.e., /a/, /e/, /i/, /o/, and /u/. These vowel regions in the speech signal are detected manually. Further, the classification of emotional states is carried out using a Support Vector Machine (SVM) classifier. The results indicate that in the case of anger emotional state for both male and female speakers, the vowels /a/ and /e/ have higher mean F0 value, as compared to mean F0 for happy, fear and neutral states. Also, the classification accuracy of SVM classifier is observed to be highest for happy emotional state, and lowest for fear emotional state. This insight should be helpful in developing other diverse applications on emotional speech.
引用
收藏
页码:172 / 177
页数:6
相关论文
共 50 条
  • [1] Emotional Features for Speech Overlaps Classification
    Egorow, Olga
    Wendemuth, Andreas
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2356 - 2360
  • [2] Changes in Shout Features in Automatically Detected Vowel Regions
    Mittal, Vinay Kumar
    Vuppala, Anil Kumar
    2016 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM), 2016,
  • [3] Features importance analysis for emotional speech classification
    Tao, JH
    Kang, YG
    AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 2005, 3784 : 449 - 457
  • [4] Features extraction and selection for emotional speech classification
    Xiao, ZZ
    Dellandrea, E
    Dou, WB
    Chen, LM
    AVSS 2005: ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE, PROCEEDINGS, 2005, : 411 - 416
  • [5] EEG based Vowel Classification during Speech Imagery
    Idrees, Basil M.
    Farooq, Omar
    PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 1130 - 1134
  • [6] Emotion recognition from spontaneous speech using emotional vowel-like regions
    Fahad, Md Shah
    Singh, Shreya
    Abhinav
    Ranjan, Ashish
    Deepak, Akshay
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (10) : 14025 - 14043
  • [7] Emotion recognition from spontaneous speech using emotional vowel-like regions
    Md Shah Fahad
    Shreya Singh
    Ashish Abhinav
    Akshay Ranjan
    Multimedia Tools and Applications, 2022, 81 : 14025 - 14043
  • [8] Speech Emotional Features Extraction Based on Electroglottograph
    Chen, Lijiang
    Mao, Xia
    Wei, Pengfei
    Compare, Angelo
    NEURAL COMPUTATION, 2013, 25 (12) : 3294 - 3317
  • [9] Adult Speech in Different Emotional States: Temporal and Spectral Features
    Kurazhova, A. V.
    ACOUSTICAL PHYSICS, 2024, 70 (01) : 175 - 181
  • [10] Acoustic Features for Classification Based Speech Separation
    Wang, Yuxuan
    Han, Kun
    Wang, DeLiang
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1530 - 1533