Human Emotional States Classification Based upon Changes in Speech Production Features in Vowel Regions

Cited by: 0

Authors:

Mohanta, Abhijit [1]

Mittal, Vinay Kumar [1]

Affiliation:

[1] Indian Inst Informat Technol Chittoor, Sri City, AP, India

Source:

2017 2ND INTERNATIONAL CONFERENCE ON TELECOMMUNICATION AND NETWORKS (TEL-NET) | 2017

Keywords:

F0; Formants; SVM; LP spectrum; LP residual; LINEAR PREDICTION; TUTORIAL

DOI:

Not available

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronics and communication technology]

Discipline classification code:

0808 ; 0809 ;

Abstract:

The speech signal produced by the human speech production system carries emotions that humans perceive easily. In this paper, we aim to classify four basic emotional states, namely happy, anger, fear, and neutral, by analyzing changes in speech production features in the vowel regions of the speech signal. A Telugu emotional speech database is used. Changes in two production features are examined in each case: the instantaneous fundamental frequency (F0) as the source feature, and the first three formants (F1, F2, and F3) as the filter (system) features. These features are derived from the speech signal using the linear prediction (LP) residual and the LP spectrum, respectively. The features are examined in the speech segments of five Telugu vowels that have corresponding English vowels, i.e., /a/, /e/, /i/, /o/, and /u/. These vowel regions are detected manually in the speech signal. The emotional states are then classified using a Support Vector Machine (SVM) classifier. The results indicate that, for both male and female speakers, the vowels /a/ and /e/ have a higher mean F0 in the anger state than in the happy, fear, and neutral states. The classification accuracy of the SVM classifier is highest for the happy state and lowest for the fear state. These insights should be helpful in developing other applications that involve emotional speech.
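The abstract does not give implementation details, so the following is only a minimal sketch of one common way to read formant candidates off an LP spectrum: fit LP coefficients to a windowed frame with the autocorrelation method (Levinson-Durbin recursion), evaluate the all-pole spectrum on an FFT grid, and take the lowest-frequency peaks. The function names `lp_coefficients` and `lp_spectrum_peaks`, the Hamming window, the LP order, and the FFT size are all assumptions made for illustration and are not taken from the paper.

```python
import numpy as np


def lp_coefficients(frame, order):
    """LP coefficients [1, a1, ..., ap] via the autocorrelation method."""
    windowed = frame * np.hamming(len(frame))
    n = len(windowed)
    # Autocorrelation at lags 0..order.
    r = np.correlate(windowed, windowed, mode="full")[n - 1 : n + order]
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    # Levinson-Durbin recursion.
    for i in range(1, order + 1):
        k = -(r[i] + np.dot(a[1:i], r[i - 1 : 0 : -1])) / err
        prev = a.copy()
        a[1:i] = prev[1:i] + k * prev[i - 1 : 0 : -1]
        a[i] = k
        err *= 1.0 - k * k
    return a


def lp_spectrum_peaks(frame, fs, order=10, n_fft=1024, n_peaks=3):
    """Frequencies (Hz) of the lowest n_peaks local maxima of the LP spectrum."""
    a = lp_coefficients(np.asarray(frame, dtype=float), order)
    # LP spectrum in dB: the inverse of |A(e^{jw})| on an FFT grid.
    spectrum_db = -20.0 * np.log10(np.abs(np.fft.rfft(a, n_fft)) + 1e-12)
    freqs = np.fft.rfftfreq(n_fft, d=1.0 / fs)
    inner = spectrum_db[1:-1]
    is_peak = (inner > spectrum_db[:-2]) & (inner >= spectrum_db[2:])
    return freqs[1:-1][is_peak][:n_peaks]


if __name__ == "__main__":
    # Synthetic frame with three resonance-like components (not real speech).
    fs = 8000
    t = np.arange(240) / fs
    frame = sum(np.sin(2 * np.pi * f * t) for f in (700.0, 1200.0, 2600.0))
    print(lp_spectrum_peaks(frame, fs, order=6))
```

On real vowel segments, the LP order would typically be chosen relative to the sampling rate, and the peak list would be checked against plausible formant ranges before use as classifier features.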

Pages: 172-177

Number of pages: 6
