Automatic speech based emotion recognition using paralinguistics features

被引：10

作者：

Hook, J. ^{[1
]}

Noroozi, F. ^{[1
]}

Toygar, O. ^{[2
]}

Anbarjafari, G. ^{[1
,3
]}

机构：

[1] Univ Tartu, Inst Technol, iCV Res Grp, EE-50411 Tartu, Estonia

[2] Eastern Mediterranean Univ, Dept Comp Engn, Via Mersin 10, Famagusta, Northern Cyprus, Turkey

[3] Hasan Kalyoncu Univ, Dept Elect & Elect Engn, Gaziantep, Turkey

来源：

BULLETIN OF THE POLISH ACADEMY OF SCIENCES-TECHNICAL SCIENCES | 2019年 / 67卷 / 03期

关键词：

random forests; speech emotion recognition; machine learning; support vector machines; RANDOM FORESTS;

D O I：

10.24425/bpasts.2019.129647

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

Affective computing studies and develops systems capable of detecting humans affects. The search for universal well-performing features for speech-based emotion recognition is ongoing. In this paper, a small set of features with support vector machines as the classifier is evaluated on Surrey Audio-Visual Expressed Emotion database, Berlin Database of Emotional Speech, Polish Emotional Speech database and Serbian emotional speech database. It is shown that a set of 87 features can offer results on-par with state-of-the-art, yielding 80.21, 88.6, 75.42 and 93.41% average emotion recognition rate, respectively. In addition, an experiment is conducted to explore the significance of gender in emotion recognition using random forests. Two models, trained on the first and second database, respectively, and four speakers were used to determine the effects. It is seen that the feature set used in this work performs well for both male and female speakers, yielding approximately 27% average emotion recognition in both models. In addition, the emotions for female speakers were recognized 18% of the time in the first model and 29% in the second. A similar effect is seen with male speakers: the first model yields 36%, the second 28% a verage emotion recognition rate. This illustrates the relationship between the constitution of training data and emotion recognition accuracy.

引用

页码：479 / 488

页数：10

共 50 条

[1] Automatic speech emotion recognition using modulation spectral features
Wu, Siqing
Falk, Tiago H.
Chan, Wai-Yip
[J]. SPEECH COMMUNICATION, 2011, 53 (05) : 768 - 785
[2] RECOGNITION OF EMOTION IN SPEECH USING VARIOGRAM BASED FEATURES
Esmaileyan, Zeynab
Marvi, Hosein
[J]. MALAYSIAN JOURNAL OF COMPUTER SCIENCE, 2014, 27 (03) : 156 - 170
[3] On the Correlation and Transferability of Features between Automatic Speech Recognition and Speech Emotion Recognition
Fayek, Haytham M.
Lech, Margaret
Cavedon, Lawrence
[J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3618 - 3622
[4] Automatic speech emotion recognition using an optimal combination of features based on EMD-TKEO
Kerkeni, Leila
Serrestou, Youssef
Raoof, Kosai
Mbarki, Mohamed
Mahjoub, Mohamed Ali
Cleder, Catherine
[J]. SPEECH COMMUNICATION, 2019, 114 : 22 - 35
[5] Speech Emotion Recognition using Combination of Features
Zhang, Qingli
An, Ning
Wang, Kunxia
Ren, Fuji
Li, Lian
[J]. PROCEEDINGS OF THE 2013 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2013, : 523 - 528
[6] Speech Emotion Recognition Based on Arabic Features
Meddeb, Mohamed
Karray, Hichem
Alimi, Adel M.
[J]. 2015 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2015, : 46 - 51
[7] AUTOMATIC EMOTION RECOGNITION IN SPEECH SIGNAL USING TEAGER ENERGY OPERATOR AND MFCC FEATURES
He, Ling
Lech, Margaret
Allen, Nicholas
[J]. 2011 3RD INTERNATIONAL CONFERENCE ON COMPUTER TECHNOLOGY AND DEVELOPMENT (ICCTD 2011), VOL 3, 2012, : 695 - 699
[8] Automatic Emotion Recognition in Compressed Speech Using Acoustic and Non-Linear Features
Garcia, N.
Vasquez-Correa, J. C.
Arias-Londono, J. D.
Vargas-Bonilla, J. F.
Orozco-Arroyave, J. R.
[J]. 2015 20TH SYMPOSIUM ON SIGNAL PROCESSING, IMAGES AND COMPUTER VISION (STSIVA), 2015,
[9] Emotion Recognition in Speech Using MFCC and Wavelet Features
Kishore, K. V. Krishna
Satish, P. Krishna
[J]. PROCEEDINGS OF THE 2013 3RD IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2013, : 842 - 847
[10] Speech emotion recognition using nonlinear dynamics features
Shahzadi, Ali
Ahmadyfard, Alireza
Harimi, Ali
Yaghmaie, Khashayar
[J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2015, 23 : 2056 - 2073

← 1 2 3 4 5 →