Speech emotion recognition using nonlinear dynamics features

被引:11
|
作者
Shahzadi, Ali [1 ]
Ahmadyfard, Alireza [2 ]
Harimi, Ali [1 ]
Yaghmaie, Khashayar [1 ]
机构
[1] Semnan Univ, Dept Elect & Comp Engn, Semnan, Iran
[2] Shahrood Univ Technol, Dept Elect Engn, Shahrood, Iran
关键词
Nonlinear dynamics features; phase space reconstruction; speech emotion recognition; Fisher discriminant ratio; CLASSIFICATION; INFORMATION; SELECTION;
D O I
10.3906/elk-1302-90
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent developments in man-machine interaction have motivated researchers to recognize human emotion from speech signals. In this study, we propose using nonlinear dynamics features (NLDs) for emotion recognition. NLDs are extracted from the geometrical properties of the reconstructed phase space of speech signals. The traditional prosodic and spectral features are also used as a benchmark. The Fisher discriminant ratio acts as a filter to remove irrelevant features quickly. Then a wrapper method based on a genetic algorithm and support vector machine is employed to find the best feature subset that obtains the maximum recognition rate. The classification accuracy of the proposed system is evaluated using a 10-fold cross-validation technique on the Berlin database. Our results show that combining the proposed features with prosodic and spectral features notably reduces the classification ambiguity between joy and anger, which are highly confused. The NLDs further render a substantial improvement of 3.32% for females and 7.27% for males in recognition performance when used to augment prosodic and spectral features. Finally, by using all types of features for classifying 7 emotion categories, overall recognition rates of 82.72% and 85.90% are obtained for females and males, respectively.
引用
收藏
页码:2056 / 2073
页数:18
相关论文
共 50 条
  • [31] Emotion recognition from speech signals using new harmony features
    Yang, B.
    Lugger, M.
    [J]. SIGNAL PROCESSING, 2010, 90 (05) : 1415 - 1423
  • [32] Improving Speech Emotion Recognition System Using Spectral and Prosodic Features
    Chakhtouna, Adil
    Sekkate, Sara
    Adib, Abdellah
    [J]. INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, ISDA 2021, 2022, 418 : 399 - 409
  • [33] Novel acoustic features for speech emotion recognition
    Yong-Wan Roh
    Dong-Ju Kim
    Woo-Seok Lee
    Kwang-Seok Hong
    [J]. Science in China Series E: Technological Sciences, 2009, 52 : 1838 - 1848
  • [34] Applying articulatory features to speech emotion recognition
    Zhou, Yu
    Sun, Yanqing
    Yang, Lin
    Yan, Yonghong
    [J]. 2009 INTERNATIONAL CONFERENCE ON RESEARCH CHALLENGES IN COMPUTER SCIENCE, ICRCCS 2009, 2009, : 73 - 76
  • [35] Learning Transferable Features for Speech Emotion Recognition
    Marczewski, Alison
    Veloso, Adriano
    Ziviani, Nivio
    [J]. PROCEEDINGS OF THE THEMATIC WORKSHOPS OF ACM MULTIMEDIA 2017 (THEMATIC WORKSHOPS'17), 2017, : 529 - 536
  • [36] Novel acoustic features for speech emotion recognition
    ROH Yong-Wan
    KIM Dong-Ju
    LEE Woo-Seok
    HONG Kwang-Seok
    [J]. Science China Technological Sciences, 2009, 52 (07) : 1838 - 1848
  • [37] Exploiting the potentialities of features for speech emotion recognition
    Li, Dongdong
    Zhou, Yijun
    Wang, Zhe
    Gao, Daqi
    [J]. INFORMATION SCIENCES, 2021, 548 : 328 - 343
  • [38] Significance of Phonological Features in Speech Emotion Recognition
    Wei Wang
    Paul A. Watters
    Xinyi Cao
    Lingjie Shen
    Bo Li
    [J]. International Journal of Speech Technology, 2020, 23 : 633 - 642
  • [39] Significance of Phonological Features in Speech Emotion Recognition
    Wang, Wei
    Watters, Paul A.
    Cao, Xinyi
    Shen, Lingjie
    Li, Bo
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (03) : 633 - 642
  • [40] Adding dimensional features for emotion recognition on speech
    Ben Letaifa, Leila
    Ines Torres, Maria
    Justo, Raquel
    [J]. 2020 5TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP'2020), 2020,