A Preliminary Study on Cross-Databases Emotion Recognition using the Glottal Features in Speech

被引:0
|
作者
Sun, Rui [1 ]
Moore, Elliot, II [1 ]
机构
[1] Georgia Inst Technol, Dept Elect & Comp Engn, Savannah, GA USA
关键词
emotion recognition; cross-databases; glottal features; pitch; CLASSIFICATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While the majority of traditional research in emotional speech recognition has focused on the use of a single database for assessment, it is clear that the lack of large databases has presented a significant challenge in generalizing results for the purposes of building a robust emotion classification system. Recently, work has been reported on cross-training emotional databases to examine consistency and reliability of acoustic measures in performing emotional assessment. This paper presents preliminary results on the use of glottal-based features in cross-testing (i.e., training on one database and testing on another) across 3 databases for emotion recognition of neutral, angry, happy, and sad. A comparative study is also presented using pitch-based features. The results suggest that the glottal features are more robust to the 4-class emotion classification system developed in this study and are able to perform well above chance for several of the cross-testing experiments.
引用
收藏
页码:1626 / 1629
页数:4
相关论文
共 50 条
  • [31] SPEECH EMOTION RECOGNITION USING SELF-SUPERVISED FEATURES
    Morais, Edmilson
    Hoory, Ron
    Zhu, Weizhong
    Gat, Itai
    Damasceno, Matheus
    Aronowitz, Hagai
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6922 - 6926
  • [32] Emotion Recognition from Speech using Prosodic and Linguistic Features
    Pervaiz, Mahwish
    Khan, Tamim Ahmed
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (08) : 84 - 90
  • [33] Speech Emotion Recognition using MFCC features and LSTM network
    Kumbhar, Harshawardhan S.
    Bhandari, Sheetal U.
    [J]. 2019 5TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, CONTROL AND AUTOMATION (ICCUBEA), 2019,
  • [34] SPEECH EMOTION RECOGNITION USING AUTOENCODER BOTTLENECK FEATURES AND LSTM
    Huang, Kun-Yi
    Wu, Chung-Hsien
    Yang, Tsung-Hsien
    Su, Ming-Hsiang
    Chou, Jia-Hui
    [J]. 2016 INTERNATIONAL CONFERENCE ON ORANGE TECHNOLOGIES (ICOT), 2018, : 1 - 4
  • [35] Speech emotion recognition: Emotional models, databases, features, preprocessing methods, supporting modalities, and classifiers
    Akcay, Mehmet Berkehan
    Oguz, Kaya
    [J]. SPEECH COMMUNICATION, 2020, 116 (116) : 56 - 76
  • [36] Spoken emotion recognition through optimum-path forest classification using glottal features
    Iliev, Alexander I.
    Scordilis, Michael S.
    Papa, Joao P.
    Falcao, Alexandre X.
    [J]. COMPUTER SPEECH AND LANGUAGE, 2010, 24 (03): : 445 - 460
  • [37] Improved Emotion Recognition Using Gaussian Mixture Model and Extreme Learning Machine in Speech and Glottal Signals
    Muthusamy, Hariharan
    Polat, Kemal
    Yaacob, Sazali
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [38] Urdu Speech Emotion Recognition using Speech Spectral Features and Deep Learning Techniques
    Taj, Soonh
    Shaikh, Ghulam Mujtaba
    Hassan, Saif
    Nimra
    [J]. 2023 4th International Conference on Computing, Mathematics and Engineering Technologies: Sustainable Technologies for Socio-Economic Development, iCoMET 2023, 2023,
  • [39] Investigation of Results Using Various Databases and Algorithms for Music Player Using Speech Emotion Recognition
    Deshmukh, Shrikala
    Gupta, Preeti
    Mane, Prashant
    [J]. PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR 2021), 2022, 417 : 205 - 215
  • [40] Emotion recognition from speech using global and local prosodic features
    Rao K.S.
    Koolagudi S.G.
    Vempada R.R.
    [J]. International Journal of Speech Technology, 2013, 16 (2) : 143 - 160