A Preliminary Study on Cross-Databases Emotion Recognition using the Glottal Features in Speech

被引:0
|
作者
Sun, Rui [1 ]
Moore, Elliot, II [1 ]
机构
[1] Georgia Inst Technol, Dept Elect & Comp Engn, Savannah, GA USA
关键词
emotion recognition; cross-databases; glottal features; pitch; CLASSIFICATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While the majority of traditional research in emotional speech recognition has focused on the use of a single database for assessment, it is clear that the lack of large databases has presented a significant challenge in generalizing results for the purposes of building a robust emotion classification system. Recently, work has been reported on cross-training emotional databases to examine consistency and reliability of acoustic measures in performing emotional assessment. This paper presents preliminary results on the use of glottal-based features in cross-testing (i.e., training on one database and testing on another) across 3 databases for emotion recognition of neutral, angry, happy, and sad. A comparative study is also presented using pitch-based features. The results suggest that the glottal features are more robust to the 4-class emotion classification system developed in this study and are able to perform well above chance for several of the cross-testing experiments.
引用
收藏
页码:1626 / 1629
页数:4
相关论文
共 50 条
  • [1] Investigation of Glottal Features and Annotation Procedures for Speech Emotion Recognition
    Takebe, Masaaki
    Yamamoto, Kazumasa
    Nakagawa, Seiichi
    [J]. 2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [2] Speech Databases, Speech Features, and Classifiers in Speech Emotion Recognition: A Review
    Mohmad Dar, G.H.
    Delhibabu, Radhakrishnan
    [J]. IEEE Access, 2024, 12 : 151122 - 151152
  • [3] A Perspective Study on Speech Emotion Recognition: Databases, Features and Classification Models
    Raghu, Kogila
    Sadanandam, Manchala
    [J]. TRAITEMENT DU SIGNAL, 2021, 38 (06) : 1861 - 1873
  • [4] Databases, features and classifiers for speech emotion recognition: a review
    Swain, Monorama
    Routray, Aurobinda
    Kabisatpathy, P.
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2018, 21 (01) : 93 - 120
  • [5] Speech Emotion Recognition with Cross-lingual Databases
    Chiou, Bo-Chang
    Chen, Chia-Ping
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 558 - 561
  • [6] Survey on speech emotion recognition: Features, classification schemes, and databases
    El Ayadi, Moataz
    Kamel, Mohamed S.
    Karray, Fakhri
    [J]. PATTERN RECOGNITION, 2011, 44 (03) : 572 - 587
  • [7] Speech Emotion Recognition Using Cross-Correlation and Acoustic Features
    Chatterjee, Joyjit
    Mukesh, Vajja
    Hsu, Hui-Huang
    Vyas, Garima
    Liu, Zhen
    [J]. 2018 16TH IEEE INT CONF ON DEPENDABLE, AUTONOM AND SECURE COMP, 16TH IEEE INT CONF ON PERVAS INTELLIGENCE AND COMP, 4TH IEEE INT CONF ON BIG DATA INTELLIGENCE AND COMP, 3RD IEEE CYBER SCI AND TECHNOL CONGRESS (DASC/PICOM/DATACOM/CYBERSCITECH), 2018, : 243 - 249
  • [8] Emotion recognition in speech using inter-sentence Glottal statistics
    Iliev, Alexander I.
    Scordilis, Michael S.
    [J]. PROCEEDINGS OF IWSSIP 2008: 15TH INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING, 2008, : 465 - 468
  • [9] Characteristics of human auditory model based on compensation of glottal features in speech emotion recognition
    Ying, Sun
    Xue-Ying, Zhang
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 81 : 291 - 296
  • [10] Speech Emotion Recognition using Combination of Features
    Zhang, Qingli
    An, Ning
    Wang, Kunxia
    Ren, Fuji
    Li, Lian
    [J]. PROCEEDINGS OF THE 2013 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2013, : 523 - 528