Speech emotion classification with the combination of statistic features and temporal features

被引:31
|
作者
Jiang, DN [1 ]
Cai, LH [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
关键词
D O I
10.1109/ICME.2004.1394647
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For classifying speech emotion, most previous systems used either statistic features or temporal features, exclusively. However, these two distinct feature representations appear to be concerned with different aspects of emotion, and should be combined in the task. This paper proposes a classification scheme that enables the combination of them. In the scheme, GMM and MM are first performed to model the statistic features and temporal features respectively. Then the GMM likelihoods and MM likelihoods are used as features in further procedure. Finally, Weighted ayesian Classifier and MLP are applied to accomplish the classification. Experiments on Chinese speech corpus have demonstrated that the scheme could improve the classification accuracy greatly. More detailed analysis indicated that these two feature representations could compensate each other efficiently in the classification.
引用
收藏
页码:1967 / 1970
页数:4
相关论文
共 50 条
  • [1] Speech Emotion Recognition using Combination of Features
    Zhang, Qingli
    An, Ning
    Wang, Kunxia
    Ren, Fuji
    Li, Lian
    [J]. PROCEEDINGS OF THE 2013 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2013, : 523 - 528
  • [2] EMOTION CLASSIFICATION OF SPEECH USING MODULATION FEATURES
    Chaspari, Theodora
    Dimitriadis, Dimitrios
    Maragos, Petros
    [J]. 2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 1552 - 1556
  • [3] Speech emotion recognition: Features and classification models
    Chen, Lijiang
    Mao, Xia
    Xue, Yuli
    Cheng, Lee Lung
    [J]. DIGITAL SIGNAL PROCESSING, 2012, 22 (06) : 1154 - 1160
  • [4] Speech Emotion Classification using Acoustic Features
    Chen, Shizhe
    Jin, Qin
    Li, Xirong
    Yang, Gang
    Xu, Jieping
    [J]. 2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 579 - 583
  • [5] Deep temporal clustering features for speech emotion recognition
    Lin, Wei-Cheng
    Busso, Carlos
    [J]. SPEECH COMMUNICATION, 2024, 157
  • [6] Robust emotion recognition by spectro-temporal modulation statistic features
    Tai-Shih Chi
    Lan-Ying Yeh
    Chin-Cheng Hsu
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2012, 3 : 47 - 60
  • [7] Robust emotion recognition by spectro-temporal modulation statistic features
    Chi, Tai-Shih
    Yeh, Lan-Ying
    Hsu, Chin-Cheng
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2012, 3 (01) : 47 - 60
  • [8] Ranking Speech Features for Their Usage in Singing Emotion Classification
    Zaporowski, Szymon
    Kostek, Bozena
    [J]. FOUNDATIONS OF INTELLIGENT SYSTEMS (ISMIS 2020), 2020, 12117 : 225 - 234
  • [9] Selection of Fractal Dimension Features for Speech Emotion Classification
    Tamulevicius, Gintautas
    Karbauskaite, Rasa
    Dzemvda, Gintautas
    [J]. 2017 OPEN CONFERENCE OF ELECTRICAL, ELECTRONIC AND INFORMATION SCIENCES (ESTREAM), 2017,
  • [10] Survey on speech emotion recognition: Features, classification schemes, and databases
    El Ayadi, Moataz
    Kamel, Mohamed S.
    Karray, Fakhri
    [J]. PATTERN RECOGNITION, 2011, 44 (03) : 572 - 587