Emotional speech classification using Gaussian mixture models

Cited by: 20
Authors
Ververidis, D [1]
Kotropoulos, C [1]
Affiliations
[1] Aristotle Univ Thessaloniki, Dept Informat, Thessaloniki 54124, Greece
DOI
10.1109/ISCAS.2005.1465226
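Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline Classification Code
0808; 0809
Abstract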
In this paper, the classification of utterances into five basic emotional states is studied. A total of 87 statistical characteristics of pitch, energy, and formants are extracted from 500 utterances of the Danish Emotional Speech database. The classification capability of each feature is evaluated with respect to the probability of correct classification achieved by the Bayes classifier that models the feature probability density function as a mixture of Gaussian densities. Next, the feature subset that yields the highest probability of correct classification is found using the Sequential Floating Forward Selection algorithm. The probability of correct classification is estimated via cross-validation, and the probability density functions are modelled as mixtures of 2 or 3 Gaussian densities. The results demonstrate that the Bayes classifier that employs mixtures of 2 Gaussian densities can achieve a probability of correct classification equal to 0.55, whereas the human classification score is 0.67 for the database considered and random classification would give a probability of correct classification equal to 0.20.
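The single-feature evaluation described in the abstract can be illustrated with a minimal sketch: each class-conditional density of one feature is modelled as a mixture of 2 Gaussians fitted by EM, and an utterance is assigned to the class with the largest prior-weighted density. This is not the authors' implementation. The function names, the quantile-based initialisation, the variance floor, the three class labels, and the synthetic feature values are all assumptions made for illustration.

```python
import math
import random


def gaussian_pdf(x, mean, var):
    """Density of a 1-D Gaussian with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def fit_gmm_1d(xs, n_components=2, n_iter=50, min_var=1e-3):
    """Fit a 1-D Gaussian mixture with EM; returns a list of (weight, mean, var)."""
    n = len(xs)
    xs_sorted = sorted(xs)
    # Assumed initialisation: component means at evenly spaced quantiles.
    means = [xs_sorted[int((k + 0.5) * n / n_components)] for k in range(n_components)]
    overall_mean = sum(xs) / n
    overall_var = max(sum((x - overall_mean) ** 2 for x in xs) / n, min_var)
    variances = [overall_var] * n_components
    weights = [1.0 / n_components] * n_components
    for _ in range(n_iter):
        # E-step: responsibility of each component for each sample.
        resp = []
        for x in xs:
            p = [w * gaussian_pdf(x, m, v) for w, m, v in zip(weights, means, variances)]
            total = sum(p) or 1e-300
            resp.append([pk / total for pk in p])
        # M-step: re-estimate weights, means, and variances.
        for k in range(n_components):
            nk = sum(r[k] for r in resp)
            if nk < 1e-8:
                continue  # leave an empty component unchanged
            weights[k] = nk / n
            means[k] = sum(r[k] * x for r, x in zip(resp, xs)) / nk
            var_k = sum(r[k] * (x - means[k]) ** 2 for r, x in zip(resp, xs)) / nk
            variances[k] = max(var_k, min_var)  # floor avoids collapse to zero
    return list(zip(weights, means, variances))


def mixture_pdf(x, gmm):
    """Density of the fitted mixture at x."""
    return sum(w * gaussian_pdf(x, m, v) for w, m, v in gmm)


def train_bayes(samples_by_class, n_components=2):
    """Estimate a prior and a GMM class-conditional density for each class."""
    total = sum(len(xs) for xs in samples_by_class.values())
    return {
        label: (len(xs) / total, fit_gmm_1d(xs, n_components))
        for label, xs in samples_by_class.items()
    }


def classify(x, model):
    """Bayes decision rule: pick the class maximising prior * density."""
    return max(model, key=lambda label: model[label][0] * mixture_pdf(x, model[label][1]))


# Synthetic, well-separated feature values for three hypothetical classes.
rng = random.Random(0)
data = {
    "neutral": [rng.gauss(0.0, 1.0) for _ in range(100)],
    "angry": [rng.gauss(10.0, 1.0) for _ in range(100)],
    "sad": [rng.gauss(-10.0, 1.0) for _ in range(100)],
}
model = train_bayes(data, n_components=2)
print(classify(0.1, model), classify(9.5, model), classify(-10.2, model))
```

The sketch covers only the classifier. Feature subset search with Sequential Floating Forward Selection and the cross-validated estimate of the probability of correct classification are not shown.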
Pages: 2871-2874
Number of pages: 4