Discrimination Between Pathological and Normal Voices Using GMM-SVM Approach

被引:24
|
作者
Wang, Xiang [1 ]
Zhang, Jianping [1 ]
Yan, Yonghong [1 ]
机构
[1] Chinese Acad Sci, Inst Acoust, Thinkit Speech Lab, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Pathological voices; GMM-SVM; TO-NOISE RATIO; SPEAKER ADAPTATION; IDENTIFICATION;
D O I
10.1016/j.jvoice.2009.08.002
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Acoustic features of vocal tract function are used widely in the study of pathological voices detection. Classification of normal and pathological voices by acoustic parameters is a useful way to diagnose voice diseases. In this aspect, mel-frequency cepstral coefficients are proved to be effective with traditional classifiers such as Gaussian Mixture Model (GMM). However, the accuracy of the classification method can be further improved. In this article, a Gaussian mixture model supervector kernel-support vector machine (GMM-SVM) classifier is compared with GMM classifier for the detection of voice pathology. We found that a sustain vowel phonation can be classified as normal or pathological with an accuracy of 96.1%. Voice recordings are selected from the Kay database to carry out the experiments. Experimental results show that equal error rates decrease from 8.0% for GMM to 4.6% for GMM-SVM.
引用
收藏
页码:38 / 43
页数:6
相关论文
共 50 条
  • [1] Automatic Detection of Pathological Voices Using GMM-SVM Method
    Wang, Xiang
    Zhang, Jianping
    Yan, Yonghong
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOLS 1-4, 2009, : 525 - 528
  • [2] MiniVectors: an Improved GMM-SVM Approach for Speaker Verification
    Anguera, Xavier
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2323 - 2326
  • [3] Innovative Automatic Discrimination Multimedia Documents for Indexing using Hybrid GMM-SVM Method
    Turkia, Debabi
    Souha, Bousselmi
    Adnen, Cherif
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (01) : 274 - 279
  • [4] COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION
    Trabelsi, I.
    Ben Ayed, D.
    Ellouze, N.
    JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2016, 11 (09): : 1221 - 1233
  • [5] Phoneme Independent Pathological Voice Detection Using Wavelet Based MFCCs, GMM-SVM Hybrid Classifier
    Vikram, C. M.
    Umarani, K.
    2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 929 - 934
  • [6] The GMM-SVM Supervector Approach for the Recognition of the Emotional Status from Speech
    Schwenker, Friedhelm
    Scherer, Stefan
    Magdi, Yasmine M.
    Palm, Guenther
    ARTIFICIAL NEURAL NETWORKS - ICANN 2009, PT I, 2009, 5768 : 894 - +
  • [7] Identity authentication by sensed acoustic voices from a speaking person using an efficient GMM-SVM dual modeling framework
    Ing-Jr Ding
    Zih-Jheng Lin
    Microsystem Technologies, 2018, 24 : 3 - 8
  • [8] Non-linguistic Vocalisation Recognition Based on Hybrid GMM-SVM Approach
    Janicki, Artur
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 153 - 157
  • [9] Identity authentication by sensed acoustic voices from a speaking person using an efficient GMM-SVM dual modeling framework
    Ding, Ing-Jr
    Lin, Zih-Jheng
    MICROSYSTEM TECHNOLOGIES-MICRO-AND NANOSYSTEMS-INFORMATION STORAGE AND PROCESSING SYSTEMS, 2018, 24 (01): : 3 - 8
  • [10] Automatic Laughter Detection in Spontaneous Speech Using GMM-SVM Method
    Neuberger, Tilda
    Beke, Andras
    TEXT, SPEECH, AND DIALOGUE, TSD 2013, 2013, 8082 : 113 - 120