Discrimination Between Pathological and Normal Voices Using GMM-SVM Approach

Cited by: 24

Authors:

Wang, Xiang [1]

Zhang, Jianping [1]

Yan, Yonghong [1]

Affiliations:

[1] Chinese Academy of Sciences, Institute of Acoustics, Thinkit Speech Lab, Beijing, People's Republic of China

Source:

Journal of Voice | 2011 / Vol. 25 / No. 1

Funding:

National Natural Science Foundation of China;

Keywords:

Pathological voices; GMM-SVM; TO-NOISE RATIO; SPEAKER ADAPTATION; IDENTIFICATION;

DOI:

10.1016/j.jvoice.2009.08.002

Chinese Library Classification (CLC):

R36 [Pathology]; R76 [Otorhinolaryngology];

Discipline classification codes:

100104 ; 100213 ;

Abstract:

Acoustic features of vocal tract function are widely used in the detection of pathological voices. Classifying voices as normal or pathological from acoustic parameters is a useful aid in diagnosing voice diseases. In this respect, mel-frequency cepstral coefficients (MFCCs) have proved effective with traditional classifiers such as the Gaussian mixture model (GMM). However, classification accuracy can be improved further. In this article, a Gaussian mixture model supervector kernel support vector machine (GMM-SVM) classifier is compared with a GMM classifier for the detection of voice pathology. Voice recordings are selected from the Kay database to carry out the experiments. We found that a sustained vowel phonation can be classified as normal or pathological with an accuracy of 96.1%. Experimental results show that the equal error rate decreases from 8.0% for GMM to 4.6% for GMM-SVM.
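The abstract reports results as equal error rates (EER), the operating point at which the miss rate and the false-alarm rate of a detector coincide. The following is a minimal sketch of how an EER can be estimated from classifier scores. It is not the authors' evaluation code: the function name `equal_error_rate`, the convention that a higher score means "more likely pathological", and the toy scores are all assumptions made for illustration.

```python
def equal_error_rate(pathological_scores, normal_scores):
    """Estimate the EER by scanning every observed score as a threshold.

    Assumption: a higher score means "more likely pathological".
    Returns (eer, threshold) at the threshold where the miss rate and
    the false-alarm rate are closest to each other.
    """
    thresholds = sorted(set(pathological_scores) | set(normal_scores))
    best = None  # (gap, eer, threshold)
    for t in thresholds:
        # Miss: a pathological voice scored below the threshold.
        miss = sum(s < t for s in pathological_scores) / len(pathological_scores)
        # False alarm: a normal voice scored at or above the threshold.
        false_alarm = sum(s >= t for s in normal_scores) / len(normal_scores)
        gap = abs(miss - false_alarm)
        if best is None or gap < best[0]:
            best = (gap, (miss + false_alarm) / 2, t)
    return best[1], best[2]


# Hypothetical toy scores, not data from the paper.
pathological = [0.9, 0.8, 0.7, 0.35]
normal = [0.1, 0.2, 0.3, 0.75]
eer, threshold = equal_error_rate(pathological, normal)
print(f"EER = {eer:.2%} at threshold {threshold}")
```

With real data the two error rates rarely cross exactly at an observed score, so the sketch reports the average of the two rates at the closest point. Interpolating between adjacent thresholds is a common refinement.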

Pages: 38-43

Page count: 6

50 items in total

[1] Automatic Detection of Pathological Voices Using GMM-SVM Method
Wang, Xiang
Zhang, Jianping
Yan, Yonghong
PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOLS 1-4, 2009, : 525 - 528
[2] MiniVectors: an Improved GMM-SVM Approach for Speaker Verification
Anguera, Xavier
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2323 - 2326
[3] Innovative Automatic Discrimination Multimedia Documents for Indexing using Hybrid GMM-SVM Method
Turkia, Debabi
Souha, Bousselmi
Adnen, Cherif
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (01) : 274 - 279
[4] COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION
Trabelsi, I.
Ben Ayed, D.
Ellouze, N.
JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2016, 11 (09) : 1221 - 1233
[5] Phoneme Independent Pathological Voice Detection Using Wavelet Based MFCCs, GMM-SVM Hybrid Classifier
Vikram, C. M.
Umarani, K.
2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 929 - 934
[6] The GMM-SVM Supervector Approach for the Recognition of the Emotional Status from Speech
Schwenker, Friedhelm
Scherer, Stefan
Magdi, Yasmine M.
Palm, Guenther
ARTIFICIAL NEURAL NETWORKS - ICANN 2009, PT I, 2009, 5768 : 894 - +
[7] Identity authentication by sensed acoustic voices from a speaking person using an efficient GMM-SVM dual modeling framework
Ing-Jr Ding
Zih-Jheng Lin
Microsystem Technologies, 2018, 24 : 3 - 8
[8] Non-linguistic Vocalisation Recognition Based on Hybrid GMM-SVM Approach
Janicki, Artur
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 153 - 157
[9] Identity authentication by sensed acoustic voices from a speaking person using an efficient GMM-SVM dual modeling framework
Ding, Ing-Jr
Lin, Zih-Jheng
MICROSYSTEM TECHNOLOGIES-MICRO-AND NANOSYSTEMS-INFORMATION STORAGE AND PROCESSING SYSTEMS, 2018, 24 (01) : 3 - 8
[10] Automatic Laughter Detection in Spontaneous Speech Using GMM-SVM Method
Neuberger, Tilda
Beke, Andras
TEXT, SPEECH, AND DIALOGUE, TSD 2013, 2013, 8082 : 113 - 120
