Voice Pathology Detection and Classification Using MPEG-7 Audio Low-Level Features

Cited by: 0

Authors:

Muhammad, Ghulam [1]

Melhem, Moutasem [1]

Affiliations:

[1] King Saud Univ, Dept Comp Engn, Coll Comp & Informat Sci, Riyadh 11543, Saudi Arabia

Source:

14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 | 2013

Keywords:

MPEG-7 audio features; dysphonia recognition; support vector machines; pathology binary classification; Fisher discrimination ratio; RECOGNITION;

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper proposes a new method for pathological voice detection and pathology classification based on MPEG-7 audio low-level features. MPEG-7 features were originally designed for multimedia indexing, which covers both video and audio. Indexing is related to event detection, and because a pathological voice is a different event from a normal voice, we show that MPEG-7 audio low-level features perform very well in detecting pathological voices as well as in classifying the pathologies. The experiments use a subset of sustained vowel (namely, "AH") recordings from healthy subjects and subjects with voice pathologies, taken from the MEEI database. For classification, a support vector machine (SVM) with 10-fold cross-validation is applied. The proposed method, combining MPEG-7 audio features with SVM classification, is evaluated on both voice pathology detection and pathology classification. The experimental results show that the proposed method outperforms some recent methods in the literature in both detection and classification. It achieves an accuracy of 99.994 ± 0.0105% for detecting pathological voices and an accuracy of 100% for binary pathology classification.
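The evaluation protocol named in the abstract, 10-fold cross-validation with accuracy reported as mean ± standard deviation, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not the authors' implementation: the data are synthetic Gaussian vectors rather than MPEG-7 descriptors, the folds are not stratified, and a nearest-class-mean rule stands in for the SVM so that the example needs only the Python standard library. All function names are hypothetical.

```python
import random
import statistics


def k_fold_indices(n_samples, k=10, seed=0):
    """Shuffle the sample indices and deal them into k nearly equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def fit_class_means(train_x, train_y):
    """Stand-in model (NOT an SVM): one mean feature vector per class."""
    means = {}
    for label in sorted(set(train_y)):
        rows = [x for x, y in zip(train_x, train_y) if y == label]
        means[label] = [statistics.fmean(col) for col in zip(*rows)]
    return means


def predict(means, x):
    """Return the label whose class mean is closest in squared Euclidean distance."""
    return min(
        means,
        key=lambda label: sum((a - b) ** 2 for a, b in zip(x, means[label])),
    )


def cross_validate(features, labels, k=10, seed=0):
    """Return (mean accuracy, standard deviation), both in percent, over k folds."""
    accuracies = []
    for test_idx in k_fold_indices(len(features), k, seed):
        held_out = set(test_idx)
        train_x = [features[i] for i in range(len(features)) if i not in held_out]
        train_y = [labels[i] for i in range(len(labels)) if i not in held_out]
        means = fit_class_means(train_x, train_y)
        correct = sum(predict(means, features[i]) == labels[i] for i in test_idx)
        accuracies.append(100.0 * correct / len(test_idx))
    return statistics.fmean(accuracies), statistics.stdev(accuracies)


def make_synthetic_data(n_per_class=50, n_features=4, seed=1):
    """Synthetic two-class data; label 0 = 'normal', 1 = 'pathological' (hypothetical)."""
    rng = random.Random(seed)
    features, labels = [], []
    for label, centre in ((0, 0.0), (1, 3.0)):
        for _ in range(n_per_class):
            features.append([rng.gauss(centre, 1.0) for _ in range(n_features)])
            labels.append(label)
    return features, labels


if __name__ == "__main__":
    X, y = make_synthetic_data()
    mean_acc, std_acc = cross_validate(X, y, k=10)
    print(f"accuracy: {mean_acc:.3f} ± {std_acc:.4f}%")
```

Each sample is held out exactly once, so the reported mean and standard deviation summarize ten independent test folds. In practice the stand-in classifier would be replaced by an SVM from a machine-learning library, and the folds would usually be stratified by class.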


Pages: 3594-3598

Page count: 5
