Voice Pathology Detection and Classification Using MPEG-7 Audio Low-Level Features

被引:0
|
作者
Muhammad, Ghulam [1 ]
Melhem, Moutasem [1 ]
机构
[1] King Saud Univ, Dept Comp Engn, Coll Comp & Informat Sci, Riyadh 11543, Saudi Arabia
关键词
MPEG-7 audio features; dysphonia recognition; support vector machines; pathology binary classification; Fisher discrimination ratio; RECOGNITION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a new pathological voice detection and pathology classification method based on MPEG-7 audio low-level features is proposed. MPEG-7 features are originally used for multimedia indexing, which includes both video and audio. Indexing is related to event detection, and as pathological voice is a separate event than normal voice, we show that MPEG-7 audio low-level features can do very well in detecting pathological voices, as well as classifying the pathologies. The experiments are done on a subset of sustained vowel (namely, "AH") recordings from healthy and voice pathological subjects, from the MEET database. For classification, support vector machine (SVM) with 10-fold cross-validation is applied. The proposed method with MPEG7 audio features and SVM classification is evaluated on voice pathology detection, as well as pathology classification. The experiment results show that the proposed method outperforms some recent methods in the literature both in detection and in classification. The proposed method is able to achieve an accuracy of 99.994 0.0105% for detecting pathological voices and an accuracy of 100% for binary pathologies classifying.
引用
收藏
页码:3594 / 3598
页数:5
相关论文
共 50 条
  • [31] A Cartoon Image Classification System Using MPEG-7 Descriptors
    Kim, Junghyun
    Baik, Sung Wook
    Kim, Kangseok
    Jung, Changduk
    Kim, Wonil
    ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PT II, 2011, 7003 : 368 - +
  • [32] ECHOCARDIOGRAM VIEW CLASSIFICATION USING LOW-LEVEL FEATURES
    Wu, Hui
    Bowers, Dustin M.
    Huynh, Toan T.
    Souvenir, Richard
    2013 IEEE 10TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2013, : 752 - 755
  • [33] Musical Style Classification Using Low-Level Features
    Buzzanca, Armando
    Castellano, Giovanna
    Fanelli, Anna Maria
    ACTIVE MEDIA TECHNOLOGY, PROCEEDINGS, 2009, 5820 : 288 - 298
  • [34] Significance of MPEG-7 textural features for improved mass detection in mammography
    Eltonsy, Nevine H.
    Tourassi, Georgia D.
    Fadeev, Aleksey
    Elmaghraby, Adel S.
    2006 28th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Vols 1-15, 2006, : 3554 - 3557
  • [35] User's Web Page Aesthetics Opinion: A Matter of Low-Level Image Descriptors Based on MPEG-7
    Uribe, Silvia
    Alvarez, Federico
    Manuel Menendez, Jose
    ACM TRANSACTIONS ON THE WEB, 2017, 11 (01)
  • [36] Beat tracking of musical performances using low-level audio features
    Sethares, WA
    Morris, RD
    Sethares, JC
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (02): : 275 - 285
  • [37] Personal Spoken Sentence Retrieval Using Two-Level Feature Matching and MPEG-7 Audio LLDs
    Lin, Po-Chuan
    Wang, Jhing-Fa
    Wang, Jia-Ching
    Huang, Jun-Jin
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2009, 25 (04) : 1221 - 1238
  • [38] Acoustic Events Detection Using MFCC and MPEG-7 Descriptors
    Vozarikova, Eva
    Juhar, Jozef
    Cizmar, Anton
    MULTIMEDIA COMMUNICATIONS, SERVICES, AND SECURITY, 2011, 149 : 191 - 197
  • [39] Indexing of NFL video using MPEG-7 descriptors and MFCC features
    Quadri, SG
    Krishnan, S
    Guan, L
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 429 - 432
  • [40] Score Level versus Audio Level Fusion for Voice Pathology Detection on the Saarbrucken Voice Database
    Martinez, David
    Lleida, Eduardo
    Ortega, Alfonso
    Miguel, Antonio
    ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, 2012, 328 : 110 - +