Voice Pathology Detection and Classification Using MPEG-7 Audio Low-Level Features

被引:0
|
作者
Muhammad, Ghulam [1 ]
Melhem, Moutasem [1 ]
机构
[1] King Saud Univ, Dept Comp Engn, Coll Comp & Informat Sci, Riyadh 11543, Saudi Arabia
关键词
MPEG-7 audio features; dysphonia recognition; support vector machines; pathology binary classification; Fisher discrimination ratio; RECOGNITION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a new pathological voice detection and pathology classification method based on MPEG-7 audio low-level features is proposed. MPEG-7 features are originally used for multimedia indexing, which includes both video and audio. Indexing is related to event detection, and as pathological voice is a separate event than normal voice, we show that MPEG-7 audio low-level features can do very well in detecting pathological voices, as well as classifying the pathologies. The experiments are done on a subset of sustained vowel (namely, "AH") recordings from healthy and voice pathological subjects, from the MEET database. For classification, support vector machine (SVM) with 10-fold cross-validation is applied. The proposed method with MPEG7 audio features and SVM classification is evaluated on voice pathology detection, as well as pathology classification. The experiment results show that the proposed method outperforms some recent methods in the literature both in detection and in classification. The proposed method is able to achieve an accuracy of 99.994 0.0105% for detecting pathological voices and an accuracy of 100% for binary pathologies classifying.
引用
收藏
页码:3594 / 3598
页数:5
相关论文
共 50 条
  • [41] Automatic speaker change detection with the Bayesian Information Criterion using MPEG-7 features and a fusion scheme
    Kotti, Margarita
    Benetos, Emmanouil
    Kotropoulos, Constantine
    2006 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, PROCEEDINGS, 2006, : 1856 - +
  • [42] MPEG-7 based description schemes for multi-level video content classification
    Vakali, A
    Hacid, MS
    Elmagarmid, A
    IMAGE AND VISION COMPUTING, 2004, 22 (05) : 367 - 378
  • [43] Vocal characteristics classification of audio segments: An investigation of the influence of accompaniment music on low-level features
    Gaertner, Daniel
    Dittmar, Christian
    EIGHTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2009, : 583 - 589
  • [44] Scene classification from low-level features
    Oliva, AP
    Torralba, A
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 1999, 40 (04) : S414 - S414
  • [45] Violence detection in surveillance video using low-level features
    Zhou, Peipei
    Ding, Qinghai
    Luo, Haibo
    Hou, Xinglin
    PLOS ONE, 2018, 13 (10):
  • [46] Image orientation detection using low-level features and faces
    Ciocca, Gianluigi
    Cusano, Claudio
    Schettini, Raimondo
    DIGITAL PHOTOGRAPHY VI, 2010, 7537
  • [47] Pap smear cell image classification using global MPEG-7 descriptors
    Luz H Camargo
    Gloria Diaz
    Eduardo Romero
    Diagnostic Pathology, 8 (Suppl 1)
  • [48] Using visual features based on MPEG-7 and deep learning for movie recommendation
    Deldjoo, Yashar
    Elahi, Mehdi
    Quadrana, Massimo
    Cremonesi, Paolo
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2018, 7 (04) : 207 - 219
  • [49] Classification of Printed Gujarati Characters Using Low-Level Stroke Features
    Goswami, Mukesh M.
    Mitra, Suman K.
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2016, 15 (04)
  • [50] A method for classification of scenery documentary using MPEG-7 edge histogram descriptor
    Cao, JR
    Cai, A
    Proceedings of 2005 IEEE International Workshop on VLSI Design and Video Technology, 2005, : 105 - 108