Automated Detection of Voice Disorder in the Saarbrucken Voice Database: Effects of Pathology Subset and Audio Materials

被引:13
|
作者
Huckvale, Mark [1 ]
Buciuleac, Catinca [1 ]
机构
[1] UCL, Speech Hearing & Phonet Sci, London, England
来源
关键词
voice disorders; machine learning; health applications;
D O I
10.21437/Interspeech.2021-1507
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
The Saarbrucken Voice Database contains speech and simultaneous electroglottography recordings of 1002 speakers exhibiting a wide range of voice disorders, together with recordings of 851 controls. Previous studies have used this database to build systems for automated detection of voice disorders and for differential diagnosis. These studies have varied considerably in the subset of pathologies tested, the audio materials analyzed, the cross-validation method used and the performance metric reported. This variation has made it hard to determine the most promising approaches to the problem of detecting voice disorders. In this study we reimplement three recently published systems that have been trained to detect pathology using the SVD and compare their performance on the same pathologies with the same audio materials using a common cross-validation protocol and performance metric. We show that under this approach, there is much less difference in performance across systems than in their original publication. We also show that voice disorder detection on the basis of a short phrase gives similar performance to that based on a sequence of vowels of different pitch. Our evaluation protocol may be useful for future studies on voice disorder detection with the SVD.
引用
收藏
页码:1399 / 1403
页数:5
相关论文
共 50 条
  • [1] Score Level versus Audio Level Fusion for Voice Pathology Detection on the Saarbrucken Voice Database
    Martinez, David
    Lleida, Eduardo
    Ortega, Alfonso
    Miguel, Antonio
    [J]. ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, 2012, 328 : 110 - +
  • [2] Voice Pathology Detection on the Saarbrucken Voice Database with Calibration and Fusion of Scores Using MultiFocal Toolkit
    Martinez, David
    Lleida, Eduardo
    Ortega, Alfonso
    Miguel, Antonio
    Villalba, Jesus
    [J]. ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, 2012, 328 : 99 - +
  • [3] Voice Pathology Detection with MDVP Parameters Using Arabic Voice Pathology Database
    Al-nasheri, Ahmed
    Ali, Zulfiqar
    Muhammad, Ghulam
    Alsulaiman, Mansour
    Almalki, Khalid H.
    Mesallam, Tamer A.
    Farahat, Mohamed
    [J]. 2015 5TH NATIONAL SYMPOSIUM ON INFORMATION TECHNOLOGY: TOWARDS NEW SMART WORLD (NSITNSW), 2015,
  • [4] MPEG-7 Audio Features Based Voice Pathology Detection
    Muhammad, Ghulam
    Melhem, Moutasem
    [J]. 2013 IEEE EUROCON, 2013, : 1611 - 1618
  • [5] Effects of Audio Compression in Automatic Detection of Voice Pathologies
    Saenz-Lechon, Nicolas
    Osma-Ruiz, Victor
    Godino-Llorente, Juan I.
    Blanco-Velasco, Manuel
    Cruz-Roldan, Fernando
    Arias-Londono, Julian D.
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2008, 55 (12) : 2831 - 2835
  • [6] Voice pathology detection based on the modified voice contour and SVM
    Ali, Zulfiqar
    Alsulaiman, Mansour
    Elamvazuthi, Irraivan
    Muhammad, Ghulam
    Mesallam, Tamer A.
    Farahat, Mohamed
    Malki, Khalid H.
    [J]. BIOLOGICALLY INSPIRED COGNITIVE ARCHITECTURES, 2016, 15 : 10 - 18
  • [7] Normalized Modulation Spectral Features for Cross-Database Voice Pathology Detection
    Markaki, Maria
    Stylianou, Yannis
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 952 - 955
  • [8] Voice Pathology Detection By Fuzzy Logic
    Panek, Dania
    Skalski, Andrzej
    Gajda, Janusz
    [J]. 2015 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE (I2MTC), 2015, : 289 - 293
  • [9] The accuracy of an Online Sequential Extreme Learning Machine in detecting voice pathology using the Malaysian Voice Pathology Database
    Nur Ain Nabila Za’im
    Fahad Taha AL-Dhief
    Mawaddah Azman
    Majid Razaq Mohamed Alsemawi
    Nurul Mu′azzah Abdul Latiff
    Marina Mat Baki
    [J]. Journal of Otolaryngology - Head & Neck Surgery, 52
  • [10] The accuracy of an Online Sequential Extreme Learning Machine in detecting voice pathology using the Malaysian Voice Pathology Database
    Za'im, Nur Ain Nabila
    AL-Dhief, Fahad Taha
    Azman, Mawaddah
    Alsemawi, Majid Razaq Mohamed
    Abdul Latiff, Nurul Mu'azzah
    Baki, Marina Mat
    [J]. JOURNAL OF OTOLARYNGOLOGY-HEAD & NECK SURGERY, 2023, 52 (01)