Score Level versus Audio Level Fusion for Voice Pathology Detection on the Saarbrucken Voice Database

被引:0
|
作者
Martinez, David [1 ]
Lleida, Eduardo [1 ]
Ortega, Alfonso [1 ]
Miguel, Antonio [1 ]
机构
[1] Univ Zaragoza, Aragon Inst Engn Res I3A, E-50009 Zaragoza, Spain
关键词
Pathological Voice Detection; Saarbrucken Voice Database; GMM; Fusion; MultiFocal toolkit;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The article presents a set of experiments on pathological voice detection over the Saarbrucken Voice Database (SVD). The SVD is freely available online containing a collection of voice recordings of different pathologies, both functional and organic. It includes recordings for more than 2000 speakers in which sustained vowels /a/, /i/, and /u/ are pronounced with normal, low, high, and low-high-low intonations. This variety of sounds makes possible to set different experiments, and in this paper a comparison between the performance of a system where all the vowels and intonations are pooled together to train a single model per class, and a system where a different model per class is trained for each vowel and intonation, and the scores of each subsystem are fused at the end, is conducted. The first approach is what we call audio level fusion, and the second is what we call score level fusion. For classification, a generative Gaussian mixture model trained with mel-frequency cepstral coefficients, harmonics-to-noise ratio, normalized noise energy and glottal-to-noise excitation ratio, is used. It is shown that the score level fusion is far more effective than the audio level fusion.
引用
收藏
页码:110 / +
页数:3
相关论文
共 50 条
  • [21] A score-level fusion benchmark database for biometric authentication
    Poh, N
    Bengio, S
    AUDIO AND VIDEO BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2005, 3546 : 1059 - 1070
  • [22] Voice Pathology Detection By Fuzzy Logic
    Panek, Dania
    Skalski, Andrzej
    Gajda, Janusz
    2015 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE (I2MTC), 2015, : 289 - 293
  • [23] The accuracy of an Online Sequential Extreme Learning Machine in detecting voice pathology using the Malaysian Voice Pathology Database
    Nur Ain Nabila Za’im
    Fahad Taha AL-Dhief
    Mawaddah Azman
    Majid Razaq Mohamed Alsemawi
    Nurul Mu′azzah Abdul Latiff
    Marina Mat Baki
    Journal of Otolaryngology - Head & Neck Surgery, 52
  • [24] Score Function for Voice Activity Detection
    Sole-Casals, Jordi
    Marti-Puig, Pere
    Reig-Bolano, Ramon
    Zaiats, Vladimir
    ADVANCES IN NONLINEAR SPEECH PROCESSING, 2010, 5933 : 76 - 83
  • [25] The accuracy of an Online Sequential Extreme Learning Machine in detecting voice pathology using the Malaysian Voice Pathology Database
    Za'im, Nur Ain Nabila
    AL-Dhief, Fahad Taha
    Azman, Mawaddah
    Alsemawi, Majid Razaq Mohamed
    Abdul Latiff, Nurul Mu'azzah
    Baki, Marina Mat
    JOURNAL OF OTOLARYNGOLOGY-HEAD & NECK SURGERY, 2023, 52 (01)
  • [26] Voice Pathology Detection Using a Two-Level Classifier Based on Combined CNN-RNN Architecture
    Ksibi, Amel
    Hakami, Nada Ali
    Alturki, Nazik
    Asiri, Mashael M. M.
    Zakariah, Mohammed
    Ayadi, Manel
    SUSTAINABILITY, 2023, 15 (04)
  • [27] Fusion of Low-Level Descriptors of Digital Voice Recordings for Dementia Assessment
    Karjadi, Cody
    Xue, Chonghua
    Cordella, Claire
    Kiran, Swathi
    Paschalidis, Ioannis Ch.
    Au, Rhoda
    Kolachalama, Vijaya B.
    JOURNAL OF ALZHEIMERS DISEASE, 2023, 96 (02) : 507 - 514
  • [28] Voice Pathology Detection using Multiresolution Technique
    Muhammad, Ghulam
    Alsulaiman, Mansour
    Mahmood, Awais
    Almojali, Malak
    Abdelkader, Bencherif Mohamed
    UKSIM-AMSS EIGHTH EUROPEAN MODELLING SYMPOSIUM ON COMPUTER MODELLING AND SIMULATION (EMS 2014), 2014, : 185 - 189
  • [29] New trends in voice pathology detection and classification
    Manfredi, Claudia
    Kob, Malte
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2009, 4 (03) : 171 - 172
  • [30] Spectral perturbation parameters for voice pathology detection
    Gómez, P
    Díaz, F
    Lázaro, C
    Murphy, K
    Martínez, R
    Rodellar, V
    Alvarez, A
    ISSCS 2005: International Symposium on Signals, Circuits and Systems, Vols 1 and 2, Proceedings, 2005, : 299 - 302