Score Level versus Audio Level Fusion for Voice Pathology Detection on the Saarbrucken Voice Database

Authors
Martinez, David [1]
Lleida, Eduardo [1]
Ortega, Alfonso [1]
Miguel, Antonio [1]
Affiliation
[1] Univ Zaragoza, Aragon Inst Engn Res I3A, E-50009 Zaragoza, Spain
Keywords
Pathological Voice Detection; Saarbrucken Voice Database; GMM; Fusion; MultiFocal toolkit
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This article presents a set of experiments on pathological voice detection using the Saarbrucken Voice Database (SVD). The SVD is freely available online and contains voice recordings covering a range of pathologies, both functional and organic. It includes recordings from more than 2000 speakers in which the sustained vowels /a/, /i/, and /u/ are pronounced with normal, low, high, and low-high-low intonations. This variety of sounds makes it possible to design different experiments. This paper compares two systems: one in which all vowels and intonations are pooled to train a single model per class, and one in which a separate model per class is trained for each vowel and intonation and the scores of the resulting subsystems are fused at the end. We call the first approach audio level fusion and the second score level fusion. For classification, a generative Gaussian mixture model is used, trained on mel-frequency cepstral coefficients, harmonics-to-noise ratio, normalized noise energy, and glottal-to-noise excitation ratio. Score level fusion is shown to be far more effective than audio level fusion.
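The difference between the two fusion strategies described in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: a single 1-D Gaussian per class stands in for the Gaussian mixture model, one made-up scalar stands in for the acoustic feature vector, only two hypothetical conditions (`a_normal`, `i_normal`) are used instead of every vowel and intonation, and subsystem scores are fused by a plain equal-weight average rather than by a trained fusion. The toy numbers are invented for illustration and say nothing about the paper's results.

```python
import math
from statistics import fmean, pvariance

def fit_gaussian(values):
    """Fit a 1-D Gaussian (a one-component stand-in for a GMM)."""
    mean = fmean(values)
    return mean, max(pvariance(values, mean), 1e-6)  # floor the variance

def log_pdf(x, model):
    mean, var = model
    return -0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)

def llr(x, models):
    """Log-likelihood ratio; positive favours the pathological class."""
    return log_pdf(x, models["patho"]) - log_pdf(x, models["healthy"])

def train_audio_level(train):
    """Pool every condition and fit ONE model per class."""
    pooled = {"healthy": [], "patho": []}
    for per_class in train.values():
        for label, values in per_class.items():
            pooled[label].extend(values)
    return {label: fit_gaussian(values) for label, values in pooled.items()}

def train_score_level(train):
    """Fit one model per class for EACH condition (one subsystem each)."""
    return {
        cond: {label: fit_gaussian(values) for label, values in per_class.items()}
        for cond, per_class in train.items()
    }

def audio_level_score(recordings, pooled_models):
    """Score every recording with the pooled models and average."""
    return fmean([llr(x, pooled_models) for x in recordings.values()])

def score_level_score(recordings, subsystems):
    """Score each recording with its own subsystem, then fuse the scores.
    Equal-weight averaging is an assumption made for this sketch."""
    return fmean([llr(x, subsystems[cond]) for cond, x in recordings.items()])

# Invented toy features, keyed by hypothetical condition names.
train = {
    "a_normal": {"healthy": [0.0, 0.2, -0.1, 0.1], "patho": [1.0, 1.2, 0.9, 1.1]},
    "i_normal": {"healthy": [5.0, 5.1, 4.9, 5.2], "patho": [4.0, 4.1, 3.9, 4.2]},
}
test_speaker = {"a_normal": 1.05, "i_normal": 4.05}  # resembles the "patho" data

audio_score = audio_level_score(test_speaker, train_audio_level(train))
fused_score = score_level_score(test_speaker, train_score_level(train))
print(f"audio level fusion score: {audio_score:.3f}")
print(f"score level fusion score: {fused_score:.3f}")
```

In this toy setup the class difference points in opposite directions for the two conditions, so pooling blurs the per-class models while the per-condition subsystems keep them sharp. That is one plausible intuition for why per-condition modelling can help, offered here only as an illustration.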
Pages: 110 / +
Number of pages: 3