Early Detection of Severe Apnoea through Voice Analysis and Automatic Speaker Recognition Techniques

被引:0
|
作者
Fernandez, Ruben [1 ]
Luis Blanco, Jose [1 ]
Diaz, David [1 ]
Hernandez, Luis A. [1 ]
Lopez, Eduardo [1 ]
Alcazar, Jose [2 ]
机构
[1] Univ Politecn Madrid, Dept GAPS, Avda Complutense 30, E-28040 Madrid, Spain
[2] Hosp Torrecardenas, Respirat Dept, E-04009 Almeria, Spain
关键词
Apnoea; Automatic speaker recognition; GMM; Nasalization; OBSTRUCTIVE SLEEP-APNEA; MODELS;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This study is part of an on-going collaborative effort between the medical and the signal processing communities to promote research on applying voice analysis and Automatic Speaker Recognition techniques (ASR) for the automatic diagnosis of patients with severe obstructive sleep apnoea (OSA). Early detection of severe apnoea cases is important so that patients can receive early treatment. Effective ASR-based diagnosis could dramatically cut medical testing time. Working with a carefully designed speech database of healthy and apnoea subjects, we present and discuss the possibilities of using generative Gaussian Mixture Models (GMMs), generally used in ASR systems, to model distinctive apnoea voice characteristics (i.e. abnormal nasalization). Finally, we present experimental findings regarding the discriminative power of speaker recognition techniques applied to severe apnoea detection. We have achieved an 81.25 % correct classification rate, which is very promising and underpins the interest in this line of inquiry.
引用
收藏
页码:245 / +
页数:3
相关论文
共 50 条
  • [31] ON AUTOMATIC VOICE CASTING FOR EXPRESSIVE SPEECH: SPEAKER RECOGNITION VS. SPEECH CLASSIFICATION
    Obin, Nicolas
    Roebel, Axel
    Bachman, Gregoire
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [32] Minimum of Information Divergence Criterion for Signals with Tuning to Speaker Voice in Automatic Speech Recognition
    Savchenko V.V.
    Radioelectronics and Communications Systems, 2020, 63 (01) : 42 - 54
  • [33] Automatic Speech Recognition systems errors for accident-prone sleepiness detection through voice
    Martin, Vincent P.
    Rouas, Jean-Luc
    Boyer, Florian
    Philip, Pierre
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 541 - 545
  • [34] Comparative Analysis of Speaker Recognition System Based on Voice Activity Detection Technique, MFCC and PLP Features
    Kalia, Akanksha
    Sharma, Shikar
    Pandey, Saurabh Kumar
    Jadoun, Vinay Kumar
    Das, Madhulika
    INTELLIGENT COMPUTING TECHNIQUES FOR SMART ENERGY SYSTEMS, 2020, 607 : 781 - 787
  • [35] SPEAKER-INDEPENDENT WORD RECOGNITION TECHNIQUES FOR CONTROL OF A VOICE MESSAGING SYSTEM.
    Gupta, V.
    Mermelstein, P.
    U.S. Symposium on Rock Mechanics, 1981, : 233 - 238
  • [36] Robust speaker recognition based on level-building voice activity detection
    Xie, Yan-Lu
    Zhang, Jing-Song
    Liu, Ming-Hui
    Huang, Zhong-Wei
    Shenzhen Daxue Xuebao (Ligong Ban)/Journal of Shenzhen University Science and Engineering, 2012, 29 (04): : 328 - 334
  • [37] The Delta-Phase Spectrum With Application to Voice Activity Detection and Speaker Recognition
    McCowan, Iain
    Dean, David
    McLaren, Mitchell
    Vogt, Robert
    Sridharan, Sridha
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (07): : 2026 - 2038
  • [38] Voice Activation System Using Acoustic Event Detection and Keyword/Speaker Recognition
    Cho, Namgook
    Kim, Taeyoon
    Shin, Sangwook
    Kim, Eun-Kyoung
    IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE 2011), 2011, : 21 - 22
  • [39] Analysis of Compressed Speech Signals in an Automatic Speaker Recognition System
    Metzger, Richard A.
    Doherty, John F.
    Jenkins, David M.
    2015 49TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2015,
  • [40] Tuning the performance of automatic speaker recognition in different conditions: effects of language and simulated voice disguise
    Skarnitzl, Radek
    Asiaee, Maral
    Nourbakhsh, Mandana
    INTERNATIONAL JOURNAL OF SPEECH LANGUAGE AND THE LAW, 2019, 26 (02) : 209 - 229