SPEECH BREATHING ESTIMATION USING DEEP LEARNING METHODS

被引:0
|
作者
Nallanthighal, Venkata Srikanth [1 ,2 ]
Harma, Aki [1 ]
Strik, Helmer [2 ]
机构
[1] Philips Res, Eindhoven, Netherlands
[2] Radboud Univ Nijmegen, Ctr Language Studies CLS, Nijmegen, Netherlands
基金
欧盟地平线“2020”;
关键词
Speech breathing; signal processing; speech technology; deep neural networks; Multi task learning; RIB CAGE;
D O I
10.1109/icassp40776.2020.9053753
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Breathing is the primary mechanism for maintaining the sub-glottal pressure for speech production. Speech can be seen as a systematic outflow of air during exhalation characterized by linguistic content and prosodic factors. Thus, sensing respiratory dynamics from the speech is plausible. In this paper, we explore techniques for sensing breathing from speech using deep learning architectures including multi-task learning approaches. Estimating the breathing pattern from the speech would give us information about the respiration rate, breathing capacity and thus enable us to understand the pathological condition of a person using one's speech. Training and evaluation of our model on our database of breathing signal and speech for 40 subjects yielded a sensitivity of 0.88 for breath event detection and 5.6 % error for breathing rate estimation.
引用
收藏
页码:1140 / 1144
页数:5
相关论文
共 50 条
  • [1] Yi Language Speech Recognition using Deep Learning Methods
    Chen, Ziyan
    Yang, Hongwu
    [J]. PROCEEDINGS OF 2020 IEEE 4TH INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2020), 2020, : 1064 - 1068
  • [2] Speech enhancement with noise estimation and filtration using deep learning models
    Kantamaneni, Sravanthi
    Charles, A.
    Babu, T. Ranga
    [J]. THEORETICAL COMPUTER SCIENCE, 2023, 941 : 14 - 28
  • [3] Speech enhancement with noise estimation and filtration using deep learning models
    Kantamaneni, Sravanthi
    Charles, A.
    Babu, T. Ranga
    [J]. THEORETICAL COMPUTER SCIENCE, 2023, 941 : 14 - 28
  • [4] On Reference Signal Estimation from Noisy Speech Using Deep Learning for Intelligibility Estimation
    Takahashi, Hiroto
    Kondo, Kazuhiro
    [J]. 2018 IEEE 7TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE 2018), 2018, : 347 - 348
  • [5] RUL Estimation Enhancement using Hybrid Deep Learning Methods
    Remadna, Ikram
    Terrissa, Labib Sadek
    Ayad, Soheyb
    Zerhouni, Noureddine
    [J]. INTERNATIONAL JOURNAL OF PROGNOSTICS AND HEALTH MANAGEMENT, 2021, 12 (01)
  • [6] Classification and risk estimation of osteoarthritis using deep learning methods
    Patil, Aparna R.
    Salunkhe, Satish Sampatrao
    [J]. Measurement: Sensors, 2024, 35
  • [7] Hate Speech Detection in Social Networks using Machine Learning and Deep Learning Methods
    Toktarova, Aigerim
    Syrlybay, Dariga
    Myrzakhmetova, Bayan
    Anuarbekova, Gulzat
    Rakhimbayeva, Gulbarshin
    Zhylanbaeva, Balkiya
    Suieuova, Nabat
    Kerimbekov, Mukhtar
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (05) : 396 - 406
  • [8] Gender and Age Estimation Methods Based on Speech Using Deep Neural Networks
    Kwasny, Damian
    Hemmerling, Daria
    [J]. SENSORS, 2021, 21 (14)
  • [9] Non-intrusive speech intelligibility estimation using deep learning with speech enhancement and convolutional layers
    Nakazawa, Kazushi
    Kondo, Kazuhiro
    [J]. PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1047 - 1052
  • [10] MEAN TROPOSPHERIC TEMPERATURE ESTIMATION USING DEEP LEARNING AND ENSEMBLE METHODS
    Brum, Diego
    Rofatto, Vinicius Francisco
    Gonzaga, Luiz, Jr.
    Pena, Rafaela de Oliveira
    Sapucci, Luiz Fernando
    Veronez, Mauricio Roberto
    [J]. 2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 6658 - 6661