Automatic Wheeze Segmentation Using Harmonic-Percussive Source Separation and Empirical Mode Decomposition

被引:7
|
作者
Rocha, Bruno Machado [1 ]
Pessoa, Diogo [1 ]
Marques, Alda [2 ,3 ]
de Carvalho, Paulo [1 ]
Paiva, Rui Pedro [1 ]
机构
[1] Univ Coimbra, Ctr Informat & Syst, Dept Informat Engn, P-3030290 Coimbra, Portugal
[2] Univ Aveiro, Sch Hlth Sci ESSUA, Resp Res & Rehabil Lab Lab3R, P-3810193 Aveiro, Portugal
[3] Univ Aveiro, Inst Biomed IBiMED, P-3810193 Aveiro, Portugal
基金
欧盟地平线“2020”;
关键词
Recording; Harmonic analysis; Power harmonic filters; Spectrogram; Empirical mode decomposition; Source separation; Indexes; Respiratory sound analysis; expert systems; harmonic-percussive source separation; empirical mode decomposition; sound event detection; ADVENTITIOUS RESPIRATORY SOUNDS;
D O I
10.1109/JBHI.2023.3248265
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Wheezes are adventitious respiratory sounds commonly present in patients with respiratory conditions. The presence of wheezes and their time location are relevant for clinical reasons, such as understanding the degree of bronchial obstruction. Conventional auscultation is usually employed to analyze wheezes, but remote monitoring has become a pressing need during recent years. Automatic respiratory sound analysis is required to reliably perform remote auscultation. In this work we propose a method for wheeze segmentation. Our method starts by decomposing a given audio excerpt into intrinsic mode frequencies using empirical mode decomposition. Then, we apply harmonic-percussive source separation to the resulting audio tracks and get harmonic-enhanced spectrograms, which are processed to obtain harmonic masks. Subsequently, a series of empirically derived rules are applied to find wheeze candidates. Finally, the candidates stemming from the different audio tracks are merged and median filtered. In the evaluation stage, we compare our method to three baselines on the ICBHI 2017 Respiratory Sound Database, a challenging dataset containing various noise sources and background sounds. Using the full dataset, our method outperforms the baselines, achieving an F1 of 41.9%. Our method's performance is also better than the baselines across several stratified results focusing on five variables: recording equipment, age, sex, body-mass index, and diagnosis. We conclude that wheeze segmentation has not been solved for real life scenario applications. Adaptation of existing systems to demographic characteristics might be a promising step in the direction of algorithm personalization, which would make automatic wheeze segmentation clinically viable.
引用
收藏
页码:1926 / 1934
页数:9
相关论文
共 50 条
  • [1] UNIFYING LOCAL AND GLOBAL METHODS FOR HARMONIC-PERCUSSIVE SOURCE SEPARATION
    Dittmar, Christian
    Lopez-Serrano, Patricio
    Mueller, Meinard
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 176 - 180
  • [2] Adversarial Unsupervised Domain Adaptation for Harmonic-Percussive Source Separation
    Lordelo, C.
    Benetos, E.
    Dixon, S.
    Ahlback, S.
    Ohlsson, P.
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 81 - 85
  • [3] HARMONIC-PERCUSSIVE SOURCE SEPARATION WITH DEEP NEURAL NETWORKS AND PHASE RECOVERY
    Drossos, Konstantinos
    Magron, Paul
    Mimilakis, Stylianos Ioannis
    Virtanen, Tuomas
    2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 421 - 425
  • [4] Harmonic-Percussive Source Separation of Polyphonic Music by Suppressing Impulsive Noise Events
    Reddy, Gurunath M.
    Rao, K. Sreenivasa
    Das, Partha Pratim
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 831 - 835
  • [5] Music Genre Recognition Using Spectrograms with Harmonic-Percussive Sound Separation
    Aguiar, Rafael de Lima
    da Costa, Yandre Maldonado e Gomes
    Nanni, Loris
    PROCEEDINGS OF THE 2016 35TH INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY (SCCC), 2016,
  • [6] Exploiting Continuity/Discontinuity of Basis Vectors in Spectrogram Decomposition for Harmonic-Percussive Sound Separation
    Park, Jeongsoo
    Shin, Jaeyoung
    Lee, Kyogu
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (05) : 1061 - 1074
  • [7] ADVERSARIAL AUDIO SYNTHESIS USING A HARMONIC-PERCUSSIVE DISCRIMINATOR
    Lee, Jihyun
    Lim, Hyungseob
    Lee, Chanwoo
    Jang, Inseon
    Kang, Hong-Goo
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 961 - 965
  • [8] Improving Time-Scale Modification of Music Signals Using Harmonic-Percussive Separation
    Driedger, Jonathan
    Mueller, Meinard
    Ewert, Sebastian
    IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (01) : 105 - 109
  • [9] Harmonic and Percussive Source Separation Using a Convolutional Auto Encoder
    Lim, Wootaek
    Lee, Taejin
    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 1804 - 1808
  • [10] Large-Scale Multi-Channel Transformer-based Speaker Identification with Knowledge Transfer Using Harmonic-Percussive Source Separation
    Xie, Yuejiao
    Tang, Jingjing
    Yang, Nan
    Pun, Man-On
    2022 31ST WIRELESS AND OPTICAL COMMUNICATIONS CONFERENCE (WOCC), 2022, : 1 - 5