Detecting Cough Recordings in Crowdsourced Data Using CNN-RNN

被引:1
|
作者
Sharan, Roneel V. [1 ]
Xiong, Hao [1 ]
Berkovsky, Shlomo [1 ]
机构
[1] Macquarie Univ, Australian Inst Hlth Innovat, Sydney, NSW, Australia
关键词
cough sound; crowdsourced; deep learning; melspectrogram; respiratory diseases;
D O I
10.1109/BHI56158.2022.9926896
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The sound of cough is an important indicator of the condition of the respiratory system. Automatic cough sound evaluation can aid the diagnosis of respiratory diseases. Large crow dsourced cough sound datasets have recently been used by several groups around the world to develop cough classification models. However, not all recordings in these datasets contain cough sounds. As such, it is important to screen the recordings for the presence of cough sounds before developing cough classification models. This work proposes a method to screen crowdsourced audio recordings for cough sounds using deep learning methods. The proposed approach divides the audio recording into overlapping frames and converts each frame into a mel-spectrogram representation. A pretrained convolutional neural network for audio classification is trained to learn the spectral characteristics of cough and non-cough frames from its mel-spectrogram representation. It is combined with a recurrent neural network to learn the dependencies between the sequence of frames. The proposed method is evaluated on 400 crowdsourced audio recordings, manually annotated as cough or non-cough. An accuracy of 0.9800 (AUC of 0.9973) is achieved in classifying cough and non-cough recordings using the proposed method. The trained network is used to analyze the remaining audio recordings in the dataset, identifying only about 67% of recordings as containing usable cough sounds. This shows the need to exercise caution when using crowdsourced cough data.
引用
收藏
页数:4
相关论文
共 50 条
  • [41] CNN-RNN: a large-scale hierarchical image classification framework
    Guo, Yanming
    Liu, Yu
    Bakker, Erwin M.
    Guo, Yuanhao
    Lew, Michael S.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (08) : 10251 - 10271
  • [42] Relative CNN-RNN: Learning Relative Atmospheric Visibility From Images
    You, Yang
    Lu, Cewu
    Wang, Weiming
    Tang, Chi-Keung
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (01) : 45 - 55
  • [43] Video Emotion Recognition Using Local Enhanced Motion History Image and CNN-RNN Networks
    Wang, Haowen
    Zhou, Guoxiang
    Hu, Min
    Wang, Xiaohua
    BIOMETRIC RECOGNITION, CCBR 2018, 2018, 10996 : 109 - 119
  • [44] RUMBA-Mouse: Rapid User Mouse-Behavior Authentication Using a CNN-RNN Approach
    Fu, Shen
    Qin, Dong
    Qiao, Daji
    Amariucai, George T.
    2020 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2020,
  • [45] Biomedical Named Entity Recognition Based on Hybrid Multistage CNN-RNN Learner
    Phan, Robert
    Luu, Thoai Man
    Davey, Rachel
    Chetty, Girija
    2018 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND DATA ENGINEERING (ICMLDE 2018), 2018, : 128 - 135
  • [46] 基于CNN-RNN集成的隧道事故异常声音识别
    郎巨林
    郑晟
    电子测量技术, 2023, 46 (20) : 164 - 169
  • [47] Voice Pathology Detection Using a Two-Level Classifier Based on Combined CNN-RNN Architecture
    Ksibi, Amel
    Hakami, Nada Ali
    Alturki, Nazik
    Asiri, Mashael M. M.
    Zakariah, Mohammed
    Ayadi, Manel
    SUSTAINABILITY, 2023, 15 (04)
  • [48] Detection of Deepfake Media Using a Hybrid CNN-RNN Model and Particle Swarm Optimization (PSO) Algorithm
    Al-Adwan, Aryaf
    Alazzam, Hadeel
    Al-Anbaki, Noor
    Alduweib, Eman
    COMPUTERS, 2024, 13 (04)
  • [49] The assessment of 3D model representation for retrieval with CNN-RNN networks
    Weizhi Nie
    Kun Wang
    Hongtao Wang
    Yuting Su
    Multimedia Tools and Applications, 2019, 78 : 16979 - 16994
  • [50] A UNIFIED CNN-RNN APPROACH FOR IN-AIR HANDWRITTEN ENGLISH WORD RECOGNITION
    Gan, Ji
    Wang, Weiqiang
    Lu, Ke
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,