Detecting Cough Recordings in Crowdsourced Data Using CNN-RNN

被引:1
|
作者
Sharan, Roneel V. [1 ]
Xiong, Hao [1 ]
Berkovsky, Shlomo [1 ]
机构
[1] Macquarie Univ, Australian Inst Hlth Innovat, Sydney, NSW, Australia
关键词
cough sound; crowdsourced; deep learning; melspectrogram; respiratory diseases;
D O I
10.1109/BHI56158.2022.9926896
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The sound of cough is an important indicator of the condition of the respiratory system. Automatic cough sound evaluation can aid the diagnosis of respiratory diseases. Large crow dsourced cough sound datasets have recently been used by several groups around the world to develop cough classification models. However, not all recordings in these datasets contain cough sounds. As such, it is important to screen the recordings for the presence of cough sounds before developing cough classification models. This work proposes a method to screen crowdsourced audio recordings for cough sounds using deep learning methods. The proposed approach divides the audio recording into overlapping frames and converts each frame into a mel-spectrogram representation. A pretrained convolutional neural network for audio classification is trained to learn the spectral characteristics of cough and non-cough frames from its mel-spectrogram representation. It is combined with a recurrent neural network to learn the dependencies between the sequence of frames. The proposed method is evaluated on 400 crowdsourced audio recordings, manually annotated as cough or non-cough. An accuracy of 0.9800 (AUC of 0.9973) is achieved in classifying cough and non-cough recordings using the proposed method. The trained network is used to analyze the remaining audio recordings in the dataset, identifying only about 67% of recordings as containing usable cough sounds. This shows the need to exercise caution when using crowdsourced cough data.
引用
下载
收藏
页数:4
相关论文
共 50 条
  • [31] An Integrated Hybrid CNN-RNN Model for Visual Description and Generation of Captions
    Khamparia, Aditya
    Pandey, Babita
    Tiwari, Shrasti
    Gupta, Deepak
    Khanna, Ashish
    Rodrigues, Joel J. P. C.
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2020, 39 (02) : 776 - 788
  • [32] Surgical Tool Segmentation Using A Hybrid Deep CNN-RNN Auto Encoder-Decoder
    Attia, Mohamed
    Hossny, Mohammed
    Nahavandi, Saeid
    Asadi, Hamed
    2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017, : 3373 - 3378
  • [33] Improving Hate Speech Detection Accuracy using Hybrid CNN-RNN and Random Oversampling Techniques
    Riyadi, Slamet
    Andriyani, Annisa Divayu
    Masyhur, Ahmad Musthafa
    2024 IEEE SYMPOSIUM ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, ISIEA 2024, 2024,
  • [34] CNN-RNN: A Unified Framework for Multi-label Image Classification
    Wang, Jiang
    Yang, Yi
    Mao, Junhua
    Huang, Zhiheng
    Huang, Chang
    Xu, Wei
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2285 - 2294
  • [35] Yield estimation of winter wheat in China based on CNN-RNN network
    He X.
    Luo H.
    Qiao M.
    Tian Z.
    Zhou G.
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2021, 37 (17): : 124 - 132
  • [36] CNN-RNN architecture to calculate BPM from underwater ECG samples
    Beckingham, Thomas
    Spencer, Joseph
    McKay, Kirsty
    APPLIED INTELLIGENCE, 2023, 53 (18) : 21156 - 21166
  • [37] An OpenCL-Based Hybrid CNN-RNN Inference Accelerator On FPGA
    Sun, Yunfei
    Liu, Brian
    Xu, Xianchao
    2019 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT 2019), 2019, : 283 - 286
  • [38] CNN-RNN: a large-scale hierarchical image classification framework
    Yanming Guo
    Yu Liu
    Erwin M. Bakker
    Yuanhao Guo
    Michael S. Lew
    Multimedia Tools and Applications, 2018, 77 : 10251 - 10271
  • [39] CNN-RNN architecture to calculate BPM from underwater ECG samples
    Thomas Beckingham
    Joseph Spencer
    Kirsty McKay
    Applied Intelligence, 2023, 53 : 21156 - 21166
  • [40] Image Captioning Encoder-Decoder Models Using CNN-RNN Architectures: A Comparative Study
    Suresh, K. Revati
    Jarapala, Arun
    Sudeep, P., V
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 41 (10) : 5719 - 5742