Detecting Cough Recordings in Crowdsourced Data Using CNN-RNN

Cited by: 1
Authors
Sharan, Roneel V. [1]
Xiong, Hao [1]
Berkovsky, Shlomo [1]
Affiliations
[1] Macquarie Univ, Australian Inst Hlth Innovat, Sydney, NSW, Australia
Keywords
cough sound; crowdsourced; deep learning; mel-spectrogram; respiratory diseases
DOI
10.1109/BHI56158.2022.9926896
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The sound of a cough is an important indicator of the condition of the respiratory system, and automatic cough sound evaluation can aid the diagnosis of respiratory diseases. Large crowdsourced cough sound datasets have recently been used by several groups around the world to develop cough classification models. However, not all recordings in these datasets contain cough sounds, so it is important to screen the recordings for the presence of cough sounds before developing cough classification models. This work proposes a method to screen crowdsourced audio recordings for cough sounds using deep learning. The proposed approach divides each audio recording into overlapping frames and converts each frame into a mel-spectrogram representation. A convolutional neural network pretrained for audio classification is trained to learn the spectral characteristics of cough and non-cough frames from their mel-spectrogram representations. It is combined with a recurrent neural network to learn the dependencies across the sequence of frames. The proposed method is evaluated on 400 crowdsourced audio recordings, manually annotated as cough or non-cough. It achieves an accuracy of 0.9800 (AUC of 0.9973) in classifying cough and non-cough recordings. The trained network is then used to analyze the remaining audio recordings in the dataset, identifying only about 67% of recordings as containing usable cough sounds. This shows the need to exercise caution when using crowdsourced cough data.
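The first step described in the abstract, dividing a recording into overlapping frames, can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the function name `split_into_frames`, the 16 kHz sampling rate, the 1-second frame length, and the 50% overlap are all assumptions, since the abstract does not state these values. Each resulting frame would then be converted to a mel-spectrogram and passed to the CNN, with the RNN operating over the sequence of frames.

```python
import numpy as np


def split_into_frames(signal, frame_len, hop_len):
    """Split a 1-D signal into overlapping frames, zero-padding the tail.

    Hypothetical helper for illustration; frame_len and hop_len are in samples.
    """
    signal = np.asarray(signal, dtype=np.float32)
    # Make sure there is at least one full frame.
    if len(signal) < frame_len:
        signal = np.pad(signal, (0, frame_len - len(signal)))
    # Number of frames needed to cover the whole signal.
    n_frames = 1 + int(np.ceil((len(signal) - frame_len) / hop_len))
    padded_len = (n_frames - 1) * hop_len + frame_len
    signal = np.pad(signal, (0, padded_len - len(signal)))
    # Row i holds samples [i * hop_len, i * hop_len + frame_len).
    idx = np.arange(frame_len)[None, :] + hop_len * np.arange(n_frames)[:, None]
    return signal[idx]


sr = 16000  # assumed sampling rate in Hz (not given in the abstract)
audio = np.random.default_rng(0).standard_normal(3 * sr)  # 3 s of placeholder audio
frames = split_into_frames(audio, frame_len=sr, hop_len=sr // 2)  # assumed 1 s frames, 50% overlap
print(frames.shape)  # (5, 16000)
```

With a hop shorter than the frame length, consecutive frames share samples, so a cough that straddles a frame boundary still appears whole in at least one frame.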
Pages: 4