Detecting Cough Recordings in Crowdsourced Data Using CNN-RNN

Cited by: 1
Authors
Sharan, Roneel V. [1]
Xiong, Hao [1]
Berkovsky, Shlomo [1]
Affiliations
[1] Macquarie Univ, Australian Inst Hlth Innovat, Sydney, NSW, Australia
Keywords
cough sound; crowdsourced; deep learning; melspectrogram; respiratory diseases
DOI
10.1109/BHI56158.2022.9926896
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The sound of a cough is an important indicator of the condition of the respiratory system, and automatic cough sound evaluation can aid the diagnosis of respiratory diseases. Large crowdsourced cough sound datasets have recently been used by several groups around the world to develop cough classification models. However, not all recordings in these datasets contain cough sounds, so it is important to screen the recordings for the presence of cough sounds before developing cough classification models. This work proposes a method to screen crowdsourced audio recordings for cough sounds using deep learning. The proposed approach divides each audio recording into overlapping frames and converts each frame into a mel-spectrogram representation. A pretrained convolutional neural network for audio classification is trained to learn the spectral characteristics of cough and non-cough frames from their mel-spectrogram representations. It is combined with a recurrent neural network to learn the dependencies across the sequence of frames. The proposed method is evaluated on 400 crowdsourced audio recordings, manually annotated as cough or non-cough, and achieves an accuracy of 0.9800 (AUC of 0.9973) in classifying cough and non-cough recordings. The trained network is then used to analyze the remaining audio recordings in the dataset, identifying only about 67% of recordings as containing usable cough sounds. This shows the need to exercise caution when using crowdsourced cough data.
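The front end described in the abstract (overlapping frames, each converted to a mel-spectrogram) can be illustrated with a minimal NumPy sketch. The abstract does not state the frame length, overlap, FFT size, or number of mel bands, so every value below is an assumption chosen only for illustration, and the function names are hypothetical. The pretrained CNN and the RNN are not shown.

```python
import numpy as np

def split_into_frames(signal, frame_len, hop_len):
    """Split a 1-D signal into overlapping frames, zero-padding the tail."""
    signal = np.asarray(signal, dtype=float)
    if len(signal) <= frame_len:
        n_frames = 1
    else:
        n_frames = 1 + int(np.ceil((len(signal) - frame_len) / hop_len))
    padded = np.zeros(frame_len + (n_frames - 1) * hop_len)
    padded[:len(signal)] = signal
    # Row i of idx holds the sample indices of frame i.
    idx = np.arange(frame_len)[None, :] + hop_len * np.arange(n_frames)[:, None]
    return padded[idx]

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_mels, n_fft, sample_rate):
    """Triangular filters spaced evenly on the mel scale."""
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2), n_mels + 2)
    hz_points = mel_to_hz(mel_points)
    fft_freqs = np.fft.rfftfreq(n_fft, d=1.0 / sample_rate)
    bank = np.zeros((n_mels, len(fft_freqs)))
    for m in range(n_mels):
        lo, center, hi = hz_points[m], hz_points[m + 1], hz_points[m + 2]
        rising = (fft_freqs - lo) / (center - lo)
        falling = (hi - fft_freqs) / (hi - center)
        bank[m] = np.maximum(0.0, np.minimum(rising, falling))
    return bank

def log_mel_spectrogram(frame, sample_rate, n_fft=512, hop=256, n_mels=64):
    """Log-mel spectrogram of one frame, shape (n_mels, n_windows)."""
    windows = split_into_frames(frame, n_fft, hop) * np.hanning(n_fft)
    power = np.abs(np.fft.rfft(windows, axis=1)) ** 2
    mel = mel_filterbank(n_mels, n_fft, sample_rate) @ power.T
    return np.log(mel + 1e-10)  # small offset avoids log(0)

# Assumed settings: 16 kHz audio, 1 s frames with 50% overlap.
sample_rate = 16000
rng = np.random.default_rng(0)
audio = rng.standard_normal(3 * sample_rate)  # stand-in for a 3 s recording

frames = split_into_frames(audio, frame_len=sample_rate, hop_len=sample_rate // 2)
features = np.stack([log_mel_spectrogram(f, sample_rate) for f in frames])
print(frames.shape, features.shape)
```

Under these assumptions, `features` holds one mel-spectrogram per frame, which is the kind of sequence a frame-level CNN followed by an RNN would consume.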
Pages: 4