Detecting Cough Recordings in Crowdsourced Data Using CNN-RNN

被引：1

作者：

Sharan, Roneel V. ^{[1
]}

Xiong, Hao ^{[1
]}

Berkovsky, Shlomo ^{[1
]}

机构：

[1] Macquarie Univ, Australian Inst Hlth Innovat, Sydney, NSW, Australia

来源：

2022 IEEE-EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL AND HEALTH INFORMATICS (BHI) JOINTLY ORGANISED WITH THE IEEE-EMBS INTERNATIONAL CONFERENCE ON WEARABLE AND IMPLANTABLE BODY SENSOR NETWORKS (BSN'22) | 2022年

关键词：

cough sound; crowdsourced; deep learning; melspectrogram; respiratory diseases;

D O I：

10.1109/BHI56158.2022.9926896

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The sound of cough is an important indicator of the condition of the respiratory system. Automatic cough sound evaluation can aid the diagnosis of respiratory diseases. Large crow dsourced cough sound datasets have recently been used by several groups around the world to develop cough classification models. However, not all recordings in these datasets contain cough sounds. As such, it is important to screen the recordings for the presence of cough sounds before developing cough classification models. This work proposes a method to screen crowdsourced audio recordings for cough sounds using deep learning methods. The proposed approach divides the audio recording into overlapping frames and converts each frame into a mel-spectrogram representation. A pretrained convolutional neural network for audio classification is trained to learn the spectral characteristics of cough and non-cough frames from its mel-spectrogram representation. It is combined with a recurrent neural network to learn the dependencies between the sequence of frames. The proposed method is evaluated on 400 crowdsourced audio recordings, manually annotated as cough or non-cough. An accuracy of 0.9800 (AUC of 0.9973) is achieved in classifying cough and non-cough recordings using the proposed method. The trained network is used to analyze the remaining audio recordings in the dataset, identifying only about 67% of recordings as containing usable cough sounds. This shows the need to exercise caution when using crowdsourced cough data.

引用

下载

页数：4

共 50 条

[31] An Integrated Hybrid CNN-RNN Model for Visual Description and Generation of Captions
Khamparia, Aditya
Pandey, Babita
Tiwari, Shrasti
Gupta, Deepak
Khanna, Ashish
Rodrigues, Joel J. P. C.
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2020, 39 (02) : 776 - 788
[32] Surgical Tool Segmentation Using A Hybrid Deep CNN-RNN Auto Encoder-Decoder
Attia, Mohamed
Hossny, Mohammed
Nahavandi, Saeid
Asadi, Hamed
2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017, : 3373 - 3378
[33] Improving Hate Speech Detection Accuracy using Hybrid CNN-RNN and Random Oversampling Techniques
Riyadi, Slamet
Andriyani, Annisa Divayu
Masyhur, Ahmad Musthafa
2024 IEEE SYMPOSIUM ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, ISIEA 2024, 2024,
[34] CNN-RNN: A Unified Framework for Multi-label Image Classification
Wang, Jiang
Yang, Yi
Mao, Junhua
Huang, Zhiheng
Huang, Chang
Xu, Wei
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2285 - 2294
[35] Yield estimation of winter wheat in China based on CNN-RNN network
He X.
Luo H.
Qiao M.
Tian Z.
Zhou G.
Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2021, 37 (17): : 124 - 132
[36] CNN-RNN architecture to calculate BPM from underwater ECG samples
Beckingham, Thomas
Spencer, Joseph
McKay, Kirsty
APPLIED INTELLIGENCE, 2023, 53 (18) : 21156 - 21166
[37] An OpenCL-Based Hybrid CNN-RNN Inference Accelerator On FPGA
Sun, Yunfei
Liu, Brian
Xu, Xianchao
2019 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT 2019), 2019, : 283 - 286
[38] CNN-RNN: a large-scale hierarchical image classification framework
Yanming Guo
Yu Liu
Erwin M. Bakker
Yuanhao Guo
Michael S. Lew
Multimedia Tools and Applications, 2018, 77 : 10251 - 10271
[39] CNN-RNN architecture to calculate BPM from underwater ECG samples
Thomas Beckingham
Joseph Spencer
Kirsty McKay
Applied Intelligence, 2023, 53 : 21156 - 21166
[40] Image Captioning Encoder-Decoder Models Using CNN-RNN Architectures: A Comparative Study
Suresh, K. Revati
Jarapala, Arun
Sudeep, P., V
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 41 (10) : 5719 - 5742

← 1 2 3 4 5 →