Detecting Cough Recordings in Crowdsourced Data Using CNN-RNN

被引：1

作者：

Sharan, Roneel V. ^{[1
]}

Xiong, Hao ^{[1
]}

Berkovsky, Shlomo ^{[1
]}

机构：

[1] Macquarie Univ, Australian Inst Hlth Innovat, Sydney, NSW, Australia

来源：

2022 IEEE-EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL AND HEALTH INFORMATICS (BHI) JOINTLY ORGANISED WITH THE IEEE-EMBS INTERNATIONAL CONFERENCE ON WEARABLE AND IMPLANTABLE BODY SENSOR NETWORKS (BSN'22) | 2022年

关键词：

cough sound; crowdsourced; deep learning; melspectrogram; respiratory diseases;

D O I：

10.1109/BHI56158.2022.9926896

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The sound of cough is an important indicator of the condition of the respiratory system. Automatic cough sound evaluation can aid the diagnosis of respiratory diseases. Large crow dsourced cough sound datasets have recently been used by several groups around the world to develop cough classification models. However, not all recordings in these datasets contain cough sounds. As such, it is important to screen the recordings for the presence of cough sounds before developing cough classification models. This work proposes a method to screen crowdsourced audio recordings for cough sounds using deep learning methods. The proposed approach divides the audio recording into overlapping frames and converts each frame into a mel-spectrogram representation. A pretrained convolutional neural network for audio classification is trained to learn the spectral characteristics of cough and non-cough frames from its mel-spectrogram representation. It is combined with a recurrent neural network to learn the dependencies between the sequence of frames. The proposed method is evaluated on 400 crowdsourced audio recordings, manually annotated as cough or non-cough. An accuracy of 0.9800 (AUC of 0.9973) is achieved in classifying cough and non-cough recordings using the proposed method. The trained network is used to analyze the remaining audio recordings in the dataset, identifying only about 67% of recordings as containing usable cough sounds. This shows the need to exercise caution when using crowdsourced cough data.

引用

页数：4

共 50 条

[41] CNN-RNN: a large-scale hierarchical image classification framework
Guo, Yanming
Liu, Yu
Bakker, Erwin M.
Guo, Yuanhao
Lew, Michael S.
MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (08) : 10251 - 10271
[42] Relative CNN-RNN: Learning Relative Atmospheric Visibility From Images
You, Yang
Lu, Cewu
Wang, Weiming
Tang, Chi-Keung
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (01) : 45 - 55
[43] Video Emotion Recognition Using Local Enhanced Motion History Image and CNN-RNN Networks
Wang, Haowen
Zhou, Guoxiang
Hu, Min
Wang, Xiaohua
BIOMETRIC RECOGNITION, CCBR 2018, 2018, 10996 : 109 - 119
[44] RUMBA-Mouse: Rapid User Mouse-Behavior Authentication Using a CNN-RNN Approach
Fu, Shen
Qin, Dong
Qiao, Daji
Amariucai, George T.
2020 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2020,
[45] Biomedical Named Entity Recognition Based on Hybrid Multistage CNN-RNN Learner
Phan, Robert
Luu, Thoai Man
Davey, Rachel
Chetty, Girija
2018 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND DATA ENGINEERING (ICMLDE 2018), 2018, : 128 - 135
[46] 基于CNN-RNN集成的隧道事故异常声音识别
郎巨林
郑晟
电子测量技术, 2023, 46 (20) : 164 - 169
[47] Voice Pathology Detection Using a Two-Level Classifier Based on Combined CNN-RNN Architecture
Ksibi, Amel
Hakami, Nada Ali
Alturki, Nazik
Asiri, Mashael M. M.
Zakariah, Mohammed
Ayadi, Manel
SUSTAINABILITY, 2023, 15 (04)
[48] Detection of Deepfake Media Using a Hybrid CNN-RNN Model and Particle Swarm Optimization (PSO) Algorithm
Al-Adwan, Aryaf
Alazzam, Hadeel
Al-Anbaki, Noor
Alduweib, Eman
COMPUTERS, 2024, 13 (04)
[49] The assessment of 3D model representation for retrieval with CNN-RNN networks
Weizhi Nie
Kun Wang
Hongtao Wang
Yuting Su
Multimedia Tools and Applications, 2019, 78 : 16979 - 16994
[50] A UNIFIED CNN-RNN APPROACH FOR IN-AIR HANDWRITTEN ENGLISH WORD RECOGNITION
Gan, Ji
Wang, Weiqiang
Lu, Ke
2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,

← 1 2 3 4 5 →