Classification of Parkinson's disease from smartphone recording data using time-frequency analysis and convolutional neural network

被引：1

作者：

Worasawate, Denchai ^{[1
]}

Asawaponwiput, Warisara ^{[1
]}

Yoshimura, Natsue ^{[2
]}

Intarapanich, Apichart ^{[3
]}

Surangsrirat, Decho ^{[4
]}

机构：

[1] Kasetsart Univ, Fac Engn, Dept Elect Engn, Bangkok, Thailand

[2] Tokyo Inst Technol, Inst Innovat Res, Yokohama, Kanagawa, Japan

[3] Natl Elect & Comp Technol Ctr, Educ Technol Team, Pathum Thani, Thailand

[4] Natl Sci & Technol Dev Agcy, Assist Technol & Med Devices Res Ctr, Pathum Thani, Thailand

来源：

TECHNOLOGY AND HEALTH CARE | 2023年 / 31卷 / 02期

关键词：

PD voice; audio classification; convolutional neural network; mPower study; AUTOMATIC CLASSIFICATION;

D O I：

10.3233/THC-220386

中图分类号：

R19 [保健组织与事业（卫生事业管理）];

学科分类号：

摘要：

BACKGROUND: Parkinson's disease (PD) is a long-term neurodegenerative disease of the central nervous system. The current diagnosis is dependent on clinical observation and the abilities and experience of a trained specialist. One of the symptoms that affects most patients is voice impairment. OBJECTIVE: Voice samples are non-invasive data that can be collected remotely for diagnosis and disease progression monitoring. In this study, we analyzed voice recording data from a smartphone as a possible medical self-diagnosis tool by using only one-second voice recording. The data from one of the largest mobile PD studies, the mPower study, was used. METHODS: A total of 29,798 ten-second voice recordings on smartphone from 4,051 participants were used for the analysis. The voice recordings were from sustained phonation by participants saying /aa/ for ten seconds into an iPhone microphone. A dataset comprising 385,143 short one-second audio samples was generated from the original ten-second voice recordings. The samples were converted to a spectrogram using a short-time Fourier transform. CNN models were then applied to classify the samples. RESULTS: Classification accuracies of the proposed method with LeNet-5, ResNet-50, and VGGNet-16 are 97.7 +/- 0.1%, 98.6 +/- 0.2%, and 99.3 +/- 0.1%, respectively. CONCLUSIONS: We achieve a respectable classification performance using a generalized approach on a dataset with a large number of samples. The result emphasizes that an analysis based on one-second clip recorded on a smartphone could be a promising non-invasive and remotely available PD biomarker.

引用

页码：705 / 718

页数：14

共 50 条

[31] AUTOMATIC RADAR WAVEFORM RECOGNITION BASED ON TIME-FREQUENCY ANALYSIS AND CONVOLUTIONAL NEURAL NETWORK
Wang, Chao
Wang, Jian
Zhang, Xudong
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2437 - 2441
[32] Classification of EEG signals based on time-frequency analysis and spiking neural network
Wang Qing-Hua
Wang Li-Na
Xu Song
2020 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (IEEE ICSPCC 2020), 2020,
[33] Time-frequency Analysis and Convolutional Neural Network Based Fuze Jamming Signal Recognition
Yang, Jikai
Bai, Zhiquan
Hu, Jiacheng
Yang, Yingchao
Xian, Zhaoxia
Hao, Xinhong
Kwak, Kyungsup
International Conference on Advanced Communication Technology, ICACT, 2023, 2023-February : 277 - 282
[34] Arrhythmia Disease Diagnosis Based on ECG Time-Frequency Domain Fusion and Convolutional Neural Network
Wang, Bocheng
Chen, Guorong
Rong, Lu
Liu, Yuchuan
Yu, Anning
He, Xiaohui
Wen, Tingting
Zhang, Yixuan
Hu, Biaobiao
IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE, 2023, 11 : 116 - 125
[35] E-Nose: Time-Frequency Attention Convolutional Neural Network for Gas Classification and Concentration Prediction
Jiang, Minglv
Li, Na
Li, Mingyong
Wang, Zhou
Tian, Yuan
Peng, Kaiyan
Sheng, Haoran
Li, Haoyu
Li, Qiang
SENSORS, 2024, 24 (13)
[36] Improving time-frequency resolution in non-stationary signal analysis using a convolutional recurrent neural network
Krishna, B. Murali
Satyanarayana, S. V. V.
Satyanarayana, P. V. V.
Suman, M. Venkata
SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (05) : 4797 - 4810
[37] Blind modulation classification in multiple input and output-orthogonal frequency division multiplexing using time-frequency analysis and customized convolutional neural network architecture
PramodKumar, Aylapogu
KiranKumar, Gurrala
TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2023, 34 (03)
[38] Evaluation of Recurrent Neural Network Models for Parkinson's Disease Classification Using Drawing Data
Shenoy, Arjun A., V
Lones, Michael A.
Smith, Stephen L.
Vallejo, Marta
2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 1702 - 1706
[39] In-Field Citrus Disease Classification via Convolutional Neural Network from Smartphone Images
Yang, Changcai
Teng, Zixuan
Dong, Caixia
Lin, Yaohai
Chen, Riqing
Wang, Jian
AGRICULTURE-BASEL, 2022, 12 (09):
[40] Time-Frequency Localization Using Deep Convolutional Maxout Neural Network in Persian Speech Recognition
Arash Dehghani
Seyyed Ali Seyyedsalehi
Neural Processing Letters, 2023, 55 : 3205 - 3224

← 1 2 3 4 5 →