Classification of Parkinson's disease from smartphone recording data using time-frequency analysis and convolutional neural network

被引：1

作者：

Worasawate, Denchai ^{[1
]}

Asawaponwiput, Warisara ^{[1
]}

Yoshimura, Natsue ^{[2
]}

Intarapanich, Apichart ^{[3
]}

Surangsrirat, Decho ^{[4
]}

机构：

[1] Kasetsart Univ, Fac Engn, Dept Elect Engn, Bangkok, Thailand

[2] Tokyo Inst Technol, Inst Innovat Res, Yokohama, Kanagawa, Japan

[3] Natl Elect & Comp Technol Ctr, Educ Technol Team, Pathum Thani, Thailand

[4] Natl Sci & Technol Dev Agcy, Assist Technol & Med Devices Res Ctr, Pathum Thani, Thailand

来源：

TECHNOLOGY AND HEALTH CARE | 2023年 / 31卷 / 02期

关键词：

PD voice; audio classification; convolutional neural network; mPower study; AUTOMATIC CLASSIFICATION;

D O I：

10.3233/THC-220386

中图分类号：

R19 [保健组织与事业（卫生事业管理）];

学科分类号：

摘要：

BACKGROUND: Parkinson's disease (PD) is a long-term neurodegenerative disease of the central nervous system. The current diagnosis is dependent on clinical observation and the abilities and experience of a trained specialist. One of the symptoms that affects most patients is voice impairment. OBJECTIVE: Voice samples are non-invasive data that can be collected remotely for diagnosis and disease progression monitoring. In this study, we analyzed voice recording data from a smartphone as a possible medical self-diagnosis tool by using only one-second voice recording. The data from one of the largest mobile PD studies, the mPower study, was used. METHODS: A total of 29,798 ten-second voice recordings on smartphone from 4,051 participants were used for the analysis. The voice recordings were from sustained phonation by participants saying /aa/ for ten seconds into an iPhone microphone. A dataset comprising 385,143 short one-second audio samples was generated from the original ten-second voice recordings. The samples were converted to a spectrogram using a short-time Fourier transform. CNN models were then applied to classify the samples. RESULTS: Classification accuracies of the proposed method with LeNet-5, ResNet-50, and VGGNet-16 are 97.7 +/- 0.1%, 98.6 +/- 0.2%, and 99.3 +/- 0.1%, respectively. CONCLUSIONS: We achieve a respectable classification performance using a generalized approach on a dataset with a large number of samples. The result emphasizes that an analysis based on one-second clip recorded on a smartphone could be a promising non-invasive and remotely available PD biomarker.

引用

页码：705 / 718

页数：14

共 50 条

[41] Time-Frequency Localization Using Deep Convolutional Maxout Neural Network in Persian Speech Recognition
Dehghani, Arash
Seyyedsalehi, Seyyed Ali
NEURAL PROCESSING LETTERS, 2023, 55 (03) : 3205 - 3224
[42] Helicopter classification using time-frequency analysis
Yoon, SH
Kim, B
Kim, YS
ELECTRONICS LETTERS, 2000, 36 (22) : 1871 - 1872
[43] Frequency hopping modulation recognition of convolutional neural network based on time-frequency characteristics
Li H.-G.
Guo Y.
Sui P.
Qi Z.-S.
Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2020, 54 (10): : 1945 - 1954
[44] Blind Identification of Radio Access Techniques Based on Time-Frequency Analysis and Convolutional Neural Network
Hiremath, Shrishail M.
Deshmukh, Siddharth
Rakesh, R.
Patra, Sarat Kumar
PROCEEDINGS OF TENCON 2018 - 2018 IEEE REGION 10 CONFERENCE, 2018, : 1163 - 1167
[45] Classification of stroke disease using convolutional neural network
Marbun, J. T.
Seniman
Andayani, U.
2ND INTERNATIONAL CONFERENCE ON COMPUTING AND APPLIED INFORMATICS 2017, 2018, 978
[46] A time-frequency convolutional neural network for the offline classification of steady-state visual evoked potential responses
Cecotti, Hubert
PATTERN RECOGNITION LETTERS, 2011, 32 (08) : 1145 - 1153
[47] A Convolutional Neural Network Based Classification Method for Mild to Moderate Parkinson's Disease at Turns
Li, Xinge
Huang, Xiayu
Pang, Jun
Meng, Lin
Ming, Dong
12TH ASIAN-PACIFIC CONFERENCE ON MEDICAL AND BIOLOGICAL ENGINEERING, VOL 1, APCMBE 2023, 2024, 103 : 370 - 377
[48] A Deep Convolutional-Recurrent Neural Network Architecture for Parkinson's Disease EEG Classification
Lee, Soojin
Hussein, Ramy
McKeown, Martin J.
2019 7TH IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (IEEE GLOBALSIP), 2019,
[49] A Time-Frequency Depth Convolutional Recurrent Network for Seismic Waveform Automatic Classification
Li, Fu
Li, Diquan
Hu, Yanfang
Zhu, Yunqi
Liu, Yecheng
Wang, Zhe
Zhu, Hanyu
IEEE ACCESS, 2024, 12 : 155205 - 155217
[50] Time-Frequency Domain Deep Convolutional Neural Network for the Classification of Focal and Non-Focal EEG Signals
Madhavan, Srirangan
Tripathy, Rajesh Kumar
Pachori, Ram Bilas
IEEE SENSORS JOURNAL, 2020, 20 (06) : 3078 - 3086

← 1 2 3 4 5 →