Classification of Parkinson's disease from smartphone recording data using time-frequency analysis and convolutional neural network

被引:1
|
作者
Worasawate, Denchai [1 ]
Asawaponwiput, Warisara [1 ]
Yoshimura, Natsue [2 ]
Intarapanich, Apichart [3 ]
Surangsrirat, Decho [4 ]
机构
[1] Kasetsart Univ, Fac Engn, Dept Elect Engn, Bangkok, Thailand
[2] Tokyo Inst Technol, Inst Innovat Res, Yokohama, Kanagawa, Japan
[3] Natl Elect & Comp Technol Ctr, Educ Technol Team, Pathum Thani, Thailand
[4] Natl Sci & Technol Dev Agcy, Assist Technol & Med Devices Res Ctr, Pathum Thani, Thailand
关键词
PD voice; audio classification; convolutional neural network; mPower study; AUTOMATIC CLASSIFICATION;
D O I
10.3233/THC-220386
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
BACKGROUND: Parkinson's disease (PD) is a long-term neurodegenerative disease of the central nervous system. The current diagnosis is dependent on clinical observation and the abilities and experience of a trained specialist. One of the symptoms that affects most patients is voice impairment. OBJECTIVE: Voice samples are non-invasive data that can be collected remotely for diagnosis and disease progression monitoring. In this study, we analyzed voice recording data from a smartphone as a possible medical self-diagnosis tool by using only one-second voice recording. The data from one of the largest mobile PD studies, the mPower study, was used. METHODS: A total of 29,798 ten-second voice recordings on smartphone from 4,051 participants were used for the analysis. The voice recordings were from sustained phonation by participants saying /aa/ for ten seconds into an iPhone microphone. A dataset comprising 385,143 short one-second audio samples was generated from the original ten-second voice recordings. The samples were converted to a spectrogram using a short-time Fourier transform. CNN models were then applied to classify the samples. RESULTS: Classification accuracies of the proposed method with LeNet-5, ResNet-50, and VGGNet-16 are 97.7 +/- 0.1%, 98.6 +/- 0.2%, and 99.3 +/- 0.1%, respectively. CONCLUSIONS: We achieve a respectable classification performance using a generalized approach on a dataset with a large number of samples. The result emphasizes that an analysis based on one-second clip recorded on a smartphone could be a promising non-invasive and remotely available PD biomarker.
引用
收藏
页码:705 / 718
页数:14
相关论文
共 50 条
  • [31] AUTOMATIC RADAR WAVEFORM RECOGNITION BASED ON TIME-FREQUENCY ANALYSIS AND CONVOLUTIONAL NEURAL NETWORK
    Wang, Chao
    Wang, Jian
    Zhang, Xudong
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2437 - 2441
  • [32] Classification of EEG signals based on time-frequency analysis and spiking neural network
    Wang Qing-Hua
    Wang Li-Na
    Xu Song
    2020 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (IEEE ICSPCC 2020), 2020,
  • [33] Time-frequency Analysis and Convolutional Neural Network Based Fuze Jamming Signal Recognition
    Yang, Jikai
    Bai, Zhiquan
    Hu, Jiacheng
    Yang, Yingchao
    Xian, Zhaoxia
    Hao, Xinhong
    Kwak, Kyungsup
    International Conference on Advanced Communication Technology, ICACT, 2023, 2023-February : 277 - 282
  • [34] Arrhythmia Disease Diagnosis Based on ECG Time-Frequency Domain Fusion and Convolutional Neural Network
    Wang, Bocheng
    Chen, Guorong
    Rong, Lu
    Liu, Yuchuan
    Yu, Anning
    He, Xiaohui
    Wen, Tingting
    Zhang, Yixuan
    Hu, Biaobiao
    IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE, 2023, 11 : 116 - 125
  • [35] E-Nose: Time-Frequency Attention Convolutional Neural Network for Gas Classification and Concentration Prediction
    Jiang, Minglv
    Li, Na
    Li, Mingyong
    Wang, Zhou
    Tian, Yuan
    Peng, Kaiyan
    Sheng, Haoran
    Li, Haoyu
    Li, Qiang
    SENSORS, 2024, 24 (13)
  • [36] Improving time-frequency resolution in non-stationary signal analysis using a convolutional recurrent neural network
    Krishna, B. Murali
    Satyanarayana, S. V. V.
    Satyanarayana, P. V. V.
    Suman, M. Venkata
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (05) : 4797 - 4810
  • [37] Blind modulation classification in multiple input and output-orthogonal frequency division multiplexing using time-frequency analysis and customized convolutional neural network architecture
    PramodKumar, Aylapogu
    KiranKumar, Gurrala
    TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2023, 34 (03)
  • [38] Evaluation of Recurrent Neural Network Models for Parkinson's Disease Classification Using Drawing Data
    Shenoy, Arjun A., V
    Lones, Michael A.
    Smith, Stephen L.
    Vallejo, Marta
    2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 1702 - 1706
  • [39] In-Field Citrus Disease Classification via Convolutional Neural Network from Smartphone Images
    Yang, Changcai
    Teng, Zixuan
    Dong, Caixia
    Lin, Yaohai
    Chen, Riqing
    Wang, Jian
    AGRICULTURE-BASEL, 2022, 12 (09):
  • [40] Time-Frequency Localization Using Deep Convolutional Maxout Neural Network in Persian Speech Recognition
    Arash Dehghani
    Seyyed Ali Seyyedsalehi
    Neural Processing Letters, 2023, 55 : 3205 - 3224