Classification of Parkinson's disease from smartphone recording data using time-frequency analysis and convolutional neural network

被引:1
|
作者
Worasawate, Denchai [1 ]
Asawaponwiput, Warisara [1 ]
Yoshimura, Natsue [2 ]
Intarapanich, Apichart [3 ]
Surangsrirat, Decho [4 ]
机构
[1] Kasetsart Univ, Fac Engn, Dept Elect Engn, Bangkok, Thailand
[2] Tokyo Inst Technol, Inst Innovat Res, Yokohama, Kanagawa, Japan
[3] Natl Elect & Comp Technol Ctr, Educ Technol Team, Pathum Thani, Thailand
[4] Natl Sci & Technol Dev Agcy, Assist Technol & Med Devices Res Ctr, Pathum Thani, Thailand
关键词
PD voice; audio classification; convolutional neural network; mPower study; AUTOMATIC CLASSIFICATION;
D O I
10.3233/THC-220386
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
BACKGROUND: Parkinson's disease (PD) is a long-term neurodegenerative disease of the central nervous system. The current diagnosis is dependent on clinical observation and the abilities and experience of a trained specialist. One of the symptoms that affects most patients is voice impairment. OBJECTIVE: Voice samples are non-invasive data that can be collected remotely for diagnosis and disease progression monitoring. In this study, we analyzed voice recording data from a smartphone as a possible medical self-diagnosis tool by using only one-second voice recording. The data from one of the largest mobile PD studies, the mPower study, was used. METHODS: A total of 29,798 ten-second voice recordings on smartphone from 4,051 participants were used for the analysis. The voice recordings were from sustained phonation by participants saying /aa/ for ten seconds into an iPhone microphone. A dataset comprising 385,143 short one-second audio samples was generated from the original ten-second voice recordings. The samples were converted to a spectrogram using a short-time Fourier transform. CNN models were then applied to classify the samples. RESULTS: Classification accuracies of the proposed method with LeNet-5, ResNet-50, and VGGNet-16 are 97.7 +/- 0.1%, 98.6 +/- 0.2%, and 99.3 +/- 0.1%, respectively. CONCLUSIONS: We achieve a respectable classification performance using a generalized approach on a dataset with a large number of samples. The result emphasizes that an analysis based on one-second clip recorded on a smartphone could be a promising non-invasive and remotely available PD biomarker.
引用
收藏
页码:705 / 718
页数:14
相关论文
共 50 条
  • [41] Time-Frequency Localization Using Deep Convolutional Maxout Neural Network in Persian Speech Recognition
    Dehghani, Arash
    Seyyedsalehi, Seyyed Ali
    NEURAL PROCESSING LETTERS, 2023, 55 (03) : 3205 - 3224
  • [42] Helicopter classification using time-frequency analysis
    Yoon, SH
    Kim, B
    Kim, YS
    ELECTRONICS LETTERS, 2000, 36 (22) : 1871 - 1872
  • [43] Frequency hopping modulation recognition of convolutional neural network based on time-frequency characteristics
    Li H.-G.
    Guo Y.
    Sui P.
    Qi Z.-S.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2020, 54 (10): : 1945 - 1954
  • [44] Blind Identification of Radio Access Techniques Based on Time-Frequency Analysis and Convolutional Neural Network
    Hiremath, Shrishail M.
    Deshmukh, Siddharth
    Rakesh, R.
    Patra, Sarat Kumar
    PROCEEDINGS OF TENCON 2018 - 2018 IEEE REGION 10 CONFERENCE, 2018, : 1163 - 1167
  • [45] Classification of stroke disease using convolutional neural network
    Marbun, J. T.
    Seniman
    Andayani, U.
    2ND INTERNATIONAL CONFERENCE ON COMPUTING AND APPLIED INFORMATICS 2017, 2018, 978
  • [46] A time-frequency convolutional neural network for the offline classification of steady-state visual evoked potential responses
    Cecotti, Hubert
    PATTERN RECOGNITION LETTERS, 2011, 32 (08) : 1145 - 1153
  • [47] A Convolutional Neural Network Based Classification Method for Mild to Moderate Parkinson's Disease at Turns
    Li, Xinge
    Huang, Xiayu
    Pang, Jun
    Meng, Lin
    Ming, Dong
    12TH ASIAN-PACIFIC CONFERENCE ON MEDICAL AND BIOLOGICAL ENGINEERING, VOL 1, APCMBE 2023, 2024, 103 : 370 - 377
  • [48] A Deep Convolutional-Recurrent Neural Network Architecture for Parkinson's Disease EEG Classification
    Lee, Soojin
    Hussein, Ramy
    McKeown, Martin J.
    2019 7TH IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (IEEE GLOBALSIP), 2019,
  • [49] A Time-Frequency Depth Convolutional Recurrent Network for Seismic Waveform Automatic Classification
    Li, Fu
    Li, Diquan
    Hu, Yanfang
    Zhu, Yunqi
    Liu, Yecheng
    Wang, Zhe
    Zhu, Hanyu
    IEEE ACCESS, 2024, 12 : 155205 - 155217
  • [50] Time-Frequency Domain Deep Convolutional Neural Network for the Classification of Focal and Non-Focal EEG Signals
    Madhavan, Srirangan
    Tripathy, Rajesh Kumar
    Pachori, Ram Bilas
    IEEE SENSORS JOURNAL, 2020, 20 (06) : 3078 - 3086