Classification of Parkinson's disease from smartphone recording data using time-frequency analysis and convolutional neural network

被引:1
|
作者
Worasawate, Denchai [1 ]
Asawaponwiput, Warisara [1 ]
Yoshimura, Natsue [2 ]
Intarapanich, Apichart [3 ]
Surangsrirat, Decho [4 ]
机构
[1] Kasetsart Univ, Fac Engn, Dept Elect Engn, Bangkok, Thailand
[2] Tokyo Inst Technol, Inst Innovat Res, Yokohama, Kanagawa, Japan
[3] Natl Elect & Comp Technol Ctr, Educ Technol Team, Pathum Thani, Thailand
[4] Natl Sci & Technol Dev Agcy, Assist Technol & Med Devices Res Ctr, Pathum Thani, Thailand
关键词
PD voice; audio classification; convolutional neural network; mPower study; AUTOMATIC CLASSIFICATION;
D O I
10.3233/THC-220386
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
BACKGROUND: Parkinson's disease (PD) is a long-term neurodegenerative disease of the central nervous system. The current diagnosis is dependent on clinical observation and the abilities and experience of a trained specialist. One of the symptoms that affects most patients is voice impairment. OBJECTIVE: Voice samples are non-invasive data that can be collected remotely for diagnosis and disease progression monitoring. In this study, we analyzed voice recording data from a smartphone as a possible medical self-diagnosis tool by using only one-second voice recording. The data from one of the largest mobile PD studies, the mPower study, was used. METHODS: A total of 29,798 ten-second voice recordings on smartphone from 4,051 participants were used for the analysis. The voice recordings were from sustained phonation by participants saying /aa/ for ten seconds into an iPhone microphone. A dataset comprising 385,143 short one-second audio samples was generated from the original ten-second voice recordings. The samples were converted to a spectrogram using a short-time Fourier transform. CNN models were then applied to classify the samples. RESULTS: Classification accuracies of the proposed method with LeNet-5, ResNet-50, and VGGNet-16 are 97.7 +/- 0.1%, 98.6 +/- 0.2%, and 99.3 +/- 0.1%, respectively. CONCLUSIONS: We achieve a respectable classification performance using a generalized approach on a dataset with a large number of samples. The result emphasizes that an analysis based on one-second clip recorded on a smartphone could be a promising non-invasive and remotely available PD biomarker.
引用
收藏
页码:705 / 718
页数:14
相关论文
共 50 条
  • [1] Sparse time-frequency analysis of seismic data via convolutional neural network
    Liu, Naihao
    Lei, Youbo
    Yang, Yang
    Wang, Zhiguo
    Liu, Rongchang
    Gao, Jinghuai
    Wei, Tao
    INTERPRETATION-A JOURNAL OF SUBSURFACE CHARACTERIZATION, 2024, 12 (01): : T47 - T62
  • [2] An efficient Parkinson's disease detection framework: Leveraging time-frequency representation and AlexNet convolutional neural network
    Siuly S.
    Khare S.K.
    Kabir E.
    Sadiq M.T.
    Wang H.
    Comput. Biol. Med., 2024,
  • [3] Explainable time-frequency convolutional neural network for microseismic waveform classification
    Bi, Xin
    Zhang, Chao
    He, Yao
    Zhao, Xiangguo
    Sun, Yongjiao
    Ma, Yuliang
    INFORMATION SCIENCES, 2021, 546 : 883 - 896
  • [4] Structural damage severity classification from time-frequency acceleration data using convolutional neural networks
    Jamshidi, Maziar
    El-Badry, Mamdouh
    STRUCTURES, 2023, 54 : 236 - 253
  • [5] Research on Arrhythmia Classification by Using Convolutional Neural Network with Mixed Time-Frequency Domain Features
    Lü H.
    Jiang M.-F.
    Li Y.
    Zhang J.-C.
    Wang Z.-K.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2023, 51 (03): : 701 - 711
  • [6] Time-frequency Performance Study on Urban Sound Classification with Convolutional Neural Network
    Shu, Haiyan
    Song, Ying
    Zhou, Huan
    PROCEEDINGS OF TENCON 2018 - 2018 IEEE REGION 10 CONFERENCE, 2018, : 1713 - 1717
  • [7] Multimodal Retinal Imaging Classification for Parkinson's Disease Using a Convolutional Neural Network
    Richardson, Alexander
    Kundu, Anita
    Henao, Ricardo
    Lee, Terry
    Scott, Burton L.
    Grewal, Dilraj S.
    Fekrat, Sharon
    TRANSLATIONAL VISION SCIENCE & TECHNOLOGY, 2024, 13 (08):
  • [8] Convolutional neural network based emotion classification using electrodermal activity signals and time-frequency features
    Ganapathy, Nagarajan
    Veeranki, Yedukondala Rao
    Swaminathan, Ramakrishnan
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 159
  • [9] Classification of Multistates of Internal Damage in Concrete Based on Convolutional Neural Network Analysis of Time-Frequency Images
    Shang, Gao
    Chen, Jun
    JOURNAL OF PERFORMANCE OF CONSTRUCTED FACILITIES, 2022, 36 (06)
  • [10] Convolutional Neural Network-Based Parkinson Disease Classification Using SPECT Imaging Data
    Hathaliya, Jigna
    Parekh, Raj
    Patel, Nisarg
    Gupta, Rajesh
    Tanwar, Sudeep
    Alqahtani, Fayez
    Elghatwary, Magdy
    Ivanov, Ovidiu
    Raboaca, Maria Simona
    Neagu, Bogdan-Constantin
    MATHEMATICS, 2022, 10 (15)