An Unsupervised Two-Step Convolution Sparse Transfer Learning Algorithm for Parkinson's Disease Speech Diagnosis

被引:0
|
作者
Zhang X.-H. [1 ,2 ]
Zhang X.-Y. [1 ]
Li Y.-M. [1 ]
Wang P. [1 ]
Liu Y.-C. [1 ]
机构
[1] College of Communication Engineering, Chongqing University, Chongqing
[2] Chongqing Radio & TV University, Chongqing
来源
关键词
Convolutional sparse coding transfer learning; Domain adaptation; Parkinson's disease(PD); Speech diagnosis; Two-step sparse transfer learning;
D O I
10.12263/DZXB.20201003
中图分类号
学科分类号
摘要
Parkinson's disease(PD) speech diagnosis has a small sample problem. Although it is possible to transfer learning with the help of relevant speech datasets. The introduction of other samples will lead to the distribution difference between samples of different subjects, so the classification accuracy is greatly affected. Therefore, in this paper, to solve the problems above, we propose a novel unsupervised two-step convolutional sparse transfer leaning algorithm. The algorithm is divided into two steps: fast convolutional sparse coding with coordinate selection of samples and features(FCSC&SF), joint local structure distribution alignment(JLSDA). In the FCSC&SF, speech structure among public speech dataset is quickly learned by fast convolution sparse coding(FCSC), and transferred into the target dataset, after that, the more valuable information is obtained by coordinate selection of samples and features. JLSDA is designed to maintain the local structure information in the two domains, and reduce the distribution difference between the two domains at the same time. The experimental results showed that each step of the proposed algorithm has a positive effect on the classification results; compared with the representative relevant algorithms, the accuracy of the proposed method is significantly higher at 97.5%. © 2022, Chinese Institute of Electronics. All right reserved.
引用
收藏
页码:177 / 184
页数:7
相关论文
共 28 条
  • [21] CAI X J, GU G Y, HE B S, Et al., A proximal point algorithm revisit on the alternating direction method of multipliers, Science China Mathematics, 56, 10, pp. 2179-2186, (2013)
  • [22] HE X, NIYOGI P., Locality preserving projections, Proceedings of Conference on Advances in Neural Information Processing Systems(NIPS), pp. 153-160, (2004)
  • [23] CANTURK I, KARABIBER F., A machine learning system for the diagnosis of Parkinson's disease from speech signals and its application to multiple speech signal types, Arabian Journal for Science and Engineering, 41, 12, pp. 5049-5059, (2016)
  • [24] ZHANG H H, YANG L, LIU Y, Et al., Classification of Parkinson's disease utilizing multi-edit nearest-neighbor and ensemble learning algorithms with speech samples, Biomedical Engineering Online, 15, 1, pp. 122-143, (2016)
  • [25] LI Y M, ZHANG C, JIA Y J, Et al., Simultaneous learning of speech feature and segment for classification of Parkinson disease, 2017 IEEE 19th International Conference on e-Health Networking, Applications and Services (Healthcom), pp. 1-6, (2017)
  • [26] BENBA A, JILBAB A, HAMMOUCH A., Using human factor cepstral coefficient on multiple types of voice recordings for detecting patients with Parkinson's disease, IRBM, 38, 6, pp. 346-351, (2017)
  • [27] BENBA A, JILBAB A, HAMMOUCH A., Analysis of multiple types of voice recordings in cepstral domain using MFCC for discriminating between patients with Parkinson's disease and healthy people, International Journal of Speech Technology, 19, 3, pp. 449-456, (2016)
  • [28] ALI L, ZHU C, ZHANG Z H, Et al., Automated detection of Parkinson's disease based on multiple types of sustained phonations using linear discriminant analysis and genetically optimized neural network, IEEE Journal of Translational Engineering in Health and Medicine, 7, pp. 1-10, (2019)