Parkinson忆s Disease Detection Model Based on Hierarchical Fusion of Multi-type Speech Information

被引:0
|
作者
Wu, Di [1 ]
Ji, Wei [1 ]
Zheng, Huifen [2 ]
Li, Yun [3 ]
机构
[1] School of Communications and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing,210003, China
[2] Geriatric Hospital of Nanjing Medical University, Nanjing,210009, China
[3] School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing,210023, China
关键词
Contrastive Learning - Data integration - Deep learning - Speech enhancement - Speech recognition;
D O I
10.16451/j.cnki.issn1003-6059.202409005
中图分类号
学科分类号
摘要
Speech data for Parkinson' s disease detection typically includes sustained vowels, repeated syllables and contextual dialogues. Most of the existing models adopt a single type of speech data as input, making them susceptible to noise interference and a lack of robustness. The current challenge of Parkinson's disease detection is effectively integrating different types of speech data and extracting critical pathological information. In this paper, a Parkinson's disease detection method based on hierarchical fusion of multi-type speech information is proposed, aiming to extract rich and comprehensive pathological information and achieve better detection performance. Firstly, various acoustic features are extracted for different types of Parkinson's disease speech data. Then, a representation learning scheme is designed to mine deep information from multiple types of acoustic features. The underlying pathological information in acoustic features is reflected more accurately by extracting articulation and rhythm information. Furthermore, a decoupled representation learning space is designed for two mentioned types of information above to extract their respective private features, while learning their shared representation simultaneously. Finally, a cross-type attention hierarchical fusion module is designed to progressively fuse shared and private representations using cross-attention mechanisms at different granularities, aiming to enhance Parkinson's disease detection performance. Experiments on publicly available Italian Parkinson's disease speech dataset and a self-collected Chinese Parkinson's disease speech dataset demonstrate the accuracy improvement of the proposed approach. © 2024 Science Press. All rights reserved.
引用
收藏
页码:811 / 823
相关论文
共 50 条
  • [1] A multi-type vulnerability detection framework with parallel perspective fusion and hierarchical feature enhancement
    Kong, Lingdi
    Luo, Senlin
    Pan, Limin
    Wu, Zhouting
    Li, Xinshuai
    COMPUTERS & SECURITY, 2024, 140
  • [2] Envelope multi-type transformation ensemble algorithm of Parkinson speech samples
    Li, Yongming
    Liu, Chengyu
    Wang, Pin
    Zhang, Hehua
    Wei, Anhai
    Zhang, Yanling
    APPLIED INTELLIGENCE, 2023, 53 (12) : 15957 - 15978
  • [3] Envelope multi-type transformation ensemble algorithm of Parkinson speech samples
    Yongming Li
    Chengyu Liu
    Pin Wang
    Hehua Zhang
    Anhai Wei
    Yanling Zhang
    Applied Intelligence, 2023, 53 : 15957 - 15978
  • [4] Multi-type features separating fusion learning for Speech Emotion Recognition
    Xu, Xinlei
    Li, Dongdong
    Zhou, Yijun
    Wang, Zhe
    APPLIED SOFT COMPUTING, 2022, 130
  • [5] FuseLGNet: Fusion of Local and Global Information for Detection of Parkinson's Disease
    Chen, Ming
    Ren, Tao
    Sun, Pihai
    Wu, Jianfei
    Zhang, Jinfeng
    Zhao, Aite
    INFORMATION, 2023, 14 (02)
  • [6] Fusion mode of multi-type scientific and technological information and its application
    曾文
    LIU Xiaolin
    MA Hongyan
    High Technology Letters, 2024, 30 (04) : 433 - 440
  • [7] Fusion mode of multi-type scientific and technological information and its application
    Zeng, Wen
    Liu, Xiaolin
    Ma, Hongyan
    High Technology Letters, 2024, 30 (04) : 433 - 440
  • [8] Multi-type Feature Mining and Fusion Model for Temporal Prediction
    Lu, Wei
    Hu, Yan
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [9] The Detection of Parkinson's Disease From Speech Using Voice Source Information
    Narendra, N. P.
    Schuller, Bjorn
    Alku, Paavo
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 (29) : 1925 - 1936
  • [10] A Bilingual Multi-type Spam Detection Model Based on M-BERT
    Cao, Jie
    Lai, Chengzhe
    2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,