Parkinson忆s Disease Detection Model Based on Hierarchical Fusion of Multi-type Speech Information

被引:0
|
作者
Wu, Di [1 ]
Ji, Wei [1 ]
Zheng, Huifen [2 ]
Li, Yun [3 ]
机构
[1] School of Communications and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing,210003, China
[2] Geriatric Hospital of Nanjing Medical University, Nanjing,210009, China
[3] School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing,210023, China
关键词
Contrastive Learning - Data integration - Deep learning - Speech enhancement - Speech recognition;
D O I
10.16451/j.cnki.issn1003-6059.202409005
中图分类号
学科分类号
摘要
Speech data for Parkinson' s disease detection typically includes sustained vowels, repeated syllables and contextual dialogues. Most of the existing models adopt a single type of speech data as input, making them susceptible to noise interference and a lack of robustness. The current challenge of Parkinson's disease detection is effectively integrating different types of speech data and extracting critical pathological information. In this paper, a Parkinson's disease detection method based on hierarchical fusion of multi-type speech information is proposed, aiming to extract rich and comprehensive pathological information and achieve better detection performance. Firstly, various acoustic features are extracted for different types of Parkinson's disease speech data. Then, a representation learning scheme is designed to mine deep information from multiple types of acoustic features. The underlying pathological information in acoustic features is reflected more accurately by extracting articulation and rhythm information. Furthermore, a decoupled representation learning space is designed for two mentioned types of information above to extract their respective private features, while learning their shared representation simultaneously. Finally, a cross-type attention hierarchical fusion module is designed to progressively fuse shared and private representations using cross-attention mechanisms at different granularities, aiming to enhance Parkinson's disease detection performance. Experiments on publicly available Italian Parkinson's disease speech dataset and a self-collected Chinese Parkinson's disease speech dataset demonstrate the accuracy improvement of the proposed approach. © 2024 Science Press. All rights reserved.
引用
收藏
页码:811 / 823
相关论文
共 50 条
  • [31] Multi-hierarchical blackboard model for communication intercept information fusion
    Xu, C.F., 2001, Chinese Institute of Electronics (29):
  • [32] Exemplar-Based Sparse Representations for Detection of Parkinson's Disease From Speech
    Reddy, Mittapalle Kiran
    Alku, Paavo
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 1386 - 1396
  • [33] Detection Possibilities of Depression and Parkinson's disease Based on the Ratio of Transient Parts of the Speech
    Kiss, Gabor
    Takacs, Artur Bendeguz
    Sztaho, David
    Vicsi, Klara
    2018 9TH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2018, : 165 - 168
  • [34] Addressing smartphone mismatch in Parkinson's disease detection aid systems based on speech
    Madruga, Mario
    Campos-Roca, Yolanda
    Perez, Carlos J.
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 80
  • [35] Parkinson's Disease Detection Based on Running Speech Data From Phone Calls
    Laganas, Christos
    Iakovakis, Dimitrios
    Hadjidimitriou, Stelios
    Charisis, Vasileios
    Dias, Sofia B.
    Bostantzopoulou, Sevasti
    Katsarou, Zoe
    Klingelhoefer, Lisa
    Reichmann, Heinz
    Trivedi, Dhaval
    Chaudhuri, K. Ray
    Hadjileontiadis, Leontios J.
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2022, 69 (05) : 1573 - 1584
  • [36] Empower rumor events detection from Chinese microblogs with multi-type individual information
    Wang, Zhihong
    Guo, Yi
    KNOWLEDGE AND INFORMATION SYSTEMS, 2020, 62 (09) : 3585 - 3614
  • [37] Multi-task Learning Based on Multi-type Dataset for Retinal Abnormality Detection
    Zhao, Linna
    Li, Jianqiang
    Ma, Zerui
    Guan, Yu
    Xu, Xi
    Wang, Xiaoxi
    Li, Li
    2021 IEEE INTERNATIONAL CONFERENCE ON DIGITAL HEALTH (ICDH 2021), 2021, : 160 - 165
  • [38] Empower rumor events detection from Chinese microblogs with multi-type individual information
    Zhihong Wang
    Yi Guo
    Knowledge and Information Systems, 2020, 62 : 3585 - 3614
  • [39] Detection of Speech Impairments in Parkinson Disease Using Handcrafted Feature-Based Model on Spanish Speech Corpus
    Zahid, Laiba
    Maqsood, Muazzam
    Farooq, Sehar Shahzad
    Aadil, Farhan
    Mehmood, Irfan
    Fiaz, Mustansar
    Jung, Soon Ki
    FRONTIERS OF COMPUTER VISION, 2020, 1212 : 54 - 65
  • [40] Defect detection on multi-type rail surfaces via IoU decoupling and multi-information alignment
    Ni, Xuefeng
    Fieguth, Paul W.
    Ma, Ziji
    Shi, Bo
    Liu, Hongli
    ADVANCED ENGINEERING INFORMATICS, 2024, 62