Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech

被引:12
|
作者
Korzekwa, Daniel [1 ]
Barra-Chicote, Roberto [1 ]
Kostek, Bozena [2 ]
Drugman, Thomas [1 ]
Lajszczak, Mateusz [1 ]
机构
[1] Amazon TTS Res, Cambridge, England
[2] Gdansk Univ Technol, Fac ETI, Gdansk, Poland
来源
关键词
dysarthria detection; speech recognition; speech synthesis; interpretable deep learning models;
D O I
10.21437/Interspeech.2019-1206
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
We present a novel deep learning model for the detection and reconstruction of dysarthric speech. We train the model with a multi-task learning technique to jointly solve dysarthria detection and speech reconstruction tasks. The model key feature is a low-dimensional latent space that is meant to encode the properties of dysarthric speech. It is commonly believed that neural networks are black boxes that solve problems but do not provide interpretable outputs. On the contrary, we show that this latent space successfully encodes interpretable characteristics of dysarthria, is effective at detecting dysarthria, and that manipulation of the latent space allows the model to reconstruct healthy speech from dysarthric speech. This work can help patients and speech pathologists to improve their understanding of the condition, lead to more accurate diagnoses and aid in reconstructing healthy speech for afflicted patients.
引用
收藏
页码:3890 / 3894
页数:5
相关论文
共 50 条
  • [41] Deep Learning for Hate Speech Detection in Tweets
    Badjatiya, Pinkesh
    Gupta, Shashank
    Gupta, Manish
    Varma, Vasudeva
    WWW'17 COMPANION: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2017, : 759 - 760
  • [42] Deep Learning Ensembles for Hate Speech Detection
    Alsafari, Safa
    Sadaoui, Samira
    Mouhoub, Malek
    2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 526 - 531
  • [43] A Speech Command Control-Based Recognition System for Dysarthric Patients Based on Deep Learning Technology
    Lin, Yu-Yi
    Zheng, Wei-Zhong
    Chu, Wei Chung
    Han, Ji-Yan
    Hung, Ying-Hsiu
    Ho, Guan-Min
    Chang, Chia-Yuan
    Lai, Ying-Hui
    APPLIED SCIENCES-BASEL, 2021, 11 (06):
  • [44] An interpretable deep learning model to map land subsidence hazard
    Rahmani, Paria
    Gholami, Hamid
    Golzari, Shahram
    ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2024, 31 (11) : 17372 - 17386
  • [45] Bayesian deep learning: A model-based interpretable approach
    Matsubara, Takashi
    IEICE NONLINEAR THEORY AND ITS APPLICATIONS, 2020, 11 (01): : 16 - 35
  • [46] Interpretable Deep Learning Prediction Model for Compressive Strength of Concrete
    Zhang, Wei-Qi
    Wang, Hui-Ming
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2024, 45 (05): : 738 - 744
  • [47] AN INTERPRETABLE DEEP LEARNING MODEL TO PREDICT SYMPTOMATIC KNEE OSTEOARTHRITIS
    Zokaeinikoo, M.
    Li, X.
    Yang, M.
    OSTEOARTHRITIS AND CARTILAGE, 2021, 29 : S354 - S354
  • [48] AFM signal model for dysarthric speech classification using speech biomarkers
    Shabber, Shaik Mulla
    Sumesh, Eratt Parameswaran
    FRONTIERS IN HUMAN NEUROSCIENCE, 2024, 18
  • [49] Deep PLS: A Lightweight Deep Learning Model for Interpretable and Efficient Data Analytics
    Kong, Xiangyin
    Ge, Zhiqiang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 8923 - 8937
  • [50] AudioProtoPNet: An interpretable deep learning model for bird sound classification
    Heinrich, Rene
    Rauch, Lukas
    Sick, Bernhard
    Scholz, Christoph
    ECOLOGICAL INFORMATICS, 2025, 87