Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech

被引：12

作者：

Korzekwa, Daniel ^{[1
]}

Barra-Chicote, Roberto ^{[1
]}

Kostek, Bozena ^{[2
]}

Drugman, Thomas ^{[1
]}

Lajszczak, Mateusz ^{[1
]}

机构：

[1] Amazon TTS Res, Cambridge, England

[2] Gdansk Univ Technol, Fac ETI, Gdansk, Poland

来源：

INTERSPEECH 2019 | 2019年

关键词：

dysarthria detection; speech recognition; speech synthesis; interpretable deep learning models;

D O I：

10.21437/Interspeech.2019-1206

中图分类号：

R36 [病理学]; R76 [耳鼻咽喉科学];

学科分类号：

100104 ; 100213 ;

摘要：

We present a novel deep learning model for the detection and reconstruction of dysarthric speech. We train the model with a multi-task learning technique to jointly solve dysarthria detection and speech reconstruction tasks. The model key feature is a low-dimensional latent space that is meant to encode the properties of dysarthric speech. It is commonly believed that neural networks are black boxes that solve problems but do not provide interpretable outputs. On the contrary, we show that this latent space successfully encodes interpretable characteristics of dysarthria, is effective at detecting dysarthria, and that manipulation of the latent space allows the model to reconstruct healthy speech from dysarthric speech. This work can help patients and speech pathologists to improve their understanding of the condition, lead to more accurate diagnoses and aid in reconstructing healthy speech for afflicted patients.

引用

页码：3890 / 3894

页数：5

共 50 条

[41] Deep Learning for Hate Speech Detection in Tweets
Badjatiya, Pinkesh
Gupta, Shashank
Gupta, Manish
Varma, Vasudeva
WWW'17 COMPANION: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2017, : 759 - 760
[42] Deep Learning Ensembles for Hate Speech Detection
Alsafari, Safa
Sadaoui, Samira
Mouhoub, Malek
2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 526 - 531
[43] A Speech Command Control-Based Recognition System for Dysarthric Patients Based on Deep Learning Technology
Lin, Yu-Yi
Zheng, Wei-Zhong
Chu, Wei Chung
Han, Ji-Yan
Hung, Ying-Hsiu
Ho, Guan-Min
Chang, Chia-Yuan
Lai, Ying-Hui
APPLIED SCIENCES-BASEL, 2021, 11 (06):
[44] An interpretable deep learning model to map land subsidence hazard
Rahmani, Paria
Gholami, Hamid
Golzari, Shahram
ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2024, 31 (11) : 17372 - 17386
[45] Bayesian deep learning: A model-based interpretable approach
Matsubara, Takashi
IEICE NONLINEAR THEORY AND ITS APPLICATIONS, 2020, 11 (01): : 16 - 35
[46] Interpretable Deep Learning Prediction Model for Compressive Strength of Concrete
Zhang, Wei-Qi
Wang, Hui-Ming
Dongbei Daxue Xuebao/Journal of Northeastern University, 2024, 45 (05): : 738 - 744
[47] AN INTERPRETABLE DEEP LEARNING MODEL TO PREDICT SYMPTOMATIC KNEE OSTEOARTHRITIS
Zokaeinikoo, M.
Li, X.
Yang, M.
OSTEOARTHRITIS AND CARTILAGE, 2021, 29 : S354 - S354
[48] AFM signal model for dysarthric speech classification using speech biomarkers
Shabber, Shaik Mulla
Sumesh, Eratt Parameswaran
FRONTIERS IN HUMAN NEUROSCIENCE, 2024, 18
[49] Deep PLS: A Lightweight Deep Learning Model for Interpretable and Efficient Data Analytics
Kong, Xiangyin
Ge, Zhiqiang
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 8923 - 8937
[50] AudioProtoPNet: An interpretable deep learning model for bird sound classification
Heinrich, Rene
Rauch, Lukas
Sick, Bernhard
Scholz, Christoph
ECOLOGICAL INFORMATICS, 2025, 87

← 1 2 3 4 5 →