Modelling Errors in Automatic Speech Recognition for Dysarthric Speakers

Authors:

Santiago Omar Caballero Morales

Stephen J. Cox

Affiliation:

[1] University of East Anglia, Speech, Language, and Music Group, School of Computing Sciences

Source:

EURASIP Journal on Advances in Signal Processing, vol. 2009

Keywords:

Recognition Accuracy; Confusion Matrix; Automatic Speech Recognition; Acoustic Model; Speech Disorder

Abstract:

Dysarthria is a motor speech disorder characterized by weakness, paralysis, or poor coordination of the muscles responsible for speech. Although automatic speech recognition (ASR) systems have been developed for disordered speech, factors such as low intelligibility and limited phonemic repertoire decrease speech recognition accuracy, making conventional speaker adaptation algorithms perform poorly on dysarthric speakers. In this work, rather than adapting the acoustic models, we model the errors made by the speaker and attempt to correct them. For this task, two techniques have been developed: (1) a set of "metamodels" that incorporate a model of the speaker's phonetic confusion matrix into the ASR process; (2) a cascade of weighted finite-state transducers at the confusion matrix, word, and language levels. Both techniques attempt to correct the errors made at the phonetic level and make use of a language model to find the best estimate of the correct word sequence. Our experiments show that both techniques outperform standard adaptation techniques.
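
The general idea of correcting errors through a speaker's phonetic confusion matrix can be illustrated with a minimal sketch. This is not the authors' metamodel or transducer implementation: it assumes substitution errors only (no insertions or deletions), a toy phoneme set, a two-word lexicon, and uniform word priors standing in for a language model. The function names `estimate_confusions` and `best_word`, and all data values, are hypothetical.

```python
from collections import Counter
import math


def estimate_confusions(pairs, phonemes, alpha=1.0):
    """Estimate P(recognized | intended) from aligned phoneme pairs.

    Add-alpha smoothing keeps unseen confusions from getting zero probability.
    """
    counts = Counter(pairs)
    probs = {}
    for intended in phonemes:
        total = sum(counts[(intended, r)] for r in phonemes) + alpha * len(phonemes)
        for recognized in phonemes:
            probs[(intended, recognized)] = (counts[(intended, recognized)] + alpha) / total
    return probs


def best_word(recognized, lexicon, priors, probs):
    """Pick the word maximizing log P(word) + sum of log P(recognized_i | intended_i).

    Assumption: only same-length pronunciations are compared (substitutions only).
    """
    best, best_score = None, -math.inf
    for word, pronunciation in lexicon.items():
        if len(pronunciation) != len(recognized):
            continue
        score = math.log(priors[word])
        score += sum(math.log(probs[(i, r)]) for i, r in zip(pronunciation, recognized))
        if score > best_score:
            best, best_score = word, score
    return best


# Toy data (hypothetical): this speaker's intended "b" is usually recognized as "p".
phonemes = ["b", "p", "a", "t", "d"]
pairs = (
    [("b", "p")] * 8 + [("b", "b")] * 2
    + [("p", "p")] * 10 + [("a", "a")] * 10
    + [("t", "t")] * 10 + [("d", "d")] * 10
)
probs = estimate_confusions(pairs, phonemes)

lexicon = {"bat": ["b", "a", "t"], "pad": ["p", "a", "d"]}
priors = {"bat": 0.5, "pad": 0.5}  # stand-in for a language model

corrected = best_word(["p", "a", "t"], lexicon, priors, probs)
```

With this toy data, the recognized sequence "p a t" matches no lexicon entry exactly, and the learned "b" to "p" confusion makes "bat" the more likely intended word than "pad". A fuller treatment would also model insertions and deletions and would replace the uniform priors with a word-sequence language model, as the abstract describes.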
