Comparing Humans and Automatic Speech Recognition Systems in Recognizing Dysarthric Speech

Cited by: 0

Authors:

Mengistu, Kinfe Tadesse [1]

Rudzicz, Frank [1]

Affiliations:

[1] Univ Toronto, Dept Comp Sci, Toronto, ON, Canada

Source:

ADVANCES IN ARTIFICIAL INTELLIGENCE | 2011 / Vol. 6657

Keywords:

speech recognition; dysarthric speech; intelligibility

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Speech is a complex process that requires control and coordination of articulation, breathing, voicing, and prosody. Dysarthria is a manifestation of an inability to control and coordinate one or more of these aspects, which results in poorly articulated and hardly intelligible speech. Hence, individuals with dysarthria are rarely understood by human listeners. In this paper, we compare and evaluate how well dysarthric speech can be recognized by an automatic speech recognition (ASR) system and by naive adult human listeners. The results show that despite the encouraging performance of ASR systems, and contrary to the claims in other studies, on average human listeners perform better in recognizing single-word dysarthric speech. In particular, the mean word recognition accuracy of speaker-adapted monophone ASR systems on stimuli produced by six dysarthric speakers is 68.39%, while the mean percentage of correct responses of 14 naive human listeners on the same speech is 79.78%, as evaluated using a single-word multiple-choice intelligibility test.

Pages: 291-300

Page count: 10
