Comparing Humans and Automatic Speech Recognition Systems in Recognizing Dysarthric Speech

被引:0
|
作者
Mengistu, Kinfe Tadesse [1 ]
Rudzicz, Frank [1 ]
机构
[1] Univ Toronto, Dept Comp Sci, Toronto, ON, Canada
来源
ADVANCES IN ARTIFICIAL INTELLIGENCE | 2011年 / 6657卷
关键词
speech recognition; dysarthric speech; intelligibility; INTELLIGIBILITY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech is a complex process that requires control and coordination of articulation, breathing, voicing, and prosody. Dysarthria is a manifestation of an inability to control and coordinate one or more of these aspects, which results in poorly articulated and hardly intelligible speech. Hence individuals with dysarthria are rarely understood by human listeners. In this paper, we compare and evaluate how well dysarthric speech can be recognized by an automatic speech recognition system (ASR) and naive adult human listeners. The results show that despite the encouraging performance of ASR systems, and contrary to the claims in other studies, on average human listeners perform better in recognizing single-word dysarthric speech. In particular, the mean word recognition accuracy of speaker-adapted monophone ASR systems on stimuli produced by six dysarthric speakers is 68.39% while the mean percentage correct response of 14 naive human listeners on the same speech is 79.78% as evaluated using single-word multiple-choice intelligibility test.
引用
收藏
页码:291 / 300
页数:10
相关论文
共 50 条
  • [1] A Survey of Automatic Speech Recognition for Dysarthric Speech
    Qian, Zhaopeng
    Xiao, Kejing
    ELECTRONICS, 2023, 12 (20)
  • [2] Evaluation of an Automatic Speech Recognition Platform for Dysarthric Speech
    Calvo, Irene
    Tropea, Peppino
    Vigano, Mauro
    Scialla, Maria
    Cavalcante, Agnieszka B.
    Grajzer, Monika
    Gilardone, Marco
    Corbo, Massimo
    FOLIA PHONIATRICA ET LOGOPAEDICA, 2021, 73 (05) : 432 - 441
  • [3] Automatic recognition of Arabic dysarthric speech
    Tolba, Hesham M.
    El-Torgoman, Ahmed S.
    AEJ - Alexandria Engineering Journal, 2010, 49 (02): : 131 - 138
  • [4] Speech Technology for Automatic Recognition and Assessment of Dysarthric Speech: An Overview
    Bhat, Chitralekha
    Strik, Helmer
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2025, 68 (02): : 547 - 577
  • [5] A survey of technologies for automatic Dysarthric speech recognition
    Qian, Zhaopeng
    Xiao, Kejing
    Yu, Chongchong
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2023, 2023 (01)
  • [6] A survey of technologies for automatic Dysarthric speech recognition
    Zhaopeng Qian
    Kejing Xiao
    Chongchong Yu
    EURASIP Journal on Audio, Speech, and Music Processing, 2023
  • [7] Towards the Improvement of Automatic Recognition of Dysarthric Speech
    Tolba, Hesham
    EL Torgoman, Ahmed S.
    2009 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, VOL 1, 2009, : 277 - +
  • [8] Interface of an Automatic Recognition System for Dysarthric Speech
    Zaidi, Brahim-Fares
    Boudraa, Malika
    Selouani, Sid-Ahmed
    Addou, Djamel
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (09) : 560 - 564
  • [9] Improved Acoustic Modeling for Automatic Dysarthric Speech Recognition
    Sriranjani, R.
    Reddy, M. Ramasubba
    Umesh, S.
    2015 TWENTY FIRST NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2015,
  • [10] Modelling Errors in Automatic Speech Recognition for Dysarthric Speakers
    Santiago Omar Caballero Morales
    Stephen J. Cox
    EURASIP Journal on Advances in Signal Processing, 2009