Evaluating Automatic Speech Recognition for Child Speech Therapy Applications

Cited by: 3
Authors
Hair, Adam [1]
Ballard, Kirrie J. [2]
Ahmed, Beena [3, 4]
Gutierrez-Osuna, Ricardo [1]
Affiliations
[1] Texas A&M Univ, College Stn, TX 77843 USA
[2] Univ Sydney, Sydney, NSW, Australia
[3] Univ New South Wales, Sydney, NSW, Australia
[4] Texas A&M Univ Qatar, Doha, Qatar
Keywords
Assistive Technology; Computer-Assisted Pronunciation Training (CAPT);
DOI
10.1145/3308561.3354606
Chinese Library Classification (CLC) number
TP301 [Theory and Methods];
Discipline classification code
081202;
Abstract
Automatic speech recognition (ASR) technology can be a useful tool in mobile apps for child speech therapy, empowering children to complete their practice with limited caregiver supervision. However, little is known about the feasibility of performing ASR on mobile devices, particularly when training data is limited. In this study, we investigated the performance of two low-resource ASR systems on disordered speech from children. We compared the open-source PocketSphinx (PS) recognizer using adapted acoustic models and a custom template-matching (TM) recognizer. TM and the adapted models significantly outperformed the default PS model. On average, maximum likelihood linear regression and maximum a posteriori adaptation increased PS accuracy from 59.4% to 63.8% and 80.0%, respectively, suggesting that the models successfully captured speaker-specific word production variations. TM reached a mean accuracy of 75.8%.
Pages: 578-580
Page count: 3