Evaluating Automatic Speech Recognition for Child Speech Therapy Applications

被引:3
|
作者
Hair, Adam [1 ]
Ballard, Kirrie J. [2 ]
Ahmed, Beena [3 ,4 ]
Gutierrez-Osuna, Ricardo [1 ]
机构
[1] Texas A&M Univ, College Stn, TX 77843 USA
[2] Univ Sydney, Sydney, NSW, Australia
[3] Univ New South Wales, Sydney, NSW, Australia
[4] Texas A&M Univ Qatar, Doha, Qatar
关键词
Assistive Technology; Computer-Assisted Pronunciation Training (CAPT);
D O I
10.1145/3308561.3354606
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Automatic speech recognition (ASR) technology can be a useful tool in mobile apps for child speech therapy, empowering children to complete their practice with limited caregiver supervision. However, little is known about the feasibility of performing ASR on mobile devices, particularly when training data is limited. In this study, we investigated the performance of two low-resource ASR systems on disordered speech from children. We compared the open-source PocketSphinx (PS) recognizer using adapted acoustic models and a custom template-matching (TM) recognizer. TM and the adapted models significantly out-perform the default PS model. On average, maximum likelihood linear regression and maximum a posteriori adaptation increased PS accuracy from 59.4% to 63.8% and 80.0%, respectively, suggesting that the models successfully captured speaker-specific word production variations. TM reached a mean accuracy of 75.8%.
引用
收藏
页码:578 / 580
页数:3
相关论文
共 50 条
  • [1] Evaluating and Improving Child-Directed Automatic Speech Recognition
    Booth, Eric
    Carns, Jake
    Kennington, Casey
    Rafla, Nader
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6340 - 6345
  • [2] EVALUATING VAD FOR AUTOMATIC SPEECH RECOGNITION
    Tong, Sibo
    Chen, Nanxin
    Qian, Yanmin
    Yu, Kai
    [J]. 2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 2308 - 2314
  • [3] PRINCIPLES AND APPLICATIONS OF AUTOMATIC SPEECH RECOGNITION
    KLUGMANN, D
    DREISBACH, B
    GNETTNER, W
    [J]. SIEMENS FORSCHUNGS-UND ENTWICKLUNGSBERICHTE-SIEMENS RESEARCH AND DEVELOPMENT REPORTS, 1981, 10 (05): : 316 - 322
  • [4] Automatic speech recognition and its applications
    Levitt, H
    [J]. ISSUES UNRESOLVED: NEW PERSPECTIVES ON LANGUAGE AND DEAF EDUCATION, 1998, : 133 - 138
  • [5] Generating and evaluating segmentations for automatic speech recognition of conversational telephone speech
    Tranter, SE
    Yu, K
    Evermann, G
    Woodland, RC
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 753 - 756
  • [6] CEASR: A Corpus for Evaluating Automatic Speech Recognition
    Ulasik, Malgorzata Anna
    Huerlimann, Manuela
    Germann, Fabian
    Gedik, Esin
    Benites, Fernando
    Cieliebak, Mark
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6477 - 6485
  • [7] Applications of automatic speech recognition to speech and language development in young children
    Russell, M
    Brown, C
    Skilling, A
    Series, R
    Wallace, J
    Bonham, B
    Barker, P
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 176 - 179
  • [8] Fundamental Frequency of Child-Directed Speech Using Automatic Speech Recognition
    VanDam, Mark
    De Palma, Paul
    [J]. 2014 JOINT 7TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS (SCIS) AND 15TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (ISIS), 2014, : 1349 - 1353
  • [9] Automatic Speech Recognition for speech therapy in a Parkinson's disease patient
    Nuzzi, A.
    [J]. MOVEMENT DISORDERS, 2022, 37 : S1 - S1
  • [10] Speech production and automatic speech recognition
    [J]. 2000, Inst of Acoustics, St. Albans, Engl (25):