Predicting Automatic Speech Recognition Performance over Communication Channels from Instrumental Speech Quality and Intelligibility Scores

Cited by: 9
Authors
Gallardo, Laura Fernandez [1]
Moeller, Sebastian [1]
Beerends, John [2]
Affiliations
[1] TU Berlin, Qual & Usabil Lab, Telekom Innovat Labs, Berlin, Germany
[2] TNO, The Hague, Netherlands
Keywords
automatic speech recognition; speech intelligibility; instrumental speech quality; communication channels; ITU-T STANDARD; ASSESSMENT POLQA
DOI
10.21437/Interspeech.2017-36
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The performance of automatic speech recognition based on coded-decoded speech heavily depends on the quality of the transmitted signals, determined by channel impairments. This paper examines relationships between speech recognition performance and measurements of speech quality and intelligibility over transmission channels. Different to previous studies, the effects of super-wideband transmissions are analyzed and compared to those of wideband and narrowband channels. Furthermore, intelligibility scores, gathered by conducting a listening test based on logatomes, are also considered for the prediction of automatic speech recognition results. The modern instrumental measurement techniques POLQA and POLQA-based intelligibility have been respectively applied to estimate the quality and the intelligibility of transmitted speech. Based on our results, polynomial models are proposed that permit the prediction of speech recognition accuracy from the subjective and instrumental measures, involving a number of channel distortions in the three bandwidths. This approach can save the costs of performing automatic speech recognition experiments and can be seen as a first step towards a useful tool for communication channel designers.
Pages: 2939-2943
Number of pages: 5
Related papers
50 in total
  • [31] Should WebRTC Prioritise Intelligibility over Speech Quality?
    Sun, Pheobe Wenyi
    Hines, Andrew
    2020 31ST IRISH SIGNALS AND SYSTEMS CONFERENCE (ISSC), 2020, : 312 - 317
  • [32] A Study on Speech Coders for Automatic Speech Recognition in Adverse Communication Environments
    Choi, Seung Ho
    INFORMATICS ENGINEERING AND INFORMATION SCIENCE, PT II, 2011, 252 : 67 - 75
  • [33] Children's speech recognition scores: The speech intelligibility index and proficiency factors for age and hearing level
    Scollie, Susan D.
    EAR AND HEARING, 2008, 29 (04): : 543 - 556
  • [34] Predicting Speech Recognition Using the Speech Intelligibility Index and Other Variables for Cochlear Implant Users
    Lee, Sungmin
    Mendel, Lisa Lucks
    Bidelman, Gavin M.
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2019, 62 (05): : 1517 - 1531
  • [35] AUTOMATIC SPEECH RECOGNITION IN INTELLIGENCE COMMUNICATION.
    Datta, A.K.
    Ganguli, N.R.
    Journal of the Institution of Electronics and Telecommunication Engineers, 1980, 26 (01): : 82 - 84
  • [36] Predicting Intelligibility Scores of Children with Dysarthria and Cerebral Palsy from Phonetic Measures of Speech Accuracy
    Hodge, Megan M.
    Brown, Candace
    Kuzyk, Taryn
    JOURNAL OF MEDICAL SPEECH-LANGUAGE PATHOLOGY, 2012, 20 (04) : 41 - 46
  • [37] Investigation of speech recognition over IP channels
    Van Sciver, J
    Ma, JZ
    Vanpoucke, F
    Van Hamme, H
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 3812 - 3815
  • [38] Matrix sentence intelligibility prediction using an automatic speech recognition system
    Schaedler, Marc Rene
    Warzybok, Anna
    Hochmuth, Sabine
    Kollmeier, Birger
    INTERNATIONAL JOURNAL OF AUDIOLOGY, 2015, 54 : 100 - 107
  • [39] Predicting Word Accuracy for the Automatic Speech Recognition of Non-Native Speech
    Yoon, Su-Youn
    Chen, Lei
    Zechner, Klaus
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 773 - 776
  • [40] A model predicting the effect of speech of varying intelligibility on work performance
    Hongisto, V
    INDOOR AIR, 2005, 15 (06) : 458 - 468