Predicting Automatic Speech Recognition Performance over Communication Channels from Instrumental Speech Quality and Intelligibility Scores

被引:9
|
作者
Gallardo, Laura Fernandez [1 ]
Moeller, Sebastian [1 ]
Beerends, John [2 ]
机构
[1] TU Berlin, Qual & Usabil Lab, Telekom Innovat Labs, Berlin, Germany
[2] TNO, The Hague, Netherlands
关键词
automatic speech recognition; speech intelligibility; instrumental speech quality; communication channels; ITU-T STANDARD; ASSESSMENT POLQA;
D O I
10.21437/Interspeech.2017-36
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The performance of automatic speech recognition based on coded-decoded speech heavily depends on the quality of the transmitted signals, determined by channel impairments. This paper examines relationships between speech recognition performance and measurements of speech quality and intelligibility over transmission channels. Different to previous studies, the effects of super-wideband transmissions are analyzed and compared to those of wideband and narrowband channels. Furthermore, intelligibility scores. gathered by conducting a listening test based on logatomes. are also considered for the prediction of automatic speech recognition results. The modern instrumental measurement techniques POLQA and POLQA-based intelligibility have been respectively applied to estimate the quality and the intelligibility of transmitted speech. Based on our results. polynomial models are proposed that permit the prediction of speech recognition accuracy from the subjective and instrumental measures. involving a number of channel distortions in the three bandwidths. This approach can save the costs of performing automatic speech recognition experiments and can be seen as a first step towards a useful tool for communication channel designers.
引用
收藏
页码:2939 / 2943
页数:5
相关论文
共 50 条
  • [1] Harmonicity based dereverberation for improving automatic speech recognition performance and speech intelligibility
    Kinoshita, K
    Nakatani, T
    Miyoshi, M
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (07) : 1724 - 1731
  • [2] Autonomous measurement of speech intelligibility utilizing automatic speech recognition
    Meyer, Bernd T.
    Kollmeier, Birger
    Ooster, Jasper
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2982 - 2986
  • [3] Using Automatic Speech Recognition to Measure the Intelligibility of Speech Synthesized from Brain Signals
    Varshney, Suvi
    Farias, Dana
    Brandman, David M.
    Stavisky, Sergey D.
    Miller, Lee M.
    2023 11TH INTERNATIONAL IEEE/EMBS CONFERENCE ON NEURAL ENGINEERING, NER, 2023,
  • [4] The use of automatic speech recognition showing the influence of nasality on speech intelligibility
    S. Mayr
    K. Burkhardt
    M. Schuster
    K. Rogler
    A. Maier
    H. Iro
    European Archives of Oto-Rhino-Laryngology, 2010, 267 : 1719 - 1725
  • [5] The use of automatic speech recognition showing the influence of nasality on speech intelligibility
    Mayr, S.
    Burkhardt, K.
    Schuster, M.
    Rogler, K.
    Maier, A.
    Iro, H.
    EUROPEAN ARCHIVES OF OTO-RHINO-LARYNGOLOGY, 2010, 267 (11) : 1719 - 1725
  • [6] Intelligibility of laryngectomees' substitute speech:: automatic speech recognition and subjective rating
    Schuster, M
    Haderlein, T
    Nöth, E
    Lohscheller, J
    Eysholdt, U
    Rosanowski, F
    EUROPEAN ARCHIVES OF OTO-RHINO-LARYNGOLOGY, 2006, 263 (02) : 188 - 193
  • [7] Intelligibility of laryngectomees’ substitute speech: automatic speech recognition and subjective rating
    Maria Schuster
    Tino Haderlein
    Elmar Nöth
    Jörg Lohscheller
    Ulrich Eysholdt
    Frank Rosanowski
    European Archives of Oto-Rhino-Laryngology and Head & Neck, 2006, 263 : 188 - 193
  • [8] Computing scores of voice quality and speech intelligibility in tracheoesophageal speech for speech stimuli of varying lengths
    Clapham, Renee P.
    Martens, Jean-Pierre
    van Son, Rob J. J. H.
    Hilgers, Frans J. M.
    van den Brekel, Michiel M. W.
    Middag, Catherine
    COMPUTER SPEECH AND LANGUAGE, 2016, 37 : 1 - 10
  • [9] Assessing Automatic Speech Recognition in measuring speech intelligibility: A study of Malay speakers with speech impairments
    Rosdi, Fadhilah
    Mustafa, Mumtaz Begum
    Salim, Siti Salwah
    PROCEEDINGS OF THE 2017 6TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS (ICEEI'17), 2017,
  • [10] AN ASSESSMENT OF AUTOMATIC SPEECH RECOGNITION AS SPEECH INTELLIGIBILITY ESTIMATION IN THE CONTEXT OF ADDITIVE NOISE
    Liu, Wei M.
    Mason, John S. D.
    Evans, Nicholas W. D.
    Jellyman, Keith A.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2166 - 2169