Matrix sentence intelligibility prediction using an automatic speech recognition system

Cited by: 48
Authors
Schaedler, Marc Rene
Warzybok, Anna
Hochmuth, Sabine
Kollmeier, Birger
Affiliations
[1] Carl von Ossietzky Univ Oldenburg, Med Phys, D-26111 Oldenburg, Germany
[2] Carl von Ossietzky Univ Oldenburg, Cluster Excellence Hearing4all, D-26111 Oldenburg, Germany
Keywords
Speech intelligibility predictions; SII; ASR; speech in noise; matrix test; STEADY BACKGROUND-NOISE; MODEL; HEARING; PERCEPTION; LISTENERS; INDEX; COMPRESSION; REFLECTIONS; THRESHOLD
DOI
10.3109/14992027.2015.1061708
Chinese Library Classification (CLC) number
R36 [Pathology]; R76 [Otorhinolaryngology]
Discipline classification code
100104; 100213
Abstract
Objective: The feasibility of predicting the outcome of the German matrix sentence test for different types of stationary background noise using an automatic speech recognition (ASR) system was studied.
Design: Speech reception thresholds (SRT) of 50% intelligibility were predicted in seven noise conditions. The ASR system used Mel-frequency cepstral coefficients as a front-end and employed whole-word Hidden Markov models on the back-end side. The ASR system was trained and tested with noisy matrix sentences on a broad range of signal-to-noise ratios.
Study sample: The ASR-based predictions were compared to data from the literature (Hochmuth et al., 2015) obtained with 10 native German listeners with normal hearing and to predictions of the speech intelligibility index (SII).
Results: The ASR-based predictions showed a high and significant correlation (R² = 0.95, p < 0.001) with the empirical data across different noise conditions, outperforming the SII-based predictions, which showed no correlation with the empirical data (R² = 0.00, p = 0.987).
Conclusions: The SRTs for the German matrix test for listeners with normal hearing in different stationary noise conditions could well be predicted based on the acoustical properties of the speech and noise signals. Minimum assumptions were made about human speech processing already incorporated in a reference-free ordinary ASR system.
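The abstract does not say how the 50% SRT is read off the ASR results. As a rough illustration only, one simple approach is to score word recognition at several signal-to-noise ratios and linearly interpolate the SNR at which the score crosses 50%. The function name `srt_from_scores`, the interpolation method, and the example numbers below are assumptions made for this sketch; they are not taken from the paper.

```python
from typing import Sequence


def srt_from_scores(
    snrs_db: Sequence[float],
    scores: Sequence[float],
    target: float = 0.5,
) -> float:
    """Return the SNR (dB) at which the recognition score crosses `target`.

    Hypothetical helper: uses plain linear interpolation between the two
    neighbouring measurement points, which is an assumption of this sketch.
    """
    if len(snrs_db) != len(scores) or len(snrs_db) < 2:
        raise ValueError("need at least two (SNR, score) pairs of equal length")

    # Sort by SNR so neighbouring points are adjacent on the SNR axis.
    points = sorted(zip(snrs_db, scores))

    for (snr_lo, s_lo), (snr_hi, s_hi) in zip(points, points[1:]):
        # Look for a rising segment that brackets the target score.
        if s_lo <= target <= s_hi and s_hi > s_lo:
            frac = (target - s_lo) / (s_hi - s_lo)
            return snr_lo + frac * (snr_hi - snr_lo)

    raise ValueError("scores never cross the target")


# Made-up illustrative values, not data from the study.
example_snrs = [-12.0, -9.0, -6.0, -3.0]
example_scores = [0.10, 0.30, 0.70, 0.95]
example_srt = srt_from_scores(example_snrs, example_scores)
```

With these made-up values the crossing lies halfway between -9 dB and -6 dB, so `example_srt` is approximately -7.5 dB. A fitted psychometric function (for example a logistic curve) would be a common alternative to linear interpolation.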
Pages: 100-107
Number of pages: 8