Estimation of Speech Intelligibility Using Speech Recognition Systems

被引:1
|
作者
Takano, Yusuke [1 ]
Kondo, Kazuhiro [1 ]
机构
[1] Yamagata Univ, Grad Sch Sci & Engn, Yonezawa, Yamagata 9928510, Japan
来源
关键词
objective estimation; speech intelligibility; speech recognition; Japanese Diagnostic Rhyme Test; noise adaptation;
D O I
10.1587/transinf.E93.D.3368
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We attempted to estimate subjective scores of the Japanese Diagnostic Rhyme Test (DRT) a two to one forced selection speech Intelligibility test We used automatic speech recognizers with language models that force one of the words in the word pair mimicking the human recognition process of the DRT Initial testing was done using speaker Independent models and they showed significantly lower scores than subjective scores The acoustic models were then adapted to each of the speakers in the corpus and then adapted to noise at a specified SNR Three different types of noise were tested white noise multi talker (babble) noise and pseudo speech noise The match between subjective and estimated scores improved significantly with noise adapted models compared to speaker independent models and the speaker adapted models when the adapted noise level and the tested level match However when SNR conditions do not match the recognition scores degraded especially when tested SNR conditions were higher than the adapted noise level Accordingly we adapted the models to mixed levels of noise i e multi condition training The adapted models now showed relatively high intelligibility matching subjective intelligibility performance over all levels of noise The correlation between subjective and estimated intelligibility scores increased to 0 94 with multi talker noise 0 93 with white noise and 0 89 with pseudo speech noise while the root mean square error (RMSE) reduced from more than 40 to 13 10 13 05 and 16 06 respectively
引用
收藏
页码:3368 / 3376
页数:9
相关论文
共 50 条
  • [1] Aphasic Speech Recognition using a Mixture of Speech Intelligibility Experts
    Perez, Matthew
    Aldeneh, Zakaria
    Provost, Emily Mower
    [J]. INTERSPEECH 2020, 2020, : 4986 - 4990
  • [2] AN ASSESSMENT OF AUTOMATIC SPEECH RECOGNITION AS SPEECH INTELLIGIBILITY ESTIMATION IN THE CONTEXT OF ADDITIVE NOISE
    Liu, Wei M.
    Mason, John S. D.
    Evans, Nicholas W. D.
    Jellyman, Keith A.
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2166 - 2169
  • [3] Automatic Speech Recognition Used for Intelligibility Assessment of Text-to-Speech Systems
    Vich, Robert
    Nouza, Jan
    Vondra, Martin
    [J]. VERBAL AND NONVERBAL FEATURES OF HUMAN-HUMAN AND HUMAN-MACHINE INTERACTIONS, 2008, 5042 : 136 - +
  • [4] Estimation of speech intelligibility using objective measures
    Kondo, Kazuhiro
    [J]. APPLIED ACOUSTICS, 2013, 74 (01) : 63 - 70
  • [5] The Evaluation Process Automation of Phrase and Word Intelligibility Using Speech Recognition Systems
    Kostuchenko, Evgeny
    Novokhrestova, Dariya
    Tirskaya, Marina
    Shelupanov, Alexander
    Nemirovich-Danchenko, Mikhail
    Choynzonov, Evgeny
    Balatskaya, Lidiya
    [J]. SPEECH AND COMPUTER, SPECOM 2019, 2019, 11658 : 237 - 246
  • [6] Estimation of binaural speech intelligibility using machine learning
    Kondo, Kazuhiro
    Taira, Kazuya
    [J]. APPLIED ACOUSTICS, 2018, 129 : 408 - 416
  • [7] Using Automatic Speech Recognition to Measure the Intelligibility of Speech Synthesized from Brain Signals
    Varshney, Suvi
    Farias, Dana
    Brandman, David M.
    Stavisky, Sergey D.
    Miller, Lee M.
    [J]. 2023 11TH INTERNATIONAL IEEE/EMBS CONFERENCE ON NEURAL ENGINEERING, NER, 2023,
  • [8] Speech recognition for multiple bands: Implications for the Speech Intelligibility Index
    Humes, Larry E.
    Kidd, Gary R.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 140 (03): : 2019 - 2026
  • [9] Dysarthric speech: A comparison of computerized speech recognition and listener intelligibility
    Doyle, PC
    Leeper, HA
    Kotler, AL
    ThomasStonell, N
    ONeill, C
    Dylke, MC
    Rolls, K
    [J]. JOURNAL OF REHABILITATION RESEARCH AND DEVELOPMENT, 1997, 34 (03): : 309 - 316
  • [10] Autonomous measurement of speech intelligibility utilizing automatic speech recognition
    Meyer, Bernd T.
    Kollmeier, Birger
    Ooster, Jasper
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2982 - 2986