Measuring Speech Recognition With a Matrix Test Using Synthetic Speech

被引:18
|
作者
Nuesse, Theresa [1 ,2 ]
Wiercinski, Bianca [1 ]
Brand, Thomas [2 ,3 ]
Holube, Inga [1 ,2 ]
机构
[1] Jade Univ Appl Sci, Inst Hearing Technol & Audiol, Ofener Str 16-19, D-26121 Oldenburg, Germany
[2] Cluster Excellence Hearing4All, Oldenburg, Germany
[3] Carl von Ossietzky Univ Oldenburg, Med Phys, Oldenburg, Germany
来源
TRENDS IN HEARING | 2019年 / 23卷
关键词
speech audiometry; speech reception threshold; Oldenburg sentence test; text-to-speech; synthetic speech; COGNITIVE LOAD; SENTENCE TEST; INTELLIGIBILITY; NOISE;
D O I
10.1177/2331216519862982
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Speech audiometry is an essential part of audiological diagnostics and clinical measurements. Development times of speech recognition tests are rather long, depending on the size of speech corpus and optimization necessity. The aim of this study was to examine whether this development effort could be reduced by using synthetic speech in speech audiometry, especially in a matrix test for speech recognition. For this purpose, the speech material of the German matrix test was replicated using a preselected commercial system to generate the synthetic speech files. In contrast to the conventional matrix test, no level adjustments or optimization tests were performed while producing the synthetic speech material. Evaluation measurements were conducted by presenting both versions of the German matrix test (with natural or synthetic speech), alternately and at three different signal-to-noise ratios, to 48 young, normal-hearing participants. Psychometric functions were fitted to the empirical data. Speech recognition thresholds were 0.5 dB signal-to-noise ratio higher (worse) for the synthetic speech, while slopes were equal for both speech types. Nevertheless, speech recognition scores were comparable with the literature and the threshold difference lay within the same range as recordings of two different natural speakers. Although no optimization was applied, the synthetic-speech signals led to equivalent recognition of the different test lists and word categories. The outcomes of this study indicate that the application of synthetic speech in speech recognition tests could considerably reduce the development costs and evaluation time. This offers the opportunity to increase the speech corpus for speech recognition tests with acceptable effort.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Development of a Phrase-Based Speech-Recognition Test Using Synthetic Speech
    Ibelings, Saskia
    Brand, Thomas
    Ruigendijk, Esther
    Holube, Inga
    TRENDS IN HEARING, 2024, 28
  • [2] Speech Recognition and Listening Effort of Meaningful Sentences Using Synthetic Speech
    Ibelings, Saskia
    Brand, Thomas
    Holube, Inga
    TRENDS IN HEARING, 2022, 26
  • [3] Measuring the naturalness of synthetic speech
    Univ of Chicago, Chicago, United States
    Int J Speech Technol, 1 (7-19):
  • [4] Measuring the naturalness of synthetic speech
    Howard C. Nusbaum
    Alexander L. Francis
    Anne S. Henly
    International Journal of Speech Technology, 1997, 2 (1) : 7 - 19
  • [5] EMOTION RECOGNITION USING SYNTHETIC SPEECH AS NEUTRAL REFERENCE
    Lotfian, Reza
    Busso, Carlos
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4759 - 4763
  • [6] Multi-modal mathematics: Conveying Math using synthetic speech and speech recognition
    Fitzpatrick, D
    Karshmer, AI
    COMPUTERS HELPING PEOPLE WITH SPECIAL NEEDS: PROCEEDINGS, 2004, 3118 : 644 - 647
  • [7] Using speech synthesis to explain automatic speaker recognition: a new application of synthetic speech
    Brown, Georgina
    Kirchhubel, Christin
    Cuthbert, Ramiz
    INTERSPEECH 2023, 2023, : 4723 - 4727
  • [8] Refining maritime Automatic Speech Recognition by leveraging synthetic speech
    Martius, Christoph
    Nakilcioglu, Emin Cagatay
    Reimann, Maximilian
    John, Ole
    MARITIME TRANSPORT RESEARCH, 2024, 7
  • [9] Real and synthetic Punjabi speech datasets for automatic speech recognition
    Singh, Satwinder
    Hou, Feng
    Wang, Ruili
    Data in Brief, 52
  • [10] Real and synthetic Punjabi speech datasets for automatic speech recognition
    Singh, Satwinder
    Hou, Feng
    Wang, Ruili
    DATA IN BRIEF, 2024, 52