Measuring Speech Recognition With a Matrix Test Using Synthetic Speech

被引:18
|
作者
Nuesse, Theresa [1 ,2 ]
Wiercinski, Bianca [1 ]
Brand, Thomas [2 ,3 ]
Holube, Inga [1 ,2 ]
机构
[1] Jade Univ Appl Sci, Inst Hearing Technol & Audiol, Ofener Str 16-19, D-26121 Oldenburg, Germany
[2] Cluster Excellence Hearing4All, Oldenburg, Germany
[3] Carl von Ossietzky Univ Oldenburg, Med Phys, Oldenburg, Germany
来源
TRENDS IN HEARING | 2019年 / 23卷
关键词
speech audiometry; speech reception threshold; Oldenburg sentence test; text-to-speech; synthetic speech; COGNITIVE LOAD; SENTENCE TEST; INTELLIGIBILITY; NOISE;
D O I
10.1177/2331216519862982
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Speech audiometry is an essential part of audiological diagnostics and clinical measurements. Development times of speech recognition tests are rather long, depending on the size of speech corpus and optimization necessity. The aim of this study was to examine whether this development effort could be reduced by using synthetic speech in speech audiometry, especially in a matrix test for speech recognition. For this purpose, the speech material of the German matrix test was replicated using a preselected commercial system to generate the synthetic speech files. In contrast to the conventional matrix test, no level adjustments or optimization tests were performed while producing the synthetic speech material. Evaluation measurements were conducted by presenting both versions of the German matrix test (with natural or synthetic speech), alternately and at three different signal-to-noise ratios, to 48 young, normal-hearing participants. Psychometric functions were fitted to the empirical data. Speech recognition thresholds were 0.5 dB signal-to-noise ratio higher (worse) for the synthetic speech, while slopes were equal for both speech types. Nevertheless, speech recognition scores were comparable with the literature and the threshold difference lay within the same range as recordings of two different natural speakers. Although no optimization was applied, the synthetic-speech signals led to equivalent recognition of the different test lists and word categories. The outcomes of this study indicate that the application of synthetic speech in speech recognition tests could considerably reduce the development costs and evaluation time. This offers the opportunity to increase the speech corpus for speech recognition tests with acceptable effort.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Speech bandwidth extension method using speech recognition and speech synthesis
    Takashina, Masashi
    Kuroiwa, Shingo
    Tsuge, Satoru
    Ren, Fuji
    2006 10TH INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY, VOLS 1 AND 2, PROCEEDINGS, 2006, : 1273 - +
  • [22] Measuring the cognitive load of synthetic speech using a dual task paradigm
    Govender, Avashna
    King, Simon
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2843 - 2847
  • [23] Visual Acuity Test for Isolated Words using Speech Recognition
    Khan, Saud
    Ullah, Khalil
    2017 INTERNATIONAL CONFERENCE ON INNOVATIONS IN ELECTRICAL ENGINEERING AND COMPUTATIONAL TECHNOLOGIES (ICIEECT), 2017,
  • [24] The whisper test and speech recognition tests
    Dick, Finlay
    OCCUPATIONAL MEDICINE-OXFORD, 2018, 68 (07): : 488 - 489
  • [25] The NTID speech recognition test: NSRT®
    Bochner, Joseph H.
    Garrison, Wayne M.
    Doherty, Karen A.
    INTERNATIONAL JOURNAL OF AUDIOLOGY, 2015, 54 (07) : 490 - 498
  • [26] Evaluation of Italian Simplified Matrix Test for Speech-Recognition Measurements in Noise
    Puglisi, Giuseppina Emma
    di Berardino, Federica
    Montuschi, Carla
    Sellami, Fatma
    Albera, Andrea
    Zanetti, Diego
    Albera, Roberto
    Astolfi, Arianna
    Kollmeier, Birger
    Warzybok, Anna
    AUDIOLOGY RESEARCH, 2021, 11 (01) : 73 - 88
  • [27] Robust distributed speech recognition using speech enhancement
    Flynn, Ronan
    Jones, Edward
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2008, 54 (03) : 1267 - 1273
  • [28] Automated cleft speech evaluation using speech recognition
    Vucovich, Megan
    Hallac, Rami R.
    Kane, Alex A.
    Cook, Julie
    Van'T Slot, Cortney
    Seaward, James R.
    JOURNAL OF CRANIO-MAXILLOFACIAL SURGERY, 2017, 45 (08) : 1268 - 1271
  • [29] Measuring the intelligibility of dysarthric speech through automatic speech recognition in a pluricentric language
    Xue, Wei
    Cucchiarini, Catia
    van Hout, Roeland
    Strik, Helmer
    SPEECH COMMUNICATION, 2023, 148 : 23 - 30
  • [30] Estimation of Speech Intelligibility Using Speech Recognition Systems
    Takano, Yusuke
    Kondo, Kazuhiro
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (12): : 3368 - 3376