The use of automatic speech recognition showing the influence of nasality on speech intelligibility

Cited by: 4
Authors
Mayr, S. [1]
Burkhardt, K. [1]
Schuster, M. [2]
Rogler, K. [1]
Maier, A. [3]
Iro, H. [1]
Affiliations
[1] Univ Erlangen Nurnberg, Univ Hosp, Dept Otolaryngol Head & Neck Surg, D-91054 Erlangen, Germany
[2] Univ Erlangen Nurnberg, Univ Hosp, Div Phoniatr & Pedaudiol, D-91054 Erlangen, Germany
[3] Univ Erlangen Nurnberg, Sch Engn, Pattern Recognit Lab, D-91058 Erlangen, Germany
Keywords
Sinus surgery; Nasality; Speech intelligibility; Automatic speech recognition; Word recognition rate; Nasal peak inspiratory flow; ENDOSCOPIC SINUS SURGERY; VELOPHARYNGEAL DYSFUNCTION; CLEFT-PALATE; VOICE; TRANSNASAL; QUANTIFICATION; RHINOSINUSITIS; ETHMOIDECTOMY; DISORDERS;
DOI
10.1007/s00405-010-1256-5
Chinese Library Classification (CLC) number
R76 [Otorhinolaryngology]
Subject classification code
100213
Abstract
Altered nasality influences speech intelligibility. Automatic speech recognition (ASR) has proved suitable for quantifying speech intelligibility in patients with different degrees of nasal emissions. We used ASR to investigate the influence of hyponasality on speech recognition results before and after nasal surgery. Speech recordings, nasal peak inspiratory flow and self-perception measurements were carried out in 20 German-speaking patients (8 women, 12 men; aged 38 ± 22 years) who underwent surgery for various nasal and sinus pathologies. The degree of speech intelligibility was quantified as the percentage of words of a standardized word chain that were correctly recognized by ASR (word recognition rate; WR). WR was measured 1 day before surgery (t1), 1 day after surgery with nasal packing in place (t2), and 3 months after surgery (t3); nasal peak flow was measured at t1 and t3. WR was calculated with the program for the automatic evaluation of all kinds of speech disorders (PEAKS). WR as a parameter of speech intelligibility was significantly decreased immediately after surgery (t1 vs. t2, p < 0.01) but had increased again 3 months after surgery (t2 vs. t3, p < 0.01). WR showed no association with age or gender. There was no significant difference between WR at t1 and t3, despite a post-operative increase in nasal peak inspiratory flow. The results show that ASR is capable of quantifying the influence of hyponasality on speech: nasal obstruction leads to significantly reduced WR, and nasal peak flow cannot replace evaluation of nasality.
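The abstract defines WR as the percentage of words in the reference text that the recognizer returns correctly. A minimal Python sketch of that definition follows. The function name `word_recognition_rate`, the alignment via the standard-library `difflib.SequenceMatcher`, and the sample word sequences are all assumptions made for illustration; the record does not describe how PEAKS aligns words internally.

```python
from difflib import SequenceMatcher


def word_recognition_rate(reference: str, recognized: str) -> float:
    """Return the percentage of reference words that appear, in order,
    in the recognizer output (hypothetical helper, not the PEAKS code)."""
    ref_words = reference.lower().split()
    hyp_words = recognized.lower().split()
    if not ref_words:
        raise ValueError("reference text must contain at least one word")

    # Align the two word sequences and count the words that match in order.
    # Assumption: a simple sequence alignment stands in for the recognizer's
    # own scoring, which the abstract does not specify.
    matcher = SequenceMatcher(a=ref_words, b=hyp_words, autojunk=False)
    correct = sum(block.size for block in matcher.get_matching_blocks())

    return 100.0 * correct / len(ref_words)


if __name__ == "__main__":
    # Made-up word chain and recognizer output, not data from the study.
    reference = "one two three four five"
    recognized = "one two four five"
    print(f"WR = {word_recognition_rate(reference, recognized):.1f} %")
```

Because the denominator is the number of reference words, words inserted by the recognizer do not lower this value; a stricter measure such as word accuracy would penalize them.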
Pages: 1719-1725
Page count: 7