Speech intelligibility prediction using a Neurogram Similarity Index Measure

被引:60
|
作者
Hines, Andrew [1 ]
Harte, Naomi [1 ]
机构
[1] Trinity Coll Dublin, Sigmedia Grp, Dept Elect & Elect Engn, Dublin, Ireland
关键词
Auditory periphery model; Simulated performance intensity function; NSIM; SSIM; Speech Intelligibility; QUALITY ASSESSMENT; PHENOMENOLOGICAL MODEL; TEMPORAL INFORMATION; NORMAL-HEARING; RECOGNITION; RESPONSES; LOUDNESS; PHONEME;
D O I
10.1016/j.specom.2011.09.004
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Discharge patterns produced by fibres from normal and impaired auditory nerves in response to speech and other complex sounds can be discriminated subjectively through visual inspection. Similarly, responses from auditory nerves where speech is presented at diminishing sound levels progressively deteriorate from those at normal listening levels. This paper presents a Neurogram Similarity Index Measure (NSIM) that automates this inspection process, and translates the response pattern differences into a bounded discrimination metric. Performance intensity functions can be used to provide additional information over measurement of speech reception threshold and maximum phoneme recognition by plotting a test subject's recognition probability over a range of sound intensities. A computational model of the auditory periphery was used to replace the human subject and develop a methodology that simulates a real listener test. The newly developed NSIM is used to evaluate the model outputs in response to Consonant-Vowel-Consonant (CVC) word lists and produce phoneme discrimination scores. The simulated results are rigorously compared to those from normal hearing subjects in both quiet and noise conditions. The accuracy of the tests and the minimum number of word lists necessary for repeatable results is established and the results are compared to predictions using the speech intelligibility index (SII). The experiments demonstrate that the proposed simulated performance intensity function (SPIF) produces results with confidence intervals within the human error bounds expected with real listener tests. This work represents an important step in validating the use of auditory nerve models to predict speech intelligibility. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:306 / 320
页数:15
相关论文
共 50 条
  • [1] Prediction of Speech Intelligibility Using a Neurogram Orthogonal Polynomial Measure (NOPM)
    Mamun, Nursadul
    Jassim, Wissam A.
    Zilany, Muhammad S. A.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (04) : 760 - 773
  • [2] Speech intelligibility of simulated hearing loss sounds and its prediction using the Gammachirp Envelope Similarity Index (GESI)
    Irino, Toshio
    Tamaru, Honoka
    Yamamoto, Ayako
    INTERSPEECH 2022, 2022, : 3929 - 3933
  • [3] Reference-Free Assessment of Speech Intelligibility Using Bispectrum of an Auditory Neurogram
    Hossain, Mohammad E.
    Jassim, Wissam A.
    Zilany, Muhammad S. A.
    PLOS ONE, 2016, 11 (03):
  • [4] An improved speech transmission index for intelligibility prediction
    Schwerin, Belinda
    Paliwal, Kuldip
    SPEECH COMMUNICATION, 2014, 65 : 9 - 19
  • [5] Binaural intelligibility prediction based on the speech transmission index
    van Wijngaarden, Sander J.
    Drullman, Rob
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 123 (06): : 4514 - 4523
  • [6] Chinese speech intelligibility and speech intelligibility index for the elderly
    Zeng, Jiazhong
    Peng, Jianxin
    Xiang, Shuyin
    SPEECH COMMUNICATION, 2024, 160
  • [7] Extended speech intelligibility index for the prediction of the speech reception threshold in fluctuating noise
    Rhebergen, Koenraad S.
    Versfeld, Niek J.
    Dreschler, Wouter A.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (06): : 3988 - 3997
  • [8] Extended speech intelligibility index for the prediction of the speech reception threshold in fluctuating noise
    Rhebergen, Koenraad S.
    Versfeld, Niek J.
    Dreschler, Wouter. A.
    Journal of the Acoustical Society of America, 2006, 120 (06): : 3988 - 3997
  • [9] On the effect of speech level on intelligibility: The correspondence of speech intelligibility index (SII) to intelligibility
    Dept. of Architecture, Faculty of Science and Engineering, Kinki Univ., Japan
    J. Environ. Eng., 2009, 642 (931-936):
  • [10] Coherence and the speech intelligibility index
    Kates, JM
    Arehart, KH
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2005, 117 (04): : 2224 - 2237