Binaural intelligibility prediction based on the speech transmission index

被引:60
|
作者
van Wijngaarden, Sander J. [1 ]
Drullman, Rob [1 ]
机构
[1] TNO Human Factors, NL-3769 ZG Soesterberg, Netherlands
来源
关键词
D O I
10.1121/1.2905245
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Although the speech transmission index (STI) is a well-accepted and standardized method for objective prediction of speech intelligibility in a wide range of environments and applications, it is essentially a monaural model. Advantages of binaural hearing in speech intelligibility are disregarded. In specific conditions, this leads to considerable mismatches between subjective intelligibility and the STI. A binaural version of the STI was developed based on interaural cross correlograms, which shows a considerably improved correspondence with subjective intelligibility in dichotic listening conditions. The new binaural STI is designed to be a relatively simple model, which adds only few parameters to the original standardized STI and changes none of the existing model parameters. For monaural conditions, the outcome is identical to the standardized STI. The new model was validated on a set of 39 dichotic listening conditions, featuring anechoic, classroom, listening room, and strongly echoic environments. For these 39 conditions, speech intelligibility [consonant-vowel-consonant (CVC) word score] and binaural STI were measured. On the basis of these conditions, the relation between binaural STI and CVC word scores closely matches the STI reference curve (standardized relation between STI and CVC word score) for monaural listening. A better-ear STI appears to perform quite well in relation to the binaural STI model; the monaural STI performs poorly in these cases. (C) 2008 Acoustical Society of America.
引用
收藏
页码:4514 / 4523
页数:10
相关论文
共 50 条
  • [1] An improved speech transmission index for intelligibility prediction
    Schwerin, Belinda
    Paliwal, Kuldip
    [J]. SPEECH COMMUNICATION, 2014, 65 : 9 - 19
  • [2] Prediction of binaural speech intelligibility against noise in rooms
    Lavandier, Mathieu
    Culling, John F.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 127 (01): : 387 - 399
  • [3] NON-INTRUSIVE BINAURAL PREDICTION OF SPEECH INTELLIGIBILITY BASED ON PHONEME CLASSIFICATION
    Rossbach, Jana
    Roettges, Saskia
    Hauth, Christopher F.
    Brand, Thomas
    Meyer, Bernd T.
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 396 - 400
  • [4] Prediction of the influence of reverberation on binaural speech intelligibility in noise and in quiet
    Rennies, Jan
    Brand, Thomas
    Kollmeier, Birger
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2011, 130 (05): : 2999 - 3012
  • [5] Experimental comparison between speech transmission index, rapid speech transmission index, and speech intelligibility index
    Larm, P
    Hongisto, V
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 119 (02): : 1106 - 1117
  • [6] Segmentation of binaural room impulse responses for speech intelligibility prediction
    Kokabi, Omid
    Brinkmann, Fabian
    Weinzierl, Stefan
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2018, 144 (05): : 2793 - 2800
  • [8] Experimental comparison between speech transmission index, rapid speech transmission index, and speech intelligibility index
    Larm, Petra
    Hongisto, Valtteri
    [J]. Journal of the Acoustical Society of America, 2006, 119 (02): : 1106 - 1117
  • [9] Speech intelligibility prediction in reverberation: Towards an integrated model of speech transmission, spatial unmasking, and binaural de-reverberation
    Leclere, Thibaud
    Lavandier, Mathieu
    Culling, John F.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2015, 137 (06): : 3335 - 3345
  • [10] Binaural Speech Intelligibility Prediction in the Presence of Multiple Babble Interferers Based on Mutual Information
    Geravanchizadeh, Masoud
    Avanaki, Hadi Jamshidi
    Dadvar, Paria
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2017, 65 (04): : 285 - 292