Binaural intelligibility prediction based on the speech transmission index

Cited by: 60

Authors:

van Wijngaarden, Sander J. [1]

Drullman, Rob [1]

Affiliations:

[1] TNO Human Factors, NL-3769 ZG Soesterberg, Netherlands

Source:

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2008 / Vol. 123 / No. 06

Keywords:

DOI:

10.1121/1.2905245

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

Although the speech transmission index (STI) is a well-accepted and standardized method for objective prediction of speech intelligibility in a wide range of environments and applications, it is essentially a monaural model. Advantages of binaural hearing in speech intelligibility are disregarded. In specific conditions, this leads to considerable mismatches between subjective intelligibility and the STI. A binaural version of the STI, based on interaural cross correlograms, was developed; it shows a considerably improved correspondence with subjective intelligibility in dichotic listening conditions. The new binaural STI is designed to be a relatively simple model, which adds only a few parameters to the original standardized STI and changes none of the existing model parameters. For monaural conditions, the outcome is identical to the standardized STI. The new model was validated on a set of 39 dichotic listening conditions, featuring anechoic, classroom, listening room, and strongly echoic environments. For these 39 conditions, speech intelligibility [consonant-vowel-consonant (CVC) word score] and binaural STI were measured. On the basis of these conditions, the relation between binaural STI and CVC word scores closely matches the STI reference curve (standardized relation between STI and CVC word score) for monaural listening. A better-ear STI appears to perform quite well in relation to the binaural STI model; the monaural STI performs poorly in these cases. © 2008 Acoustical Society of America.
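
To make the "better-ear STI" mentioned in the abstract concrete, the following is a minimal Python sketch, not the authors' model. It assumes the usual STI mapping from a modulation transfer value m to an apparent signal-to-noise ratio, clipped to ±15 dB and rescaled to a transmission index, and then takes an unweighted mean. The standardized STI uses octave-band weighting and further corrections that are omitted here, and the correlogram-based binaural model from the paper is not reproduced. The function names and the "larger of the two per-ear values" reading of better-ear are assumptions made for illustration.

```python
import math


def transmission_index(m):
    """Map one modulation transfer value m to a transmission index in [0, 1]."""
    if m <= 0.0:
        return 0.0
    if m >= 1.0:
        return 1.0
    snr_db = 10.0 * math.log10(m / (1.0 - m))  # apparent signal-to-noise ratio
    snr_db = max(-15.0, min(15.0, snr_db))     # clip to the +/-15 dB range
    return (snr_db + 15.0) / 30.0


def simplified_sti(mtf):
    """Unweighted mean transmission index over a band-by-modulation-frequency matrix.

    Simplification: the standardized STI applies octave-band weights instead.
    """
    values = [transmission_index(m) for band in mtf for m in band]
    return sum(values) / len(values)


def better_ear_sti(mtf_left, mtf_right):
    """Assumed reading of 'better-ear': the larger of the two per-ear values."""
    return max(simplified_sti(mtf_left), simplified_sti(mtf_right))


if __name__ == "__main__":
    left = [[0.5, 0.5]]    # hypothetical modulation transfer values, left ear
    right = [[0.99, 0.5]]  # hypothetical modulation transfer values, right ear
    print(better_ear_sti(left, right))
```

In this toy input the right ear has the higher modulation transfer, so its per-ear value is the one returned.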


Pages: 4514-4523

Page count: 10

50 records in total

[1] An improved speech transmission index for intelligibility prediction
Schwerin, Belinda
Paliwal, Kuldip
[J]. SPEECH COMMUNICATION, 2014, 65: 9-19
[2] Prediction of binaural speech intelligibility against noise in rooms
Lavandier, Mathieu
Culling, John F.
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 127 (01): 387-399
[3] NON-INTRUSIVE BINAURAL PREDICTION OF SPEECH INTELLIGIBILITY BASED ON PHONEME CLASSIFICATION
Rossbach, Jana
Roettges, Saskia
Hauth, Christopher F.
Brand, Thomas
Meyer, Bernd T.
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021: 396-400
[4] Prediction of the influence of reverberation on binaural speech intelligibility in noise and in quiet
Rennies, Jan
Brand, Thomas
Kollmeier, Birger
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2011, 130 (05): 2999-3012
[5] Experimental comparison between speech transmission index, rapid speech transmission index, and speech intelligibility index
Larm, P
Hongisto, V
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 119 (02): 1106-1117
[6] Segmentation of binaural room impulse responses for speech intelligibility prediction
Kokabi, Omid
Brinkmann, Fabian
Weinzierl, Stefan
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2018, 144 (05): 2793-2800
[9] Speech intelligibility prediction in reverberation: Towards an integrated model of speech transmission, spatial unmasking, and binaural de-reverberation
Leclere, Thibaud
Lavandier, Mathieu
Culling, John F.
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2015, 137 (06): 3335-3345
[10] Binaural Speech Intelligibility Prediction in the Presence of Multiple Babble Interferers Based on Mutual Information
Geravanchizadeh, Masoud
Avanaki, Hadi Jamshidi
Dadvar, Paria
[J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2017, 65 (04): 285-292
