An improved speech transmission index for intelligibility prediction

Cited by: 20
Authors
Schwerin, Belinda [1 ]
Paliwal, Kuldip [1 ]
Affiliations
[1] Griffith Univ, Griffith Sch Engn, Signal Proc Lab, Nathan, Qld 4111, Australia
Keywords
Speech transmission index; Modulation transfer function; Speech enhancement; Objective evaluation; Speech intelligibility; Short-time modulation spectrum; Coherence
DOI
10.1016/j.specom.2014.05.003
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
The speech transmission index (STI) is a well-known measure of intelligibility, most suited to the evaluation of speech intelligibility in rooms, with stimuli subjected to additive noise and reverberance. However, STI and its many variations do not effectively represent the intelligibility of stimuli containing non-linear distortions, such as those resulting from processing by enhancement algorithms. In this paper, we revisit the STI approach and propose a variation which processes the modulation envelope in short-time segments, requiring only an assumption of quasi-stationarity (rather than the stationarity assumption of STI) of the modulation signal. Results presented in this work show that the proposed approach improves the measure's correlation with subjective intelligibility scores compared to traditional STI, for stimuli corrupted by a range of noise types and processed by different enhancement approaches. The approach is also shown to have higher correlation than the other coherence, correlation and distance measures tested, but is unsuited to the evaluation of stimuli heavily distorted by (for example) masking-based processing, where an alternative approach such as STOI is recommended. © 2014 Elsevier B.V. All rights reserved.
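The idea of scoring the modulation envelope in short-time segments can be illustrated with a minimal Python sketch. This is not the authors' algorithm: the function names `envelope` and `short_time_index`, the moving-average envelope, the single full-band channel, and the segment and hop lengths are all assumptions made for illustration. The only STI-style element borrowed is the common mapping of an apparent SNR, clipped to [-15, 15] dB, onto a 0 to 1 index.

```python
import numpy as np

def envelope(x, fs, cutoff_hz=50.0):
    """Crude intensity envelope: squared signal smoothed by a moving average.
    (Assumption: stands in for a proper band-filtered envelope extractor.)"""
    win = max(1, int(fs / cutoff_hz))
    kernel = np.ones(win) / win
    return np.convolve(x ** 2, kernel, mode="same")

def short_time_index(clean, degraded, fs, seg_s=0.25, hop_s=0.125):
    """Average a per-segment envelope-correlation index over short segments.
    Hypothetical illustration only; parameter values are not from the paper."""
    env_c = envelope(clean, fs)
    env_d = envelope(degraded, fs)
    seg = int(seg_s * fs)
    hop = int(hop_s * fs)
    scores = []
    for start in range(0, len(env_c) - seg + 1, hop):
        # Remove the local mean so only the modulation within the segment counts.
        c = env_c[start:start + seg] - env_c[start:start + seg].mean()
        d = env_d[start:start + seg] - env_d[start:start + seg].mean()
        denom = np.sqrt(np.sum(c ** 2) * np.sum(d ** 2))
        if denom == 0:
            continue  # skip segments with a flat envelope
        r = max(np.sum(c * d) / denom, 0.0)        # ignore negative correlation
        r2 = np.clip(r * r, 1e-12, 1 - 1e-12)      # keep the log finite
        snr_db = 10 * np.log10(r2 / (1 - r2))      # apparent SNR from correlation
        snr_db = np.clip(snr_db, -15.0, 15.0)
        scores.append((snr_db + 15.0) / 30.0)      # map to a 0..1 index
    return float(np.mean(scores)) if scores else 0.0
```

Because each segment is normalised by its own local statistics, the sketch only assumes that the envelope is roughly stationary within a segment, which mirrors the quasi-stationarity assumption described in the abstract.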
Pages: 9-19
Page count: 11