Predicting speech intelligibility based on the signal-to-noise envelope power ratio after modulation-frequency selective processing

被引:156
|
作者
Jorgensen, Soren [1 ]
Dau, Torsten [1 ]
机构
[1] Tech Univ Denmark, Dept Elect Engn, Ctr Appl Hearing Res, DK-2800 Lyngby, Denmark
来源
关键词
MASKING-LEVEL DIFFERENCES; AMPLITUDE-MODULATION; RECEPTION THRESHOLD; TRANSMISSION INDEX; TEMPORAL ENVELOPE; ROOM ACOUSTICS; COMPRESSION; SPECTRUM; RECOGNITION; INTENSITY;
D O I
10.1121/1.3621502
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A model for predicting the intelligibility of processed noisy speech is proposed. The speech-based envelope power spectrum model has a similar structure as the model of Ewert and Dau [(2000). J. Acoust. Soc. Am. 108, 1181-1196], developed to account for modulation detection and masking data. The model estimates the speech-to-noise envelope power ratio, SNRenv, at the output of a modulation filterbank and relates this metric to speech intelligibility using the concept of an ideal observer. Predictions were compared to data on the intelligibility of speech presented in stationary speech-shaped noise. The model was further tested in conditions with noisy speech subjected to reverberation and spectral subtraction. Good agreement between predictions and data was found in all cases. For spectral subtraction, an analysis of the model's internal representation of the stimuli revealed that the predicted decrease of intelligibility was caused by the estimated noise envelope power exceeding that of the speech. The classical concept of the speech transmission index fails in this condition. The results strongly suggest that the signal-to-noise ratio at the output of a modulation frequency selective process provides a key measure of speech intelligibility. [DOI: 10.1121/1.3621502]
引用
收藏
页码:1475 / 1487
页数:13
相关论文
共 50 条
  • [41] Convergence analysis of a joint signal-to-noise ratio and channel estimator for frequency selective channels in orthogonal frequency division multiplexing context
    Savaux, Vincent
    Djoko-Kouam, Moise
    Louet, Yves
    Skrzypczak, Alexandre
    IET SIGNAL PROCESSING, 2014, 8 (06) : 693 - 701
  • [42] A Tuning Method Based on Signal-to-Noise Power Ratio for Adaptive PLL and its Relationship with Equivalent Noise Bandwidth
    Won, Jong-Hoon
    Eissfeller, Bernd
    IEEE COMMUNICATIONS LETTERS, 2013, 17 (02) : 393 - 396
  • [43] Joint Modulation Format Identification and Optical Signal-to-Noise Ratio Monitoring Based on Ternary Neural Networks
    Zhou, Peng
    Li, Chuanqi
    Chen, Dong
    Zhang, Yu
    Lu, Ye
    IEEE ACCESS, 2022, 10 : 133324 - 133332
  • [44] Application of a Fourier transform based filtering technique to improve signal-to-noise ratio in modulation spectroscopy experiments
    Ghosh, S
    Arora, BM
    REVIEW OF SCIENTIFIC INSTRUMENTS, 1997, 68 (08): : 3260 - 3261
  • [45] Multi-Subband Radar Signal Fusion Processing Based on Deep Neural Network in Low Signal-to-Noise Ratio
    Jiang, Yilin
    Tang, Sanqiang
    Lu, Manjun
    Zhang, Liting
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [46] On-line Gaussian mixture modeling in the log-power domain for signal-to-noise ratio estimation and speech enhancement
    Dat, Tran Huy
    Takeda, Kazuya
    Itakura, Fumitada
    SPEECH COMMUNICATION, 2006, 48 (11) : 1515 - 1527
  • [47] An automatic detection algorithm for multi-target modulation spectrum shaft frequency under low signal-to-noise ratio
    Ma K.
    Chen Z.
    Wang Y.
    Cheng Y.
    Zhendong yu Chongji/Journal of Vibration and Shock, 2022, 41 (24): : 19 - 26
  • [48] High signal-to-noise ratio optical frequency comb generation based on a Brillouin amplified recirculating frequency shifter loop
    Zhang, Xiang
    Xu, Yin
    Wang, Yihan
    Bao, Hualong
    OPTICS AND LASER TECHNOLOGY, 2025, 181
  • [49] TRUTH-TO-ESTIMATE RATIO MASK: A POST-PROCESSING METHOD FOR SPEECH ENHANCEMENT DIRECT AT LOW SIGNAL-TO-NOISE RATIOS
    Chen, Bohan
    Wang, He
    Wei, Yue
    So, Richard H. Y.
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7509 - 7513
  • [50] Note: A signal-to-noise ratio enhancement based on wafer light irradiation system for optical modulation spectroscopy measurement
    Chouaib, H.
    Kelly, P. V.
    REVIEW OF SCIENTIFIC INSTRUMENTS, 2012, 83 (02):