Measuring time-frequency importance functions of speech with bubble noise

被引:12
|
作者
Mandel, Michael I. [1 ]
Yoho, Sarah E. [2 ]
Healy, Eric W. [2 ]
机构
[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
[2] Ohio State Univ, Dept Speech & Hearing Sci, Columbus, OH 43210 USA
来源
关键词
PERCEPTUAL COMPENSATION; BAND-IMPORTANCE; RECOGNITION; INTELLIGIBILITY; INFORMATION; SENTENCES; AGREEMENT; CHANNEL; WORDS; MODEL;
D O I
10.1121/1.4964102
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Listeners can reliably perceive speech in noisy conditions, but it is not well understood what specific features of speech they use to do this. This paper introduces a data-driven framework to identify the time-frequency locations of these features. Using the same speech utterance mixed with many different noise instances, the framework is able to compute the importance of each time-frequency point in the utterance to its intelligibility. The mixtures have approximately the same global signal-to-noise ratio at each frequency, but very different recognition rates. The difference between these intelligible vs unintelligible mixtures is the alignment between the speech and spectro-temporally modulated noise, providing different combinations of "glimpses" of speech in each mixture. The current results reveal the locations of these important noise-robust phonetic features in a restricted set of syllables. Classification models trained to predict whether individual mixtures are intelligible based on the location of these glimpses can generalize to new conditions, successfully predicting the intelligibility of novel mixtures. They are able to generalize to novel noise instances, novel productions of the same word by the same talker, novel utterances of the same word spoken by different talkers, and, to some extent, novel consonants. (C) 2016 Acoustical Society of America.
引用
收藏
页码:2542 / 2553
页数:12
相关论文
共 50 条
  • [1] Evaluation of the importance of time-frequency contributions to speech intelligibility in noise
    Yu, Chengzhu
    Wojcicki, Kamil K.
    Loizou, Philipos C.
    Hansen, John H. L.
    Johnson, Michael T.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 135 (05): : 3007 - 3016
  • [2] Noise estimation based on time-frequency correlation for speech enhancement
    Yuan, Wenhao
    Lin, Jiajun
    An, Wei
    Wang, Yu
    Chen, Ning
    [J]. APPLIED ACOUSTICS, 2013, 74 (05) : 770 - 781
  • [3] Speech intelligibility in background noise with ideal binary time-frequency masking
    Wang, DeLiang
    Kjems, Ulrik
    Pedersen, Michael S.
    Boldt, Jesper B.
    Lunner, Thomas
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 125 (04): : 2336 - 2347
  • [4] Time-Frequency Domain Impulsive Noise Detection System in Speech Signal
    Choi, Min-Seok
    Shin, Ho Seon
    Hwang, Young-Soo
    Kang, Hong-Goo
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2011, 30 (02): : 73 - 79
  • [5] Review of Time-Frequency Masking Approach for Improving Speech Intelligibility in Noise
    Kim, Gibak
    [J]. IETE TECHNICAL REVIEW, 2022, 39 (03) : 623 - 634
  • [6] Perceptual effects of noise reduction by time-frequency masking of noisy speech
    Brons, Inge
    Houben, Rolph
    Dreschler, Wouter A.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 132 (04): : 2690 - 2699
  • [7] Perceptual learning for speech in noise after application of binary time-frequency masks
    Ahmadi, Mahnaz
    Gross, Vauna L.
    Sinex, Donal G.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 133 (03): : 1687 - 1692
  • [8] Unsupervised learning of time-frequency patches as a noise-robust representation of speech
    Van Segbroeck, Maarten
    Van Hamme, Hugo
    [J]. SPEECH COMMUNICATION, 2009, 51 (11) : 1124 - 1138
  • [9] The Application of Time-Frequency Masking To Improve Intelligibility of Dysarthric Speech in Background Noise
    Borrie, Stephanie A.
    Yoho, Sarah E.
    Healy, Eric W.
    Barrett, Tyson S.
    [J]. JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2023, 66 (05): : 1853 - 1866
  • [10] Recognition of speech in noise after application of time-frequency masks: Dependence on frequency and threshold parameters
    Sinex, Donal G.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 133 (04): : 2390 - 2396