Measuring time-frequency importance functions of speech with bubble noise

被引：12

作者：

Mandel, Michael I. ^{[1
]}

Yoho, Sarah E. ^{[2
]}

Healy, Eric W. ^{[2
]}

机构：

[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA

[2] Ohio State Univ, Dept Speech & Hearing Sci, Columbus, OH 43210 USA

来源：

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2016年 / 140卷 / 04期

关键词：

PERCEPTUAL COMPENSATION; BAND-IMPORTANCE; RECOGNITION; INTELLIGIBILITY; INFORMATION; SENTENCES; AGREEMENT; CHANNEL; WORDS; MODEL;

D O I：

10.1121/1.4964102

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Listeners can reliably perceive speech in noisy conditions, but it is not well understood what specific features of speech they use to do this. This paper introduces a data-driven framework to identify the time-frequency locations of these features. Using the same speech utterance mixed with many different noise instances, the framework is able to compute the importance of each time-frequency point in the utterance to its intelligibility. The mixtures have approximately the same global signal-to-noise ratio at each frequency, but very different recognition rates. The difference between these intelligible vs unintelligible mixtures is the alignment between the speech and spectro-temporally modulated noise, providing different combinations of "glimpses" of speech in each mixture. The current results reveal the locations of these important noise-robust phonetic features in a restricted set of syllables. Classification models trained to predict whether individual mixtures are intelligible based on the location of these glimpses can generalize to new conditions, successfully predicting the intelligibility of novel mixtures. They are able to generalize to novel noise instances, novel productions of the same word by the same talker, novel utterances of the same word spoken by different talkers, and, to some extent, novel consonants. (C) 2016 Acoustical Society of America.

引用

页码：2542 / 2553

页数：12

共 50 条

[1] Evaluation of the importance of time-frequency contributions to speech intelligibility in noise
Yu, Chengzhu
Wojcicki, Kamil K.
Loizou, Philipos C.
Hansen, John H. L.
Johnson, Michael T.
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 135 (05): : 3007 - 3016
[2] Noise estimation based on time-frequency correlation for speech enhancement
Yuan, Wenhao
Lin, Jiajun
An, Wei
Wang, Yu
Chen, Ning
[J]. APPLIED ACOUSTICS, 2013, 74 (05) : 770 - 781
[3] Speech intelligibility in background noise with ideal binary time-frequency masking
Wang, DeLiang
Kjems, Ulrik
Pedersen, Michael S.
Boldt, Jesper B.
Lunner, Thomas
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 125 (04): : 2336 - 2347
[4] Time-Frequency Domain Impulsive Noise Detection System in Speech Signal
Choi, Min-Seok
Shin, Ho Seon
Hwang, Young-Soo
Kang, Hong-Goo
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2011, 30 (02): : 73 - 79
[5] Review of Time-Frequency Masking Approach for Improving Speech Intelligibility in Noise
Kim, Gibak
[J]. IETE TECHNICAL REVIEW, 2022, 39 (03) : 623 - 634
[6] Perceptual effects of noise reduction by time-frequency masking of noisy speech
Brons, Inge
Houben, Rolph
Dreschler, Wouter A.
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 132 (04): : 2690 - 2699
[7] Perceptual learning for speech in noise after application of binary time-frequency masks
Ahmadi, Mahnaz
Gross, Vauna L.
Sinex, Donal G.
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 133 (03): : 1687 - 1692
[8] Unsupervised learning of time-frequency patches as a noise-robust representation of speech
Van Segbroeck, Maarten
Van Hamme, Hugo
[J]. SPEECH COMMUNICATION, 2009, 51 (11) : 1124 - 1138
[9] The Application of Time-Frequency Masking To Improve Intelligibility of Dysarthric Speech in Background Noise
Borrie, Stephanie A.
Yoho, Sarah E.
Healy, Eric W.
Barrett, Tyson S.
[J]. JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2023, 66 (05): : 1853 - 1866
[10] Recognition of speech in noise after application of time-frequency masks: Dependence on frequency and threshold parameters
Sinex, Donal G.
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 133 (04): : 2390 - 2396

← 1 2 3 4 5 →