On time-frequency masking in voiced speech

Cited by: 16

Authors:

Skoglund, J [1]

Kleijn, WB [1]

Affiliation:

[1] Chalmers Univ Technol, Dept Signals & Syst, Informat Theory Grp, SE-41296 Gothenburg, Sweden

Source:

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2000 / Vol. 8 / No. 4

Keywords:

auditory masking; phase spectrum; speech coding; temporal weighting

DOI:

10.1109/89.848218

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

This paper addresses the issue of masking of noise in voiced speech. First, we examine the audibility of cyclostationary narrow-band noise bursts added to voiced speech generated by synthetic excitation. Varying the temporal location of noise within a pitch cycle corresponds to varying its phase spectrum. Using this fact, we found that a change of phase of the noise in the high-frequency region is more perceptible for a low-pitched sound than for a high-pitched sound. We then propose a pitch-dependent temporal weighting function which can be employed in quantization of pitch cycle waveforms. In a second experiment, we found that the audibility of high-frequency noise added to natural speech can be significantly reduced using this weighting function.
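Note: the abstract's statement that moving noise within a pitch cycle changes only its phase spectrum follows from the DFT shift theorem. The sketch below is an illustration under stated assumptions, not the authors' experimental setup: it uses a windowed white-noise burst rather than narrow-band noise, and the cycle length, burst length, and offsets are hypothetical values chosen for the example.

```python
import numpy as np


def burst_in_cycle(cycle_len, burst_len, offset, seed=0):
    """Return one pitch cycle holding a short noise burst at `offset`.

    The burst is placed by a circular shift, which models a signal that
    repeats every `cycle_len` samples (an assumption of this sketch).
    """
    rng = np.random.default_rng(seed)
    # Same seed gives the same burst, so only its position differs.
    burst = rng.standard_normal(burst_len) * np.hanning(burst_len)
    cycle = np.zeros(cycle_len)
    cycle[:burst_len] = burst
    return np.roll(cycle, offset)


# Hypothetical values: 80 samples per cycle is a 100 Hz pitch at 8 kHz.
cycle_len = 80
early_offset, late_offset = 5, 45

early = burst_in_cycle(cycle_len, 16, early_offset)
late = burst_in_cycle(cycle_len, 16, late_offset)

spec_early = np.fft.rfft(early)
spec_late = np.fft.rfft(late)

# The magnitude spectrum does not depend on where the burst sits.
same_magnitude = np.allclose(np.abs(spec_early), np.abs(spec_late))

# The phase differs by a linear term set by the time shift.
k = np.arange(spec_early.size)
shift = late_offset - early_offset
predicted = spec_early * np.exp(-2j * np.pi * k * shift / cycle_len)
linear_phase = np.allclose(spec_late, predicted)

print(same_magnitude, linear_phase)
```

Both checks compare the two cycles bin by bin: equal magnitudes with a phase difference proportional to frequency is what "varying the temporal location corresponds to varying the phase spectrum" means in this simplified periodic setting.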

Pages: 361-369

Number of pages: 9

