On time-frequency masking in voiced speech

Cited by: 16
Authors
Skoglund, J [1]
Kleijn, WB [1]
Affiliations
[1] Chalmers Univ Technol, Dept Signals & Syst, Informat Theory Grp, SE-41296 Gothenburg, Sweden
Source
Keywords
auditory masking; phase spectrum; speech coding; temporal weighting
DOI
10.1109/89.848218
Chinese Library Classification number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
This paper addresses the masking of noise in voiced speech. First, we examine the audibility of cyclostationary narrow-band noise bursts added to voiced speech generated by synthetic excitation. Varying the temporal location of the noise within a pitch cycle corresponds to varying its phase spectrum. Using this fact, we found that a change of phase of the noise in the high-frequency region is more perceptible for a low-pitched sound than for a high-pitched sound. We then propose a pitch-dependent temporal weighting function that can be employed in the quantization of pitch-cycle waveforms. In a second experiment, we found that the audibility of high-frequency noise added to natural speech can be significantly reduced using this weighting function.
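The link the abstract draws between temporal location and phase spectrum follows from the Fourier shift theorem: moving a burst within one pitch cycle leaves its magnitude spectrum unchanged and adds a linear phase term. The sketch below only illustrates that relation and is not the authors' experimental setup. The burst shape, the cycle length of 80 samples (for example, a 100 Hz pitch at 8 kHz sampling), the shift, and all function names are assumptions chosen for the example, and a deterministic pulse stands in for the narrow-band noise used in the paper.

```python
import cmath
import math


def dft(x):
    """Naive discrete Fourier transform of a sequence."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def burst(period, start, width):
    """One pitch cycle of `period` samples holding a short burst.

    The Hann-like shape is a hypothetical stand-in for a noise burst.
    """
    cycle = [0.0] * period
    for i in range(width):
        # Place the burst circularly so it wraps inside the cycle.
        value = 0.5 - 0.5 * math.cos(2 * math.pi * (i + 1) / (width + 1))
        cycle[(start + i) % period] = value
    return cycle


PERIOD = 80  # samples per pitch cycle (assumed)
WIDTH = 8    # burst length in samples (assumed)
SHIFT = 20   # displacement of the burst within the cycle (assumed)

early = dft(burst(PERIOD, 0, WIDTH))
late = dft(burst(PERIOD, SHIFT, WIDTH))

# The magnitude spectra agree; only the phase differs.
max_mag_diff = max(abs(abs(a) - abs(b)) for a, b in zip(early, late))

# The phase difference is the linear term -2*pi*k*SHIFT/PERIOD.
max_phase_err = max(
    abs(b - a * cmath.exp(-2j * math.pi * k * SHIFT / PERIOD))
    for k, (a, b) in enumerate(zip(early, late))
)

print(max_mag_diff, max_phase_err)
```

Both printed values should be at the level of floating-point rounding error, which is why a change in burst position can be studied as a pure change in phase.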
Pages: 361-369
Number of pages: 9