Accurate Labeling of Time-Frequency Units in Monaural Voiced Speech Segregation

被引:0
|
作者
Shamlou, Sanam Imani [1 ]
Geravanchizadeh, Masoud [1 ]
机构
[1] Univ Tabriz, Fac Elect & Comp Engn, Tabriz 5166615813, Iran
关键词
Computational Auditory Scene Analysis (CASA); Voiced Speech Segregation; Enhanced Envelope Autocorrelation Function (EEACF); Envelope Cross-Channel Correlation;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Segregation of speech in presence of noise and other interferences is a challenging problem, especially in monaural case. Since accurate labeling of time-frequency (T-F) units has great impact on segregation results, in this paper we propose a new method for labeling T-F units in monaural voiced speech segregation which marks these units using envelope cross-channel correlation in the segmentation stage and label them with different features in low and high frequencies in the grouping stage. We use enhanced envelope autocorrelation function (EEACF) for labeling T-F units in the high frequency range. The evaluation of the system including subjective and objective criteria is done for different types of noise mixed with speech signals which shows better segregation results than conventional methods.
引用
收藏
页码:902 / 906
页数:5
相关论文
共 50 条
  • [1] On time-frequency masking in voiced speech
    Skoglund, J
    Kleijn, WB
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (04): : 361 - 369
  • [2] TIME-FREQUENCY ATTENTION FOR MONAURAL SPEECH ENHANCEMENT
    Zhang, Qiquan
    Song, Qi
    Ni, Zhaoheng
    Nicolson, Aaron
    Li, Haizhou
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7852 - 7856
  • [3] Monaural Voiced Speech Segregation Based on Dynamic Harmonic Function
    Zhang, Xueliang
    Liu, Wenju
    Xu, Bo
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2010,
  • [4] Monaural Voiced Speech Segregation Based on Dynamic Harmonic Function
    Xueliang Zhang
    Wenju Liu
    Bo Xu
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2010
  • [5] Monaural Segregation of Voiced Speech using Discriminative Random Fields
    Prabhavalkar, Rohit
    Jin, Zhaozhang
    Fosler-Lussier, Eric
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 864 - 867
  • [6] Monaural Voiced Speech Segregation Based on Pitch and Comb Filter
    Zhang, Xueliang
    Liu, Wenju
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1752 - +
  • [7] Monaural voiced speech segregation based on elaborate harmonic grouping strategies
    Liu WenJu
    Zhang XueLiang
    Jiang Wei
    Li Peng
    Xu Bo
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2011, 54 (12) : 2471 - 2480
  • [8] Monaural voiced speech segregation based on elaborate harmonic grouping strategies
    WenJu Liu
    XueLiang Zhang
    Wei Jiang
    Peng Li
    Bo Xu
    [J]. Science China Information Sciences, 2011, 54 : 2471 - 2480
  • [9] Monaural voiced speech segregation based on elaborate harmonic grouping strategies
    LIU WenJu 1
    2 Digital Media Content Technology Research Center
    [J]. Science China(Information Sciences), 2011, 54 (12) : 2491 - 2500
  • [10] Segmentation on time-frequency domain for speech segregation
    Lim, Sung-Kil
    Lee, Hyon-Soo
    [J]. 2006 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS, VOLS 1 AND 2, 2006, : 433 - +