Accurate Labeling of Time-Frequency Units in Monaural Voiced Speech Segregation

被引：0

作者：

Shamlou, Sanam Imani ^{[1
]}

Geravanchizadeh, Masoud ^{[1
]}

机构：

[1] Univ Tabriz, Fac Elect & Comp Engn, Tabriz 5166615813, Iran

来源：

2012 SIXTH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST) | 2012年

关键词：

Computational Auditory Scene Analysis (CASA); Voiced Speech Segregation; Enhanced Envelope Autocorrelation Function (EEACF); Envelope Cross-Channel Correlation;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Segregation of speech in presence of noise and other interferences is a challenging problem, especially in monaural case. Since accurate labeling of time-frequency (T-F) units has great impact on segregation results, in this paper we propose a new method for labeling T-F units in monaural voiced speech segregation which marks these units using envelope cross-channel correlation in the segmentation stage and label them with different features in low and high frequencies in the grouping stage. We use enhanced envelope autocorrelation function (EEACF) for labeling T-F units in the high frequency range. The evaluation of the system including subjective and objective criteria is done for different types of noise mixed with speech signals which shows better segregation results than conventional methods.

引用

页码：902 / 906

页数：5

共 50 条

[1] On time-frequency masking in voiced speech
Skoglund, J
Kleijn, WB
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (04): : 361 - 369
[2] TIME-FREQUENCY ATTENTION FOR MONAURAL SPEECH ENHANCEMENT
Zhang, Qiquan
Song, Qi
Ni, Zhaoheng
Nicolson, Aaron
Li, Haizhou
[J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7852 - 7856
[3] Monaural Voiced Speech Segregation Based on Dynamic Harmonic Function
Zhang, Xueliang
Liu, Wenju
Xu, Bo
[J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2010,
[4] Monaural Voiced Speech Segregation Based on Dynamic Harmonic Function
Xueliang Zhang
Wenju Liu
Bo Xu
[J]. EURASIP Journal on Audio, Speech, and Music Processing, 2010
[5] Monaural Segregation of Voiced Speech using Discriminative Random Fields
Prabhavalkar, Rohit
Jin, Zhaozhang
Fosler-Lussier, Eric
[J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 864 - 867
[6] Monaural Voiced Speech Segregation Based on Pitch and Comb Filter
Zhang, Xueliang
Liu, Wenju
[J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1752 - +
[7] Monaural voiced speech segregation based on elaborate harmonic grouping strategies
Liu WenJu
Zhang XueLiang
Jiang Wei
Li Peng
Xu Bo
[J]. SCIENCE CHINA-INFORMATION SCIENCES, 2011, 54 (12) : 2471 - 2480
[8] Monaural voiced speech segregation based on elaborate harmonic grouping strategies
WenJu Liu
XueLiang Zhang
Wei Jiang
Peng Li
Bo Xu
[J]. Science China Information Sciences, 2011, 54 : 2471 - 2480
[9] Monaural voiced speech segregation based on elaborate harmonic grouping strategies
LIU WenJu 1
2 Digital Media Content Technology Research Center
[J]. Science China(Information Sciences), 2011, 54 (12) : 2491 - 2500
[10] Segmentation on time-frequency domain for speech segregation
Lim, Sung-Kil
Lee, Hyon-Soo
[J]. 2006 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS, VOLS 1 AND 2, 2006, : 433 - +

← 1 2 3 4 5 →