Accurate Labeling of Time-Frequency Units in Monaural Voiced Speech Segregation

被引:0
|
作者
Shamlou, Sanam Imani [1 ]
Geravanchizadeh, Masoud [1 ]
机构
[1] Univ Tabriz, Fac Elect & Comp Engn, Tabriz 5166615813, Iran
关键词
Computational Auditory Scene Analysis (CASA); Voiced Speech Segregation; Enhanced Envelope Autocorrelation Function (EEACF); Envelope Cross-Channel Correlation;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Segregation of speech in presence of noise and other interferences is a challenging problem, especially in monaural case. Since accurate labeling of time-frequency (T-F) units has great impact on segregation results, in this paper we propose a new method for labeling T-F units in monaural voiced speech segregation which marks these units using envelope cross-channel correlation in the segmentation stage and label them with different features in low and high frequencies in the grouping stage. We use enhanced envelope autocorrelation function (EEACF) for labeling T-F units in the high frequency range. The evaluation of the system including subjective and objective criteria is done for different types of noise mixed with speech signals which shows better segregation results than conventional methods.
引用
收藏
页码:902 / 906
页数:5
相关论文
共 50 条
  • [31] Accurate and efficient implementation of the time-frequency matched filter
    O'Toole, J. M.
    Mesbah, M.
    Boashash, B.
    [J]. IET SIGNAL PROCESSING, 2010, 4 (04) : 428 - 437
  • [32] Accurate channel identification with time-frequency interferometry for OFDM
    Ahn, Chang-Jun
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2007, E90A (11): : 2641 - 2645
  • [33] A HYBRID TIME-FREQUENCY DOMAIN ARTICULATORY SPEECH SYNTHESIZER
    SONDHI, MM
    SCHROETER, J
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1987, 35 (07): : 955 - 967
  • [34] A Time-Frequency Attention Module for Neural Speech Enhancement
    Zhang, Qiquan
    Qian, Xinyuan
    Ni, Zhaoheng
    Nicolson, Aaron
    Ambikairajah, Eliathamby
    Li, Haizhou
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 462 - 475
  • [35] Proposition of adaptative time-frequency representation of speech signal
    Moussa, Sonia
    Hajaiej, Zied
    Garsallah, Ali
    [J]. 2016 4TH INTERNATIONAL CONFERENCE ON CONTROL ENGINEERING & INFORMATION TECHNOLOGY (CEIT), 2016,
  • [36] Integrated speech enhancement and coding in the time-frequency domain
    Drygajlo, A
    Carnero, B
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1183 - 1186
  • [37] SPEECH ENHANCEMENT BASED ON JOINT TIME-FREQUENCY SEGMENTATION
    Tantibundhit, C.
    Pernkopf, F.
    Kubin, G.
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4673 - +
  • [38] Adaptive time-frequency data fusion for speech enhancement
    Shi, G
    Aarabi, P
    Lazic, N
    [J]. FUSION 2003: PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE OF INFORMATION FUSION, VOLS 1 AND 2, 2003, : 394 - 399
  • [39] TIME-FREQUENCY RESOLUTION EXPERIMENT IN SPEECH ANALYSIS AND SYNTHESIS
    PATISAUL, CR
    HAMMETT, JC
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1975, 58 (06): : 1296 - 1307
  • [40] Robust speech separation using time-frequency masking
    Aarabi, P
    Shi, GJ
    Jahromi, O
    [J]. 2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I, PROCEEDINGS, 2003, : 741 - 744