Hybrid Approach to Single-Channel Speech Separation Based on Coherent-Incoherent Modulation Filtering

Cited by: 2
Authors
Mahmoodzadeh, Azar [1]
Abutalebi, Hamid Reza [1]
Affiliations
[1] Yazd Univ, Dept Elect Engn, Pajuhesh St,Postal Box 89195-741, Safaieh, Yazd, Iran
Keywords
Single-channel speech separation; Coherent and incoherent demodulation; Modulation filtering; Carrier estimator; Instantaneous frequency; Modulator signal; PITCH TRACKING; FREQUENCY; SEGREGATION; ALGORITHM;
DOI
10.1007/s00034-016-0388-2
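Chinese Library Classification
TM [Electrical engineering]; TN [Electronic and communication technology]
Discipline classification codes
0808; 0809
Abstract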
Single-channel speech separation is a challenging problem that has attracted particular interest in recent years. The goal is to separate the target speech signal from the interference signals with high accuracy. We propose a new hybrid single-channel speech separation system that applies adaptive coherent modulation filtering to low-frequency subbands and an iterative incoherent speech separation technique to high-frequency subbands. In the adaptive coherent modulation filtering, an affine projection filter is applied to the subband envelope to eliminate the interference signal. The subband envelope is obtained by demodulating the subband signal with a coherently detected subband carrier, based on time-dependent spectral center-of-gravity demodulation. The adaptive affine projection filter uses the separated target signal obtained from the iterative incoherent speech separation system as a reference signal. The incoherent system first obtains a rough estimate of the target fundamental frequency range and uses this estimate to segregate the target speech; it then iteratively improves both the fundamental frequency range estimate and the voiced speech separation. Perceptual evaluation of speech quality (PESQ), one of the evaluation indices investigated in this paper, indicates that the proposed system extracts the majority of target speech segments with minimal interference and outperforms previous systems in voiced speech separation.
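The abstract names an affine projection filter driven by a reference signal but gives no update equations. The following is a minimal sketch of a textbook regularized affine projection algorithm, not the authors' implementation. The function name `affine_projection_filter` and the parameters `num_taps`, `order`, `mu`, and `delta` are assumptions chosen for illustration; in the setting of the paper, `x` would stand for a subband envelope of the mixture and `d` for the reference obtained from the incoherent separation stage.

```python
import numpy as np

def affine_projection_filter(x, d, num_taps=8, order=4, mu=0.5, delta=1e-6):
    """Adapt FIR weights w so that filtering x tracks the reference d.

    Hypothetical helper: a generic affine projection update, with all
    parameter names and default values assumed for illustration.
    """
    x = np.asarray(x, dtype=float)
    d = np.asarray(d, dtype=float)
    pad = num_taps + order - 2
    xp = np.concatenate([np.zeros(pad), x])        # xp[i + pad] == x[i]
    dp = np.concatenate([np.zeros(order - 1), d])  # dp[i + order - 1] == d[i]
    w = np.zeros(num_taps)
    y = np.zeros(len(x))
    eye = np.eye(order)
    for n in range(len(x)):
        # Column k holds the tap vector [x[n-k], ..., x[n-k-num_taps+1]].
        X = np.empty((num_taps, order))
        for k in range(order):
            hi = n - k + pad
            X[:, k] = xp[hi - num_taps + 1 : hi + 1][::-1]
        d_vec = dp[n : n + order][::-1]            # [d[n], ..., d[n-order+1]]
        y[n] = X[:, 0] @ w                         # a priori filter output
        e = d_vec - X.T @ w                        # errors on the last `order` samples
        # Regularized projection of the error onto the recent input vectors.
        w = w + mu * X @ np.linalg.solve(X.T @ X + delta * eye, e)
    return y, w
```

With `order=1` the update reduces to normalized LMS; larger values of `order` reuse more past input vectors per step, which typically speeds convergence on correlated inputs at a higher cost per sample.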
Pages: 1970-1988
Number of pages: 19
Related papers
50 in total
  • [1] Hybrid Approach to Single-Channel Speech Separation Based on Coherent–Incoherent Modulation Filtering
    Azar Mahmoodzadeh
    Hamid Reza Abutalebi
    [J]. Circuits, Systems, and Signal Processing, 2017, 36 : 1970 - 1988
  • [2] A HYBRID COHERENT-INCOHERENT METHOD OF MODULATION FILTERING FOR SINGLE CHANNEL SPEECH SEPARATION
    Mahmoodzadeh, A.
    Sheikhzadeh, H.
    Abutalebi, H. R.
    Soltanian-Zadeh, H.
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 329 - 332
  • [3] Coherent modulation spectral filtering for single-channel music source separation
    Atlas, L
    Janssen, C
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 461 - 464
  • [4] Single-channel speech separation based on modulation frequency
    Gu, Lingyun
    Stern, Richard M.
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 25 - 28
  • [5] Single-channel speech separation using soft mask filtering
    Radfar, Mohammad H.
    Dansereau, Richard M.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (08): : 2299 - 2310
  • [6] Single-channel speech enhancement using Kalman filtering in the modulation domain
    So, Stephen
    Wojcicki, Kamil K.
    Paliwal, Kuldip K.
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 993 - 996
  • [7] Modulation-domain Kalman filtering for single-channel speech enhancement
    So, Stephen
    Paliwal, Kuldip K.
    [J]. SPEECH COMMUNICATION, 2011, 53 (06) : 818 - 829
  • [8] A PITCH-AWARE APPROACH TO SINGLE-CHANNEL SPEECH SEPARATION
    Wang, Ke
    Soong, Frank
    Xie, Lei
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 296 - 300
  • [9] Sinusoidal Approach for the Single-Channel Speech Separation and Recognition Challenge
    Mowlaee, P.
    Saeidi, R.
    Tan, Z. -H.
    Christensen, M. G.
    Kinnunen, T.
    Franti, P.
    Jensen, S. H.
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 684 - +
  • [10] A Joint Approach for Single-Channel Speaker Identification and Speech Separation
    Mowlaee, Pejman
    Saeidi, Rahim
    Christensen, Mads Græsbøll
    Tan, Zheng-Hua
    Kinnunen, Tomi
    Franti, Pasi
    Jensen, Soren Holdt
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (09): : 2586 - 2601