Hybrid Approach to Single-Channel Speech Separation Based on Coherent-Incoherent Modulation Filtering

被引：2

作者：

Mahmoodzadeh, Azar ^{[1
]}

Abutalebi, Hamid Reza ^{[1
]}

机构：

[1] Yazd Univ, Dept Elect Engn, Pajuhesh St,Postal Box 89195-741, Safaieh, Yazd, Iran

来源：

CIRCUITS SYSTEMS AND SIGNAL PROCESSING | 2017年 / 36卷 / 05期

关键词：

Single-channel speech separation; Coherent and incoherent demodulation; Modulation filtering; Carrier estimator; Instantaneous frequency; Modulator signal; PITCH TRACKING; FREQUENCY; SEGREGATION; ALGORITHM;

D O I：

10.1007/s00034-016-0388-2

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Single-channel speech separation is a challenging problem that has been of particular interest in recent years. Here the goal is to separate the target speech signal from the interference signals, with high accuracy. We propose a new hybrid single-channel speech separation system that applies adaptive coherent modulation filtering for low-frequency subbands and iterative incoherent speech separation technique for high-frequency subbands. In the adaptive coherent modulation filtering, an affine projection filter is applied to subband envelope in order to eliminate the interference signal. The subband envelope is determined via demodulation of the subband signal using a coherently detected subband carrier based on the time-dependent spectral center-of-gravity demodulation. The adaptive affine projection filter uses the separated target signal obtained from the iterative incoherent speech separation system as a reference signal. This system first obtains a rough estimate of target fundamental frequency range and then uses this estimate to segregate target speech. It then improves both fundamental frequency range estimation and voiced speech separation iteratively. Perceptual evaluation of speech quality, as one of the evaluation indices investigated in this paper, indicates that the proposed system extracts the majority of target speech segments with minimal interference and outperforms previous systems in voiced speech separation.

引用

页码：1970 / 1988

页数：19

共 50 条

[1] Hybrid Approach to Single-Channel Speech Separation Based on Coherent–Incoherent Modulation Filtering
Azar Mahmoodzadeh
Hamid Reza Abutalebi
[J]. Circuits, Systems, and Signal Processing, 2017, 36 : 1970 - 1988
[2] A HYBRID COHERENT-INCOHERENT METHOD OF MODULATION FILTERING FOR SINGLE CHANNEL SPEECH SEPARATION
Mahmoodzadeh, A.
Sheikhzadeh, H.
Abutalebi, H. R.
Soltanian-Zadeh, H.
[J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 329 - 332
[3] Coherent modulation spectral filtering for single-channel music source separation
Atlas, L
Janssen, C
[J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 461 - 464
[4] Single-channel speech separation based on modulation frequency
Gu, Lingyun
Stern, Richard M.
[J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 25 - 28
[5] Single-channel speech separation using soft mask filtering
Radfar, Mohammad H.
Dansereau, Richard M.
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (08): : 2299 - 2310
[6] Single-channel speech enhancement using Kalman filtering in the modulation domain
So, Stephen
Wojcicki, Kamil K.
Paliwal, Kuldip K.
[J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 993 - 996
[7] Modulation-domain Kalman filtering for single-channel speech enhancement
So, Stephen
Paliwal, Kuldip K.
[J]. SPEECH COMMUNICATION, 2011, 53 (06) : 818 - 829
[8] A PITCH-AWARE APPROACH TO SINGLE-CHANNEL SPEECH SEPARATION
Wang, Ke
Soong, Frank
Xie, Lei
[J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 296 - 300
[9] Sinusoidal Approach for the Single-Channel Speech Separation and Recognition Challenge
Mowlaee, P.
Saeidi, R.
Tan, Z. -H.
Christensen, M. G.
Kinnunen, T.
Franti, P.
Jensen, S. H.
[J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 684 - +
[10] A Joint Approach for Single-Channel Speaker Identification and Speech Separation
Mowlaee, Pejman
Saeidi, Rahim
Christensen, Mads Grsboll
Tan, Zheng-Hua
Kinnunen, Tomi
Franti, Pasi
Jensen, Soren Holdt
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (09): : 2586 - 2601

← 1 2 3 4 5 →