Underdetermined blind separation of overlapped speech mixtures in time-frequency domain with estimated number of sources

被引:22
|
作者
Zhang, Haijian [1 ,4 ]
Hua, Guang [2 ]
Yu, Lei [1 ]
Cai, Yunlong [3 ]
Bi, Guoan [4 ]
机构
[1] Wuhan Univ, Sch Elect Informat, Wuhan, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan, Peoples R China
[3] Zhejiang Univ, Dept Informat Sci & Elect Engn, Hangzhou, Zhejiang, Peoples R China
[4] Nanyang Technol Univ, Sch EEE, Singapore, Singapore
基金
中国国家自然科学基金;
关键词
Underdetermined blind source separation; Noise suppression; Estimation of number of sources; Estimation of mixing matrix; Short-time Fourier transform; MIXING MATRIX ESTIMATION; NONSTATIONARY SIGNALS; CONVOLUTIVE MIXTURES; ALGORITHM; MODEL; EXTRACTION;
D O I
10.1016/j.specom.2017.02.003
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Noise suppression and the estimation of the number of sources are two practical issues in applications of underdetermined blind source separation (UBSS). This paper proposes a noise-robust instantaneous UBSS algorithm for highly overlapped speech sources in the short-time Fourier transform (STFT) domain. The proposed algorithm firstly estimates the unknown complex-valued mixing matrix and the number of sources, which are then used to compute the STFT coefficients of corresponding sources at each auto source time-frequency (TF) point. After that, the original sources are recovered by the inverse STFT. To mitigate the noise effect on the detection of auto-source TF points, we propose a method to effectively detect the auto-term location of the sources by using the principal component analysis (PCA) of the STFTs of noisy mixtures. The PCA-based detection method can achieve similar UBSS outcome as some filtering based methods. More importantly, an efficient method to estimate the mixing matrix is proposed based on subspace projection and clustering approaches. The number of sources is obtained by counting the number of the resultant clusters. Evaluations have been carried out by using the speech corpus NOIZEUS and the experimental results have shown improved robustness and efficiency of the proposed algorithm. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 50 条
  • [1] Underdetermined Blind Source Separation of Anechoic Speech Mixtures in the Time-frequency Domain
    Lv Yao
    Li Shuangtian
    [J]. ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 22 - 25
  • [2] A TIME-FREQUENCY BLIND SEPARATION METHOD FOR UNDERDETERMINED SPEECH MIXTURES
    Lv Yao Li Shuangtian(Institute of Acoustics
    [J]. Journal of Electronics(China), 2008, (05) : 702 - 708
  • [3] Underdetermined blind separation of nondisjoint sources in the time-frequency domain
    Aissa-El-Bey, Abdeldjalil
    Linh-Trung, Nguyen
    Abed-Meraim, Karim
    Belouchrani, Adel
    Grenier, Yves
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2007, 55 (03) : 897 - 907
  • [4] A Time-Frequency Domain Blind Source Separation Method for Underdetermined Instantaneous Mixtures
    Peng, Tianliang
    Chen, Yang
    Liu, Zengli
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2015, 34 (12) : 3883 - 3895
  • [5] A time-frequency blind signal separation method applicable to underdetermined mixtures of dependent sources
    Abrard, F
    Deville, Y
    [J]. SIGNAL PROCESSING, 2005, 85 (07) : 1389 - 1403
  • [6] Underdetermined blind separation of audio sources from the time-frequency representation of their convolutive mixtures
    Aissa-El-Bey, Abdeldjalil
    Abed-Meraim, Karim
    Grenier, Yves
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PTS 1-3, PROCEEDINGS, 2007, : 153 - 156
  • [7] A NEW TIME-FREQUENCY APPROACH FOR UNDERDETERMINED CONVOLUTIVE BLIND SPEECH SEPARATION
    Bouafif, Mariem
    Lachiri, Zied
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 3226 - 3230
  • [8] An Underdetermined Convolutional Blind Separation Algorithm for Time-Frequency Overlapped Wireless Communication Signals with Unknown Source Number
    Ma, Hao
    Zheng, Xiang
    Yu, Lu
    Wu, Xinrong
    Zhang, Yu
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (13):
  • [9] Blind separation of underdetermined convolutive mixtures using their time-frequency representation
    Aissa-El-Bey, Abdeldjalil
    Abed-Meraim, Karim
    Grenier, Yves
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (05): : 1540 - 1550
  • [10] Underdetermined blind source separation of non-disjoint nonstationary sources in the time-frequency domain
    Linh-Trung, N
    Aïssa-El-Bey, A
    Abed-Meraim, K
    Belouchrani, A
    [J]. ISSPA 2005: THE 8TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1 AND 2, PROCEEDINGS, 2005, : 46 - 49