Blind source separation of acoustic signals based on multistage ICA combining frequency-domain ICA and time-domain ICA

被引:0
|
作者
Nishikawa, T [1 ]
Saruwatari, H [1 ]
Shikano, K [1 ]
机构
[1] Nara Inst Sci & Technol, Grad Sch Informat Sci, Ikoma 6300101, Japan
关键词
blind source separation; time-domain independent component analysis; frequency-domain independent component; analysis; reverberation; microphone array;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a new algorithm for blind source separation (BSS), in which frequency-domain independent component analysis (FDICA) and time-domain ICA (TDICA) are combined to achieve a superior source-separation performance under reverberant conditions. Generally speaking, conventional TDICA fails to separate source signals under heavily reverberant conditions because of the low convergence in the iterative learning of the inverse of the mixing system. On the other hand, the separation performance of conventional FDICA also degrades significantly because the independence assumption of narrow-band signals collapses when the number of subbands increases. In the proposed method, the separated signals of FDICA are regarded as the input signals for TDICA, and we can remove the residual crosstalk components of FDICA by using TDICA. The experimental results obtained under the reverberant condition reveal that the separation performance of the proposed method is superior to those of TDICA- and FDICA-based BSS methods.
引用
收藏
页码:846 / 858
页数:13
相关论文
共 50 条
  • [21] Frequency-Domain Pearson Distribution Approach for Independent Component Analysis (FD-Pearson-ICA) in Blind Source Separation
    Solvang, Hiroko Kato
    Nagahara, Yuichi
    Araki, Shoko
    Sawada, Hiroshi
    Makino, Shoji
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (04): : 639 - 649
  • [22] Blind source separation based on binaural ICA
    Takatani, T
    Nishikawa, T
    Saruwatari, H
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO AND ELECTROACOUSTICS MULTIMEDIA SIGNAL PROCESSING, 2003, : 321 - 324
  • [23] Blind source separation of speech signals based on an ICA geometric procedure
    Rodríguez-Alvarez, M
    Rojas, F
    Salmerón, M
    Rojas, I
    Ros, E
    Puntonet, CG
    PROCEEDINGS EC-VIP-MC 2003, VOLS 1 AND 2, 2003, : 631 - 636
  • [24] Time-Domain Blind Separation of Audio Sources on the Basis of a Complete ICA Decomposition of an Observation Space
    Koldovsky, Zbynek
    Tichavsky, Petr
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (02): : 406 - 416
  • [25] Overdetermined blind source separation of real acoustic sounds based on multistage ICA using subarray processing
    Nishikawa, T
    Abe, H
    Saruwatari, H
    Shikano, K
    PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, 2003, : 510 - 513
  • [26] An online blind source separation for convolutive acoustic signals in frequency-domain
    Wu Wenyan
    Zhang Liming
    ADVANCES IN NATURAL COMPUTATION, PT 1, 2006, 4221 : 451 - 460
  • [27] Estimating Phase Linearity in the Frequency-Domain ICA Demixing Matrix
    Toyama, Keisuke
    Plumbley, Mark D.
    INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2009, 5441 : 362 - +
  • [28] A frequency-time domain stepwise ICA algorithm for blind speech enhancement
    Yong, F
    Yue, L
    ICEMI 2005: Conference Proceedings of the Seventh International Conference on Electronic Measurement & Instruments, Vol 3, 2005, : 657 - 661
  • [29] INSIGHTS INTO THE FREQUENCY DOMAIN ICA APPROACH
    Zhang, Wenyi
    Masnadi-Shirazi, Alireza
    Rao, Bhaskar D.
    2011 CONFERENCE RECORD OF THE FORTY-FIFTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS (ASILOMAR), 2011, : 2164 - 2168
  • [30] PERMUTATION ALIGNMENT OF FREQUENCY-DOMAIN ICA BY THE MAXIMIZATION OF INTRA-SOURCE ENVELOPE CORRELATIONS
    Nikunen, J.
    Virtanen, T.
    Pertila, P.
    Vilermo, M.
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 1489 - 1493