Blind Source Separation of Acoustic Signals in Realistic Environments Based on ICA in the Time-Frequency Domain

被引:7
|
作者
Ding, Shuxue [1 ,2 ]
Cichocki, Andrzej [2 ]
Huang, Jie [1 ]
Wei, Daming [1 ]
机构
[1] Univ Aizu, Dept Comp Software, Ikki Machi, Aizu Wakamatsu, Fukushima 9658580, Japan
[2] RIKEN, Brain Sci Inst, Saitama 3510198, Japan
关键词
Blind Source Separation (BSS); Independent Component Analysis (ICA); deconvolution; CFPI; permutation;
D O I
10.1108/17427370580000115
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We present an approach for blind separation of acoustic sources produced from multiple speakers mixed in realistic room environments. We first transform recorded signals into the time-frequency domain to make mixing become instantaneous. We then separate the sources in each frequency bin based on an independent component analysis (ICA) algorithm. For the present paper, we choose the complex version of fixedpoint iteration (CFPI), i.e. the complex version of FastICA, as the algorithm. From the separated signals in the time-frequency domain, we reconstruct output-separated signals in the time domain. To solve the so-called permutation problem due to the indeterminacy of permutation in the standard ICA, we propose a method that applies a special property of the CFPI cost function. Generally, the cost function has several optimal points that correspond to the different permutations of the outputs. These optimal points are isolated by some non-optimal regions of the cost function. In different but neighboring bins, optimal points with the same permutation are at almost the same position in the space of separation parameters. Based on this property, if an initial separation matrix for a learning process in a frequency bin is chosen equal to the final separation matrix of the learning process in the neighboring frequency bin, the learning process automatically leads us to separated signals with the same permutation as that of the neighbor frequency bin. In each bin, but except the starting one, by chosen the initial separation matrix in such a way, the permutation problem in the time domain reconstruction can be avoided. We present the results of some simulations and experiments on both artificially synthesized speech data and real-world speech data, which show the effectiveness of our approach.
引用
收藏
页码:89 / 100
页数:12
相关论文
共 50 条
  • [1] Blind source separation of acoustic signals based on multistage ICA combining frequency-domain ICA and time-domain ICA
    Nishikawa, T
    Saruwatari, H
    Shikano, K
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2003, E86A (04) : 846 - 858
  • [2] A batch algorithm for blind source separation of acoustic signals using ICA and time-frequency masking
    Hoffmann, Eugen
    Kolossa, Dorothea
    Orglmeister, Reinhold
    [J]. INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2007, 4666 : 480 - +
  • [3] ROBUST UNDERDETERMINED BLIND AUDIO SOURCE SEPARATION OF SPARSE SIGNALS IN THE TIME-FREQUENCY DOMAIN
    Sbai, Si Mohamed Aziz
    Aissa-El-Bey, Abdeldjalil
    Pastor, Dominique
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 3716 - 3719
  • [4] A Time-Frequency Domain Underdetermined Blind Source Separation Algorithm for MIMO Radar Signals
    Guo, Qiang
    Ruan, Guoqing
    Liao, Yanping
    [J]. SYMMETRY-BASEL, 2017, 9 (07):
  • [5] Underdetermined source separation of EEG signals in the time-frequency domain
    Shan, Zeyong
    Swary, Jacob
    Aviyente, Selin
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 3637 - 3640
  • [6] Blind source separation in the time-frequency domain based on multiple hypothesis testing
    Cirillo, Luke
    Zoubir, Abdelhak
    Amin, Moeness
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2008, 56 (06) : 2267 - 2279
  • [7] Overcomplete blind source separation by combining ICA and binary time-frequency masking
    Pedersen, MHS
    Wang, DL
    Larsen, J
    Kjems, U
    [J]. 2005 IEEE WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2005, : 15 - 20
  • [8] Blind source separation of acoustic mixtures using time-frequency domain independent component analysis
    Jayaraman, S
    Sitaraman, G
    Seshadri, R
    [J]. ICCS 2002: 8TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS SYSTEMS, VOLS 1 AND 2, PROCEEDINGS, 2002, : 1016 - 1019
  • [9] Blind source separation of acoustic mixtures using time-frequency domain independent component analysis
    Jayaraman, S
    Sitaraman, G
    Seshadri, R
    [J]. ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING: COMPUTATIONAL INTELLIGENCE FOR THE E-AGE, 2002, : 1383 - 1387
  • [10] Blind source separation based on multi-stage ICA combining frequency-domain ICA and time-domain ICA
    Nishikawa, T
    Saruwatari, H
    Shikano, K
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 917 - 920