Low Complexity Blind Separation Technique to Solve the Permutation Ambiguity of Convolutive Speech Mixtures

被引:0
|
作者
Lima, Pedro F. C. [1 ]
Miranda, Ricardo Kehrle [1 ,2 ]
da Costa, Joao Paulo C. L. [1 ,2 ,3 ]
Zelenovsky, Ricardo [1 ]
Yuan, Yizheng [4 ]
Del Galdo, Giovanni [2 ,3 ]
机构
[1] Univ Brasilia UnB, Dept Elect Engn, Brasilia, DF, Brazil
[2] Ilmenau Univ Technol, Inst Informat Technol, Ilmenau, Germany
[3] Fraunhofer Inst Integrated Circuits IIS, Erlangen, Germany
[4] Freie Univ, Berlin, Germany
关键词
blind speech separation; permutation ambiguity;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Microphone arrays can be incorporated in several devices ranging from hearing aids and bioacoustic recording equipment to teleconference phones and forensic sound recorders in order to attenuate the interference of unwanted sounds. The separation of speech mixtures can be easily performed on the frequency domain independently for each frequency component. However, in order to combine the separated signals of each frequency component, the permutation ambiguity should be solved. The state-of-the-art technique relies on an iterative computation of the dispersions of the differences between the source profiles. In this paper, we propose a low complexity solution for the permutation ambiguity with similar accuracy based on one dispersion.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] A MULTISTAGE APPROACH FOR BLIND SEPARATION OF CONVOLUTIVE SPEECH MIXTURES
    Jan, Tariqullah
    Wang, Wenwu
    Wang, DeLiang
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 1713 - +
  • [2] A multistage approach to blind separation of convolutive speech mixtures
    Jan, Tariqullah
    Wang, Wenwu
    Wang, DeLiang
    SPEECH COMMUNICATION, 2011, 53 (04) : 524 - 539
  • [3] A SPARSITY BASED CRITERION FOR SOLVING THE PERMUTATION AMBIGUITY IN CONVOLUTIVE BLIND SOURCE SEPARATION
    Mazur, Radoslaw
    Mertins, Alfred
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 1996 - 1999
  • [4] Blind speech separation of nonlinear convolutive mixtures for robust speech recognition
    Koutras, A.
    Dermatas, E.
    Kokkinakis, G.
    Control and Intelligent Systems, 2002, 30 (02) : 83 - 90
  • [5] Subband based blind source separation for convolutive mixtures of speech
    Araki, S
    Makino, S
    Aichner, R
    Nishikawa, T
    Saruwatari, H
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO AND ELECTROACOUSTICS MULTIMEDIA SIGNAL PROCESSING, 2003, : 509 - 512
  • [6] Blind source separation of convolutive mixtures of speech in frequency domain
    Makino, S
    Sawada, H
    Mukai, R
    Araki, S
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (07) : 1640 - 1655
  • [7] Multichannel blind deconvolution for source separation in convolutive mixtures of speech
    Kokkinakis, K
    Nandi, AK
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (01): : 200 - 212
  • [8] Oriented PCA Method for Blind Speech Separation of Convolutive Mixtures
    Benabderrahmane, Yasmina
    Selouani, Sid Ahmed
    O'Shaughnessy, Douglas
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 390 - +
  • [9] Solving the indeterminations of blind source separation of convolutive speech mixtures
    Rivet, B
    Girin, L
    Jutten, C
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 533 - 536
  • [10] Convolutive blind separation of speech mixtures using the natural gradient
    Douglas, SC
    Sun, XA
    SPEECH COMMUNICATION, 2003, 39 (1-2) : 65 - 78