Blind Separation of Convolutive Speech Mixtures Based on Local Sparsity and K-means

被引:0
|
作者
Huang, Yuyang [1 ]
Chu, Ping [1 ]
Liao, Bin [1 ]
机构
[1] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen 518060, Peoples R China
基金
中国国家自然科学基金;
关键词
Blind source separation; convolutive speech mixture; K-means; permutation ambiguity;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, an accurate and efficient blind source separation method based on local sparsity and K-means (LSK-BSS) is proposed. Specifically, the proposed LSK-BSS approach exploits the local sparsity of speech sources in the transformed domain to obtain closed-form solution for per-frequency mixing system estimation. On this basis, through designing superior initial points of clustering, the well-established K-means algorithm is employed to achieve accurate permutation alignment. Simulations with real reverberant speech sources show that the LSK-BSS approach yields competitive efficiency, robustness and effectiveness, in comparison with the state-of-the-arts methods.
引用
收藏
页码:271 / 275
页数:5
相关论文
共 50 条
  • [1] BLIND SEPARATION OF CONVOLUTIVE MIXTURES OF SPEECH SOURCES: EXPLOITING LOCAL SPARSITY
    Fu, Xiao
    Ma, Wing-Kin
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 4315 - 4319
  • [2] Underdetermined Blind Source Separation of Speech Mixtures Based on K-means Clustering
    Xie, Yuan
    Xie, Kan
    Wu, Zongze
    Xie, Shengli
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 42 - 46
  • [3] Subband based blind source separation for convolutive mixtures of speech
    Araki, S
    Makino, S
    Aichner, R
    Nishikawa, T
    Saruwatari, H
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO AND ELECTROACOUSTICS MULTIMEDIA SIGNAL PROCESSING, 2003, : 509 - 512
  • [4] Subband-based blind separation for convolutive mixtures of speech
    Araki, S
    Makino, S
    Aichner, R
    Nishikawa, T
    Saruwatari, H
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (12) : 3593 - 3603
  • [5] A MULTISTAGE APPROACH FOR BLIND SEPARATION OF CONVOLUTIVE SPEECH MIXTURES
    Jan, Tariqullah
    Wang, Wenwu
    Wang, DeLiang
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 1713 - +
  • [6] A multistage approach to blind separation of convolutive speech mixtures
    Jan, Tariqullah
    Wang, Wenwu
    Wang, DeLiang
    SPEECH COMMUNICATION, 2011, 53 (04) : 524 - 539
  • [7] A Blind Source Separation Approach Based on IVA for Convolutive Speech Mixtures
    Jan, Tariqullah
    Zafar, Haseeb
    Khalil, Ruhulamin
    Ashraf, Majid
    2016 8TH COMPUTER SCIENCE AND ELECTRONIC ENGINEERING CONFERENCE (CEEC), 2016, : 140 - 145
  • [8] SPARSITY-BASED ALGORITHMS FOR BLIND SEPARATION OF CONVOLUTIVE MIXTURES WITH APPLICATION TO EMG SIGNALS
    Boudjellal, A.
    Abed-Meraim, K.
    Aissa-El-Bey, A.
    Belouchrani, A.
    Ravier, Ph.
    2014 IEEE WORKSHOP ON STATISTICAL SIGNAL PROCESSING (SSP), 2014, : 189 - 192
  • [9] Blind speech separation of nonlinear convolutive mixtures for robust speech recognition
    Koutras, A.
    Dermatas, E.
    Kokkinakis, G.
    Control and Intelligent Systems, 2002, 30 (02) : 83 - 90
  • [10] Batch and Adaptive PARAFAC-Based Blind Separation of Convolutive Speech Mixtures
    Nion, Dimitri
    Mokios, Kleanthis N.
    Sidiropoulos, Nicholas D.
    Potamianos, Alexandros
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): : 1193 - 1207