Subband based blind source separation for convolutive mixtures of speech

被引:0
|
作者
Araki, S [1 ]
Makino, S [1 ]
Aichner, R [1 ]
Nishikawa, T [1 ]
Saruwatari, H [1 ]
机构
[1] NTT Corp, NTT Commun Sci Labs, Kyoto 6190237, Japan
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Subband processing is applied to blind source, separation (BSS) for convolutive mixtures of speech. This is motivated by the drawback of frequency-domain BSS, i.e., when a long frame with a fixed frame-shift is used to cover reverberation, the number of samples in each frequency decreases and the separation performance is degraded. In our proposed subband BSS, (1) by using a moderate number of subbands, a sufficient number of samples can be held in each subband, and (2) by using FIR filters in each subband, we can handle long reverberation. Subband BSS achieves better performance than frequency-domain BSS. Moreover, we propose efficient separation procedures that take into consideration the frequency characteristics of room reverberation and speech signals. We achieve this (3) by using longer unmixing filters in low frequency bands, and (4) by adopting overlap-blockshift in BSS's batch adaptation in low frequency bands. Consequently, frequency-dependent subband processing is successfully realized in the proposed subband BSS.
引用
收藏
页码:509 / 512
页数:4
相关论文
共 50 条
  • [1] Subband-based blind separation for convolutive mixtures of speech
    Araki, S
    Makino, S
    Aichner, R
    Nishikawa, T
    Saruwatari, H
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (12) : 3593 - 3603
  • [2] Subband-based blind signal processing for source separation in convolutive mixtures of speech
    Kokkinakis, Kostas
    Loizou, Philipos C.
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 917 - +
  • [3] A Blind Source Separation Approach Based on IVA for Convolutive Speech Mixtures
    Jan, Tariqullah
    Zafar, Haseeb
    Khalil, Ruhulamin
    Ashraf, Majid
    [J]. 2016 8TH COMPUTER SCIENCE AND ELECTRONIC ENGINEERING CONFERENCE (CEEC), 2016, : 140 - 145
  • [4] Subband Blind Source Separation for Convolutive Mixture of Speech Signals Based on Dynamic Modeling
    Mosayebi, Raziyeh
    Sheikhzadeh, Hamid
    Raahemifar, Kaamran
    [J]. 2013 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (IEEE ISSPIT 2013), 2013, : 299 - 304
  • [5] Solving the indeterminations of blind source separation of convolutive speech mixtures
    Rivet, B
    Girin, L
    Jutten, C
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 533 - 536
  • [6] Multichannel blind deconvolution for source separation in convolutive mixtures of speech
    Kokkinakis, K
    Nandi, AK
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (01): : 200 - 212
  • [7] Blind source separation of convolutive mixtures of speech in frequency domain
    Makino, S
    Sawada, H
    Mukai, R
    Araki, S
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (07) : 1640 - 1655
  • [8] Blind source separation of convolutive mixtures
    Serviere, C
    [J]. 8TH IEEE SIGNAL PROCESSING WORKSHOP ON STATISTICAL SIGNAL AND ARRAY PROCESSING, PROCEEDINGS, 1996, : 316 - 319
  • [9] BLIND SOURCE SEPARATION FOR CONVOLUTIVE MIXTURES
    THI, HLN
    JUTTEN, C
    [J]. SIGNAL PROCESSING, 1995, 45 (02) : 209 - 229
  • [10] Blind source separation of convolutive mixtures
    Makino, Shoji
    [J]. INDEPENDENT COMPONENT ANALYSES, WAVELETS, UNSUPERVISED SMART SENSORS, AND NEURAL NETWORKS IV, 2006, 6247