Bin-Wise Combination of Time-Frequency Masking and Beamforming for Convolutive Source Separation

被引:0
|
作者
Bella, Mostafa [1 ,2 ]
Saylani, Hicham [1 ]
Hosseini, Shahram [2 ]
Deville, Yannick [2 ]
机构
[1] Univ Ibn Zohr, Fac Sci, LETSMP, BP 8106 Cite Dakhla, Agadir, Morocco
[2] Univ Toulouse, IRAP, UPS, CNRS,CNES, 14 Av Edouard Belin, F-31400 Toulouse, France
关键词
Blind Source Separation; Underdetermined Convolutive Mixtures; Sparsity; TF Masking; Beamforming; BLIND SOURCE SEPARATION;
D O I
10.1109/MMSP55362.2022.9949527
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper presents a new Blind Source Separation (BSS) method for convolutive mixtures that can be underdetermined. Exploiting the sparsity of the source signals in the Time-Frequency (TF) domain, this method combines TF masking and beamforming. Indeed, on the one hand, BSS methods based on TF masking achieve remarkable performance even in the underdetermined case, however they tend to cause artifacts at the separated sources. On the other hand, beamforming can achieve good performance in the (over)-determined case without distorting the estimated signals. Therefore, combining these two techniques makes it possible to benefit from both their advantages. In the proposed method, unlike existing methods that use beamforming with TF masking, we introduce new normalized directional vectors to generate the different beamformers involved, and a new way for better estimating these vectors. In addition, we propose a new technique that can be used to separate sources in the case of underdetermined mixtures. Test results showed good performance for our method compared to various existing methods, similar in terms of working hypotheses, both in the determined and underdetermined cases.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Multi-Channel Bin-Wise Speech Separation Combining Time-Frequency Masking and Beamforming
    Bella, Mostafa
    Saylani, Hicham
    Hosseini, Shahram
    Deville, Yannick
    [J]. IEEE ACCESS, 2023, 11 : 100632 - 100645
  • [2] A frequency bin-wise nonlinear masking algorithm in convolutive mixtures for speech segregation
    Chi, Tai-Shih
    Huang, Ching-Wen
    Chou, Wen-Sheng
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 131 (05): : EL361 - EL367
  • [3] Underdetermined Convolutive Blind Source Separation via Frequency Bin-Wise Clustering and Permutation Alignment
    Sawada, Hiroshi
    Araki, Shoko
    Makino, Shoji
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (03): : 516 - 527
  • [4] Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
    Reju, Vaninirappuputhenpurayil Gopalan
    Koh, Soo Ngee
    Soon, Ing Yann
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (01): : 101 - 116
  • [5] Sound Source Separation by Using Matched Beamforming and Time-Frequency Masking
    Beh, Jounghoon
    Lee, Taekjin
    Han, David
    Ko, Hanseok
    [J]. IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010,
  • [6] Constructing Time-Frequency Dictionaries for Source Separation via Time-Frequency Masking and Source Localisation
    de Frein, Ruairi
    Rickard, Scott T.
    Pearlmutter, Barak A.
    [J]. INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2009, 5441 : 573 - +
  • [7] Blind source separation using time-frequency masking
    Mohammed, Abbas
    Ballal, Tarig
    Grbic, Nedelko
    [J]. RADIOENGINEERING, 2007, 16 (04) : 96 - 100
  • [8] Source Separation of Convolutive and Noisy Mixtures Using Audio-Visual Dictionary Learning and Probabilistic Time-Frequency Masking
    Liu, Qingju
    Wang, Wenwu
    Jackson, Philip J. B.
    Barnard, Mark
    Kittler, Josef
    Chambers, Jonathon
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2013, 61 (22) : 5520 - 5535
  • [9] TIME-FREQUENCY CLUSTERING WITH WEIGHTED AND CONTEXTUAL INFORMATION FOR CONVOLUTIVE BLIND SOURCE SEPARATION
    Jafari, Ingrid
    Atcheson, Matt
    Togneri, Roberto
    Nordholm, Sven
    [J]. 2014 IEEE WORKSHOP ON STATISTICAL SIGNAL PROCESSING (SSP), 2014, : 157 - 160
  • [10] Multichannel bin-wise robust frequency-domain adaptive filtering and its application to adaptive beamforming
    Herbordt, Wolfgang
    Buchner, Herbert
    Nakamura, Satoshi
    Kellermann, Walter
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (04): : 1340 - 1351