Bin-Wise Combination of Time-Frequency Masking and Beamforming for Convolutive Source Separation

Cited by: 0

Authors:

Bella, Mostafa [1,2]

Saylani, Hicham [1]

Hosseini, Shahram [2]

Deville, Yannick [2]

Affiliations:

[1] Univ Ibn Zohr, Fac Sci, LETSMP, BP 8106 Cite Dakhla, Agadir, Morocco

[2] Univ Toulouse, IRAP, UPS, CNRS, CNES, 14 Av Edouard Belin, F-31400 Toulouse, France

Source:

2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP) | 2022

Keywords:

Blind Source Separation; Underdetermined Convolutive Mixtures; Sparsity; TF Masking; Beamforming

DOI:

10.1109/MMSP55362.2022.9949527

Chinese Library Classification (CLC) number:

TP31 [Computer Software]

Discipline classification codes:

081202; 0835

Abstract:

This paper presents a new Blind Source Separation (BSS) method for convolutive mixtures, including underdetermined ones. The method exploits the sparsity of the source signals in the Time-Frequency (TF) domain and combines TF masking with beamforming. On the one hand, BSS methods based on TF masking achieve remarkable performance even in the underdetermined case, but they tend to introduce artifacts into the separated sources. On the other hand, beamforming can achieve good performance in the (over)determined case without distorting the estimated signals. Combining the two techniques therefore makes it possible to benefit from the advantages of both. Unlike existing methods that use beamforming with TF masking, the proposed method introduces new normalized directional vectors to generate the beamformers involved, together with a new way of estimating these vectors more accurately. In addition, we propose a new technique for separating sources in the case of underdetermined mixtures. In tests, our method performed well compared with various existing methods that rely on similar working hypotheses, in both the determined and underdetermined cases.
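
The general idea of pairing TF masks with beamformers in a single frequency bin can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not specify how the directional vectors are normalized or estimated, so the sketch assumes a generic approach (principal eigenvector of the masked covariance, unit norm, phase referenced to the first microphone) and a pseudo-inverse beamformer. The function names `estimate_directional_vectors` and `separate_bin` are hypothetical, and the binary masks are assumed to be given.

```python
import numpy as np

def estimate_directional_vectors(X, masks):
    """Estimate one normalized directional vector per source in one frequency bin.

    X:     (M, T) complex STFT frames of the M microphone signals in this bin.
    masks: (N, T) binary masks, masks[n, t] = 1 where source n is assumed dominant.
    Assumption: the principal eigenvector of the masked covariance is used;
    the paper's own estimator is not described in the abstract.
    """
    M, _ = X.shape
    N = masks.shape[0]
    A = np.zeros((M, N), dtype=complex)
    for n in range(N):
        Xn = X[:, masks[n] > 0]                    # frames attributed to source n
        R = Xn @ Xn.conj().T / max(Xn.shape[1], 1)  # masked spatial covariance
        _, vecs = np.linalg.eigh(R)                # eigenvalues in ascending order
        a = vecs[:, -1]                            # principal eigenvector
        a = a * np.exp(-1j * np.angle(a[0]))       # remove phase at microphone 0
        A[:, n] = a / np.linalg.norm(a)            # unit-norm directional vector
    return A

def separate_bin(X, masks):
    """Apply a pseudo-inverse beamformer built from the estimated vectors."""
    A = estimate_directional_vectors(X, masks)
    W = np.linalg.pinv(A)                          # (N, M) demixing matrix
    return W @ X, A
```

With as many microphones as sources and accurate masks, the pseudo-inverse places a null on each interfering source, so the outputs are linear combinations of the observations rather than masked spectra. With more sources than microphones the pseudo-inverse can no longer cancel every interferer; the abstract states that the authors propose a separate technique for that case, which this sketch does not attempt to reproduce.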


Pages: 6
