DISTRIBUTED SPEECH SEPARATION IN SPATIALLY UNCONSTRAINED MICROPHONE ARRAYS

被引:2
|
作者
Furnon, Nicolas [1 ]
Serizel, Romain [1 ]
Illina, Irina [1 ]
Essid, Slim [2 ]
机构
[1] Univ Lorraine, CNRS, INRIA, Loria, F-54000 Nancy, France
[2] Inst Polytech Paris, LTCI, Telecom Paris, Palaiseau, France
关键词
Speech separation; microphone arrays; distributed processing; RECOGNITION;
D O I
10.1109/ICASSP39728.2021.9414758
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech separation with several speakers is a challenging task because of the non-stationarity of the speech and the strong signal similarity between interferent sources. Current state-of-the-art solutions can separate well the different sources using sophisticated deep neural networks which are very tedious to train. When several microphones are available, spatial information can be exploited to design much simpler algorithms to discriminate speakers. We propose a distributed algorithm that can process spatial information in a spatially unconstrained microphone array. The algorithm relies on a convolutional recurrent neural network that can exploit the signal diversity from the distributed nodes. In a typical case of a meeting room, this algorithm can capture an estimate of each source in a first step and propagate it over the microphone array in order to increase the separation performance in a second step. We show that this approach performs even better when the number of sources and nodes increases. We also study the influence of a mismatch in the number of sources between the training and testing conditions.
引用
收藏
页码:4490 / 4494
页数:5
相关论文
共 50 条
  • [1] DNN-Based Mask Estimation for Distributed Speech Enhancement in Spatially Unconstrained Microphone Arrays
    Furnon, Nicolas
    Serizel, Romain
    Essid, Slim
    Illina, Irina
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 2310 - 2323
  • [2] Attention-based distributed speech enhancement for unconstrained microphone arrays with varying number of nodes
    Furnon, Nicolas
    Serizel, Romain
    Essid, Slim
    Illina, Irina
    [J]. 29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 1095 - 1099
  • [3] Location Feature Integration for Clustering-Based Speech Separation in Distributed Microphone Arrays
    Souden, Mehrez
    Kinoshita, Keisuke
    Delcroix, Marc
    Nakatani, Tomohiro
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (02) : 354 - 367
  • [4] Continuous Speech Separation with Ad Hoc Microphone Arrays
    Wang, Dongmei
    Yoshioka, Takuya
    Chen, Zhuo
    Wang, Xiaofei
    Zhou, Tianyan
    Meng, Zhong
    [J]. 29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 1100 - 1104
  • [5] Variational probabilistic speech separation using microphone arrays
    Rennie, Steven J.
    Aarabi, Parham
    Frey, Brendan J.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01): : 135 - 149
  • [6] Speech Enhancement in Distributed Microphone Arrays Using Polynomial Eigenvalue Decomposition
    d'Olne, Emilie
    Neo, Vincent W.
    Naylor, Patrick A.
    [J]. 2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 55 - 59
  • [7] AN INTEGRATION OF SOURCE LOCATION CUES FOR SPEECH CLUSTERING IN DISTRIBUTED MICROPHONE ARRAYS
    Souden, Mehrez
    Kinoshita, Keisuke
    Nakatani, Tomohiro
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 111 - 115
  • [8] SPEECH SEPARATION USING PARTIALLY ASYNCHRONOUS MICROPHONE ARRAYS WITHOUT RESAMPLING
    Corey, Ryan M.
    Singer, Andrew C.
    [J]. 2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 111 - 115
  • [9] Neural Speech Separation Using Spatially Distributed Microphones
    Wang, Dongmei
    Chen, Zhuo
    Yoshioka, Takuya
    [J]. INTERSPEECH 2020, 2020, : 339 - 343
  • [10] DISTRIBUTED MICROPHONE ARRAY PROCESSING FOR SPEECH SOURCE SEPARATION WITH CLASSIFIER FUSION
    Souden, Mehrez
    Kinoshita, Keisuke
    Delcroix, Marc
    Nakatani, Tomohiro
    [J]. 2012 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2012,