A multichannel learning-based approach for sound source separation in reverberant environments

被引:1
|
作者
Chen, You-Siang [1 ]
Lin, Zi-Jie [1 ]
Bai, Mingsian R. [1 ]
机构
[1] Natl Tsing Hua Univ, Dept Power Mech Engn Elect Engn, Hsinchu, Taiwan
关键词
Source separation and dereverberation; Multichannel learning-based network; Time-dilated convolution network; U-net; Beamforming; SPEECH DEREVERBERATION; JOINT OPTIMIZATION; BLIND SEPARATION; NETWORK; MIXTURES;
D O I
10.1186/s13636-021-00227-2
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, a multichannel learning-based network is proposed for sound source separation in reverberant field. The network can be divided into two parts according to the training strategies. In the first stage, time-dilated convolutional blocks are trained to estimate the array weights for beamforming the multichannel microphone signals. Next, the output of the network is processed by a weight-and-sum operation that is reformulated to handle real-valued data in the frequency domain. In the second stage, a U-net model is concatenated to the beamforming network to serve as a non-linear mapping filter for joint separation and dereverberation. The scale invariant mean square error (SI-MSE) that is a frequency-domain modification from the scale invariant signal-to-noise ratio (SI-SNR) is used as the objective function for training. Furthermore, the combined network is also trained with the speech segments filtered by a great variety of room impulse responses. Simulations are conducted for comprehensive multisource scenarios of various subtending angles of sources and reverberation times. The proposed network is compared with several baseline approaches in terms of objective evaluation matrices. The results have demonstrated the excellent performance of the proposed network in dereverberation and separation, as compared to baseline methods.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] A multichannel learning-based approach for sound source separation in reverberant environments
    You-Siang Chen
    Zi-Jie Lin
    Mingsian R. Bai
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2021
  • [2] Sound Source Localization in Reverberant Environments Based on Structural Sparse Bayesian Learning
    Liu, Yanshan
    Wang, Lu
    Zeng, Xiangyang
    Wang, Haitao
    [J]. ACTA ACUSTICA UNITED WITH ACUSTICA, 2018, 104 (03) : 528 - 541
  • [3] A LEARNING-BASED APPROACH TO DIRECTION OF ARRIVAL ESTIMATION IN NOISY AND REVERBERANT ENVIRONMENTS
    Xiao, Xiong
    Zhao, Shengkui
    Zhong, Xionghu
    Jones, Douglas L.
    Chng, Eng Siong
    Li, Haizhou
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 2814 - 2818
  • [4] Boosting Spatial Information for Deep Learning Based Multichannel Speaker-Independent Speech Separation In Reverberant Environments
    Yang, Ziye
    Zhang, Xiao-Lei
    [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1506 - 1510
  • [5] Sound Source Localization Based on Robust Least Squares in Reverberant Environments
    Zhu, Hongyan
    Dang, Xudong
    Li, Zelin
    Ge, Quanbo
    [J]. 2018 21ST INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2018, : 2029 - 2035
  • [6] Deep Learning Based Binaural Speech Separation in Reverberant Environments
    Zhang, Xueliang
    Wang, DeLiang
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (05) : 1075 - 1084
  • [7] Position estimation of binaural sound source in reverberant environments
    Ghamdan, Lama
    Shoman, Mahmoud A. Ismail
    Abd Elwahab, Reda
    Ghamry, Nivin Abo El-Hadid
    [J]. EGYPTIAN INFORMATICS JOURNAL, 2017, 18 (02) : 87 - 93
  • [8] Robust MUSIC-Based Sound Source Localization in Reverberant and Echoic Environments
    Sewtz, Marco
    Bodenmueller, Tim
    Triebel, Rudolph
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 2474 - 2480
  • [9] MONAURAL SOURCE SEPARATION: FROM ANECHOIC TO REVERBERANT ENVIRONMENTS
    Cord-Landwehr, Tobias
    Boeddeker, Christoph
    Von Neumann, Thilo
    Zorila, Catalin
    Doddipatla, Rama
    Haeb-Umbach, Reinhold
    [J]. 2022 INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC 2022), 2022,
  • [10] Real-time source separation based on sound localization in a reverberant environment
    Aoki, M
    Furuya, K
    [J]. NEURAL NETWORKS FOR SIGNAL PROCESSING XII, PROCEEDINGS, 2002, : 475 - 484