COMBINING DEEP NEURAL NETWORKS AND BEAMFORMING FOR REAL-TIME MULTI-CHANNEL SPEECH ENHANCEMENT USING A WIRELESS ACOUSTIC SENSOR NETWORK

被引:0
|
作者
Ceolini, Enea [1 ]
Liu, Shih-Chii
机构
[1] Univ Zurich, Inst Neuroinformat, Zurich, Switzerland
基金
瑞士国家科学基金会;
关键词
speech enhancement; beamforming; deep neural networks; wireless acoustic sensor networks; NOISE; SEPARATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work presents a multi-channel speech enhancement algorithm using a neural network combined with beamforming deployed realtime on a wireless acoustic sensor network (WASN) of distributed microphones. We combine spectral mask estimation via a deep neural network together with spatial filtering to obtain a robust speech enhancement system even in difficult real-world scenarios (e.g. speech in noise, reverberant environments). Although the model is trained on simulated data, it performs comparably well on real-world tasks relative to an ideal oracle beamformer. We show that the model can be deployed on a WASN platform that allows for remote placement of microphones and on-board computing. We consider models with a small parameter count and low computational complexity. It achieves signal-to-distortion ratio (SDR) improvements of up to 10 dB in a real-world scenario and runs real-time on-board the WASN, with a latency in the order of hundreds of milliseconds.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] A Causal U-net based Neural Beamforming Network for Real-Time Multi-Channel Speech Enhancement
    Ren, Xinlei
    Zhang, Xu
    Chen, Lianwu
    Zheng, Xiguang
    Zhang, Chen
    Guo, Liang
    Yu, Bing
    [J]. INTERSPEECH 2021, 2021, : 1832 - 1836
  • [2] A Complex Neural Network Adaptive Beamforming for Multi-channel Speech Enhancement in Time Domain
    Jiang, Tao
    Liu, Hongqing
    Zhou, Yi
    Gan, Lu
    [J]. COMMUNICATIONS AND NETWORKING (CHINACOM 2021), 2022, : 129 - 139
  • [3] Real-time Multi-channel Speech Enhancement Based on Neural Network Masking with Attention Model
    Xue, Cheng
    Huang, Weilong
    Chen, Weiguang
    Feng, Jinwei
    [J]. INTERSPEECH 2021, 2021, : 1862 - 1866
  • [4] Real-Time Multi-Channel Acoustic Beamforming Using a Lightweight Microcontroller Processor
    Leary, Paul R.
    Dobrokhodov, Vladimir N.
    Jones, Kevin D.
    Smith, Kevin B.
    [J]. OCEANS 2019 MTS/IEEE SEATTLE, 2019,
  • [5] An Adaptive Multi-channel Approach for Real-Time Multimedia Wireless Sensor Networks
    Cortes, A.
    Pena, N. M.
    Labrador, M. A.
    [J]. IEEE LATIN AMERICA TRANSACTIONS, 2010, 8 (04) : 370 - 376
  • [6] An Adaptive Multi-channel Approach for Real-Time Multimedia Wireless Sensor Networks
    Cortes Cabezas, Albeiro
    Pena T, Nestor M.
    Labrador, Miguel A.
    [J]. 2009 IEEE LATIN-AMERICAN CONFERENCE ON COMMUNICATIONS (LATINCOM 2009), 2009, : 22 - +
  • [7] DEEP BEAMFORMING NETWORKS FOR MULTI-CHANNEL SPEECH RECOGNITION
    Xiao, Xiong
    Watanabe, Shinji
    Erdogan, Hakan
    Lu, Liang
    Hershey, John
    Seltzer, Michael L.
    Chen, Guoguo
    Zhang, Yu
    Mandel, Michael
    Yu, Dong
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5745 - 5749
  • [8] A Neural Beamspace-Domain Filter for Real-Time Multi-Channel Speech Enhancement
    Liu, Wenzhe
    Li, Andong
    Wang, Xiao
    Yuan, Minmin
    Chen, Yi
    Zheng, Chengshi
    Li, Xiaodong
    [J]. SYMMETRY-BASEL, 2022, 14 (06):
  • [9] MULTI-CHANNEL SPEECH ENHANCEMENT USING GRAPH NEURAL NETWORKS
    Tzirakis, Panagiotis
    Kumar, Anurag
    Donley, Jacob
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3415 - 3419
  • [10] CONSISTENCY-AWARE MULTI-CHANNEL SPEECH ENHANCEMENT USING DEEP NEURAL NETWORKS
    Masuyama, Yoshiki
    Togami, Masahito
    Komatsu, Tatsuya
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 821 - 825