Bidirectional Truncated Recurrent Neural Networks for Efficient Speech Denoising

被引:0
|
作者
Brakel, Philemon [1 ]
Stroobandt, Dirk [1 ]
Schrauwen, Benjamin [1 ]
机构
[1] Univ Ghent, Dept Elect & Informat Syst, Ghent, Belgium
关键词
recurrent neural networks; deep learning; robust ASR;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a bidirectional truncated recurrent neural network architecture for speech denoising. Recent work showed that deep recurrent neural networks perform well at speech denoising tasks and outperform feed forward architectures [1]. However, recurrent neural networks are difficult to train and their simulation does not allow for much parallelization. Given the increasing availability of parallel computing architectures like GPUs this is disadvantageous. The architecture we propose aims to retain the positive properties of recurrent neural networks and deep learning while remaining highly parallelizable. Unlike a standard recurrent neural network, it processes information from both past and future time steps. We evaluate two variants of this architecture on the Aurora2 task for robust ASR where they show promising results. The models outperform the ETSI2 advanced front end and the SPLICE algorithm under matching noise conditions.
引用
收藏
页码:2972 / 2976
页数:5
相关论文
共 50 条
  • [21] Unfolded Recurrent Neural Networks for Speech Recognition
    Saon, George
    Soltau, Hagen
    Emami, Ahmad
    Picheny, Michael
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 343 - 347
  • [22] SPEECH RECOGNITION WITH DEEP RECURRENT NEURAL NETWORKS
    Graves, Alex
    Mohamed, Abdel-rahman
    Hinton, Geoffrey
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6645 - 6649
  • [23] Variational Recurrent Neural Networks for Speech Separation
    Chien, Jen-Tzung
    Kuo, Kuan-Ting
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1193 - 1197
  • [24] Efficient Gated Convolutional Recurrent Neural Networks for Real-Time Speech Enhancement
    Fazal-E-Wahab
    Ye, Zhongfu
    Saleem, Nasir
    Ali, Hamza
    Ali, Imad
    [J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2023,
  • [25] Localisation in Wireless Networks using Deep Bidirectional Recurrent Neural Networks
    Lynch, David
    Ho, Lester
    MacDonald, Michael
    O'Neill, Michael
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [26] Wavelet denoising of speech using neural networks for threshold selection
    Medina, CA
    Alcaim, A
    Apolinario, JA
    [J]. ELECTRONICS LETTERS, 2003, 39 (25) : 1869 - 1871
  • [27] ON TRAINING RECURRENT NETWORKS WITH TRUNCATED BACKPROPAGATION THROUGH TIME IN SPEECH RECOGNITION
    Tang, Hao
    Glass, James
    [J]. 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 48 - 55
  • [28] Signal Denoising with Recurrent Spiking Neural Networks and Active Tuning
    Ciurletti, Melvin
    Traub, Manuel
    Karlbauer, Matthias
    Butz, Martin, V
    Otte, Sebastian
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2021, PT V, 2021, 12895 : 220 - 232
  • [29] Investigating Bidirectional Recurrent Neural Network Language Models for Speech Recognition
    Chen, X.
    Ragni, A.
    Liu, X.
    Gales, M. J. F.
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 269 - 273
  • [30] Classification, Denoising, and Deinterleaving of Pulse Streams With Recurrent Neural Networks
    Liu, Zhang-Meng
    Yu, Philip S.
    [J]. IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2019, 55 (04) : 1624 - 1639