Bidirectional Truncated Recurrent Neural Networks for Efficient Speech Denoising

被引：0

作者：

Brakel, Philemon ^{[1
]}

Stroobandt, Dirk ^{[1
]}

Schrauwen, Benjamin ^{[1
]}

机构：

[1] Univ Ghent, Dept Elect & Informat Syst, Ghent, Belgium

来源：

14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 | 2013年

关键词：

recurrent neural networks; deep learning; robust ASR;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose a bidirectional truncated recurrent neural network architecture for speech denoising. Recent work showed that deep recurrent neural networks perform well at speech denoising tasks and outperform feed forward architectures [1]. However, recurrent neural networks are difficult to train and their simulation does not allow for much parallelization. Given the increasing availability of parallel computing architectures like GPUs this is disadvantageous. The architecture we propose aims to retain the positive properties of recurrent neural networks and deep learning while remaining highly parallelizable. Unlike a standard recurrent neural network, it processes information from both past and future time steps. We evaluate two variants of this architecture on the Aurora2 task for robust ASR where they show promising results. The models outperform the ETSI2 advanced front end and the SPLICE algorithm under matching noise conditions.

引用

页码：2972 / 2976

页数：5

共 50 条

[21] Unfolded Recurrent Neural Networks for Speech Recognition
Saon, George
Soltau, Hagen
Emami, Ahmad
Picheny, Michael
[J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 343 - 347
[22] SPEECH RECOGNITION WITH DEEP RECURRENT NEURAL NETWORKS
Graves, Alex
Mohamed, Abdel-rahman
Hinton, Geoffrey
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6645 - 6649
[23] Variational Recurrent Neural Networks for Speech Separation
Chien, Jen-Tzung
Kuo, Kuan-Ting
[J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1193 - 1197
[24] Efficient Gated Convolutional Recurrent Neural Networks for Real-Time Speech Enhancement
Fazal-E-Wahab
Ye, Zhongfu
Saleem, Nasir
Ali, Hamza
Ali, Imad
[J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2023,
[25] Localisation in Wireless Networks using Deep Bidirectional Recurrent Neural Networks
Lynch, David
Ho, Lester
MacDonald, Michael
O'Neill, Michael
[J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
[26] Wavelet denoising of speech using neural networks for threshold selection
Medina, CA
Alcaim, A
Apolinario, JA
[J]. ELECTRONICS LETTERS, 2003, 39 (25) : 1869 - 1871
[27] ON TRAINING RECURRENT NETWORKS WITH TRUNCATED BACKPROPAGATION THROUGH TIME IN SPEECH RECOGNITION
Tang, Hao
Glass, James
[J]. 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 48 - 55
[28] Signal Denoising with Recurrent Spiking Neural Networks and Active Tuning
Ciurletti, Melvin
Traub, Manuel
Karlbauer, Matthias
Butz, Martin, V
Otte, Sebastian
[J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2021, PT V, 2021, 12895 : 220 - 232
[29] Investigating Bidirectional Recurrent Neural Network Language Models for Speech Recognition
Chen, X.
Ragni, A.
Liu, X.
Gales, M. J. F.
[J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 269 - 273
[30] Classification, Denoising, and Deinterleaving of Pulse Streams With Recurrent Neural Networks
Liu, Zhang-Meng
Yu, Philip S.
[J]. IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2019, 55 (04) : 1624 - 1639

← 1 2 3 4 5 →