INCREMENTAL BINARIZATION ON RECURRENT NEURAL NETWORKS FOR SINGLE-CHANNEL SOURCE SEPARATION

被引：0

作者：

Kim, Sunwoo ^{[1
]}

Maity, Mrinmoy ^{[1
]}

Kim, Minje ^{[1
]}

机构：

[1] Indiana Univ, Dept Intelligent Syst Engn, Bloomington, IN 47408 USA

来源：

2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2019年

关键词：

Speech Enhancement; Recurrent Neural Networks; Gated Recurrent Units; Bitwise Neural Networks;

D O I：

10.1109/icassp.2019.8682595

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper proposes a Bitwise Gated Recurrent Unit (BGRU) network for the single- channel source separation task. Recurrent Neural Networks (RNN) require several sets of weights within its cells, which significantly increases the computational cost compared to the fully- connected networks. To mitigate this increased computation, we focus on the GRU cells and quantize the feedforward procedure with binarized values and bitwise operations. The BGRU network is trained in two stages. The real- valued weights are pretrained and transferred to the bitwise network, which are then incrementally binarized to minimize the potential loss that can occur from a sudden introduction of quantization. As the proposed binarization technique turns only a few randomly chosen parameters into their binary versions, it gives the network training procedure a chance to gently adapt to the partly quantized version of the network. It eventually achieves the full binarization by incrementally increasing the amount of binarization over the iterations. Our experiments show that the proposed BGRU method produces source separation results greater than that of a real-valued fully connected network, with 11-12 dB mean Signal-to-Distortion Ratio (SDR). A fully binarized BGRU still outperforms a Bitwise Neural Network (BNN) by 1-2 dB even with less number of layers.

引用

页码：376 / 380

页数：5

共 50 条

[1] BITWISE NEURAL NETWORKS FOR EFFICIENT SINGLE-CHANNEL SOURCE SEPARATION
Kim, Minje
Smaragdis, Paris
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 701 - 705
[2] Discriminatively Trained Recurrent Neural Networks for Single-Channel Speech Separation
Weninger, Felix
Hershey, John R.
Le Roux, Jonathan
Schuller, Bjoern
[J]. 2014 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2014, : 577 - 581
[3] SINGLE-CHANNEL SPEECH SEPARATION WITH MEMORY-ENHANCED RECURRENT NEURAL NETWORKS
Weninger, Felix
Eyben, Florian
Schuller, Bjoern
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[4] Two-Stage Single-Channel Audio Source Separation Using Deep Neural Networks
Grais, Emad M.
Roma, Gerard
Simpson, Andrew J. R.
Plumbley, Mark D.
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (09) : 1469 - 1479
[5] DEEP NEURAL NETWORKS FOR SINGLE CHANNEL SOURCE SEPARATION
Grais, Emad M.
Sen, Mehmet Umut
Erdogan, Hakan
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[6] Ensemble System of Deep Neural Networks for Single-Channel Audio Separation
Al-Kaltakchi, Musab T. S.
Mohammad, Ahmad Saeed
Woo, Wai Lok
[J]. INFORMATION, 2023, 14 (07)
[7] DESIGNING MULTICHANNEL SOURCE SEPARATION BASED ON SINGLE-CHANNEL SOURCE SEPARATION
Lopez, A. Ramirez
Ono, N.
Remes, U.
Palomaki, K.
Kurimo, M.
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 469 - 473
[8] Single-channel phaseless blind source separation
Hameed, Humera
Ahmed, Ali
Fayyaz, Ubaid U.
[J]. TELECOMMUNICATION SYSTEMS, 2022, 80 (03) : 469 - 475
[9] Single-channel phaseless blind source separation
Humera Hameed
Ali Ahmed
Ubaid U. Fayyaz
[J]. Telecommunication Systems, 2022, 80 : 469 - 475
[10] A maximum likelihood approach to single-channel source separation
Jang, GJ
Lee, TW
[J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 4 (7-8) : 1365 - 1392

← 1 2 3 4 5 →