INCREMENTAL BINARIZATION ON RECURRENT NEURAL NETWORKS FOR SINGLE-CHANNEL SOURCE SEPARATION

被引:0
|
作者
Kim, Sunwoo [1 ]
Maity, Mrinmoy [1 ]
Kim, Minje [1 ]
机构
[1] Indiana Univ, Dept Intelligent Syst Engn, Bloomington, IN 47408 USA
关键词
Speech Enhancement; Recurrent Neural Networks; Gated Recurrent Units; Bitwise Neural Networks;
D O I
10.1109/icassp.2019.8682595
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper proposes a Bitwise Gated Recurrent Unit (BGRU) network for the single- channel source separation task. Recurrent Neural Networks (RNN) require several sets of weights within its cells, which significantly increases the computational cost compared to the fully- connected networks. To mitigate this increased computation, we focus on the GRU cells and quantize the feedforward procedure with binarized values and bitwise operations. The BGRU network is trained in two stages. The real- valued weights are pretrained and transferred to the bitwise network, which are then incrementally binarized to minimize the potential loss that can occur from a sudden introduction of quantization. As the proposed binarization technique turns only a few randomly chosen parameters into their binary versions, it gives the network training procedure a chance to gently adapt to the partly quantized version of the network. It eventually achieves the full binarization by incrementally increasing the amount of binarization over the iterations. Our experiments show that the proposed BGRU method produces source separation results greater than that of a real-valued fully connected network, with 11-12 dB mean Signal-to-Distortion Ratio (SDR). A fully binarized BGRU still outperforms a Bitwise Neural Network (BNN) by 1-2 dB even with less number of layers.
引用
收藏
页码:376 / 380
页数:5
相关论文
共 50 条
  • [1] BITWISE NEURAL NETWORKS FOR EFFICIENT SINGLE-CHANNEL SOURCE SEPARATION
    Kim, Minje
    Smaragdis, Paris
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 701 - 705
  • [2] Discriminatively Trained Recurrent Neural Networks for Single-Channel Speech Separation
    Weninger, Felix
    Hershey, John R.
    Le Roux, Jonathan
    Schuller, Bjoern
    [J]. 2014 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2014, : 577 - 581
  • [3] SINGLE-CHANNEL SPEECH SEPARATION WITH MEMORY-ENHANCED RECURRENT NEURAL NETWORKS
    Weninger, Felix
    Eyben, Florian
    Schuller, Bjoern
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [4] Two-Stage Single-Channel Audio Source Separation Using Deep Neural Networks
    Grais, Emad M.
    Roma, Gerard
    Simpson, Andrew J. R.
    Plumbley, Mark D.
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (09) : 1469 - 1479
  • [5] DEEP NEURAL NETWORKS FOR SINGLE CHANNEL SOURCE SEPARATION
    Grais, Emad M.
    Sen, Mehmet Umut
    Erdogan, Hakan
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [6] Ensemble System of Deep Neural Networks for Single-Channel Audio Separation
    Al-Kaltakchi, Musab T. S.
    Mohammad, Ahmad Saeed
    Woo, Wai Lok
    [J]. INFORMATION, 2023, 14 (07)
  • [7] DESIGNING MULTICHANNEL SOURCE SEPARATION BASED ON SINGLE-CHANNEL SOURCE SEPARATION
    Lopez, A. Ramirez
    Ono, N.
    Remes, U.
    Palomaki, K.
    Kurimo, M.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 469 - 473
  • [8] Single-channel phaseless blind source separation
    Hameed, Humera
    Ahmed, Ali
    Fayyaz, Ubaid U.
    [J]. TELECOMMUNICATION SYSTEMS, 2022, 80 (03) : 469 - 475
  • [9] Single-channel phaseless blind source separation
    Humera Hameed
    Ali Ahmed
    Ubaid U. Fayyaz
    [J]. Telecommunication Systems, 2022, 80 : 469 - 475
  • [10] A maximum likelihood approach to single-channel source separation
    Jang, GJ
    Lee, TW
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 4 (7-8) : 1365 - 1392