Monaural Source Separation Based on Adaptive Discriminative Criterion in Neural Networks

被引:0
|
作者
Sun, Yang [1 ]
Zhu, Lei [2 ]
Chambers, Jonathon A. [1 ]
Naqvi, Syed Mohsen [1 ]
机构
[1] Newcastle Univ, Sch Elect & Elect Engn, Newcastle Upon Tyne, Tyne & Wear, England
[2] Harbin Engn Univ, Sci Coll, Harbin, Heilongjiang, Peoples R China
关键词
Monaural Source Separation; Deep Recurrent Neural Network; Penalty Factor; Adaptive;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Monaural source separation is an important research area which can help to improve the performance of several real-world applications, such as speech recognition and assisted living systems. Huang et al. proposed deep recurrent neural networks (DRNNs) with discriminative criterion objective function to improve the performance of source separation. However, the penalty factor in the objective function is selected randomly and empirically. Therefore, we introduce an approach to calculate the parameter in the discriminative term adaptively via the discrepancy between target features. The penalty factor can be changed with inputs to improve the separation performance. The proposed method is evaluated with different settings and architectures of neural networks. In these experiments, the TIMIT corpus is explored as the database and the signal to distortion ratio (SDR) as the measurement. Comparing with the previous approach, our method has improved robustness and a better separation performance.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] DISCRIMINATIVE MULTIPLE SOUND SOURCE LOCALIZATION BASED ON DEEP NEURAL NETWORKS USING INDEPENDENT LOCATION MODEL
    Takeda, Ryu
    Komatani, Kazunori
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 603 - 609
  • [42] Distilled Binary Neural Network for Monaural Speech Separation
    Chen, Xiuyi
    Liu, Guangcan
    Shi, Jing
    Xu, Jiaming
    Xu, Bo
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [43] Nonlinear blind source separation by Spline Neural Networks
    Solazzi, M
    Piazza, F
    Uncini, A
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 2781 - 2784
  • [44] Fully Quantized Neural Networks for Audio Source Separation
    Cohen, Elad
    Habi, Hai Victor
    Peretz, Reuven
    Netzer, Arnon
    IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2024, 5 : 926 - 933
  • [45] Blind Source Separation for Convolutive Mixtures with Neural Networks
    Kirei, Botond Sandor
    Topa, Marina Dana
    Muresan, Irina
    Homana, Ioana
    Toma, Norbert
    ADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, 2011, 11 (01) : 63 - 68
  • [46] DEEP NEURAL NETWORKS FOR SINGLE CHANNEL SOURCE SEPARATION
    Grais, Emad M.
    Sen, Mehmet Umut
    Erdogan, Hakan
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [47] Multichannel Audio Source Separation With Deep Neural Networks
    Nugraha, Aditya Arie
    Liutkus, Antoine
    Vincent, Emmanuel
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (09) : 1652 - 1664
  • [48] Blind Source Separation Method Based on Neural Network with Bias Term and Maximum Likelihood Estimation Criterion
    Liu, Sheng
    Wang, Bangmin
    Zhang, Lanyong
    SENSORS, 2021, 21 (03) : 1 - 27
  • [49] Monaural Music Source Separation Using Deep Convolutional Neural Network Embedded with Feature Extraction Module
    Yu, Yongbin
    Peng, Chenhui
    Tang, Qian
    Wang, Xiangxiang
    2022 ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING (CACML 2022), 2022, : 546 - 551
  • [50] Adaptive learning for the neural classifier based on Fisher criterion
    Jacob, AM
    Hemerly, EM
    MACHINE LEARNING FOR SIGNAL PROCESSING XIV, 2004, : 103 - 112