Combining Mask Estimates for Single Channel Audio Source Separation using Deep Neural Networks

被引:15
|
作者
Grais, Emad M. [1 ]
Roma, Gerard [1 ]
Simpson, Andrew J. R. [1 ]
Plumbley, Mark D. [1 ]
机构
[1] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford, Surrey, England
基金
英国工程与自然科学研究理事会;
关键词
Combining estimates; deep neural networks; single channel source separation; neural network ensembles; deep learning;
D O I
10.21437/Interspeech.2016-216
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Deep neural networks (DNNs) are usually used for single channel source separation to predict either soft or binary time frequency masks. The masks are used to separate the sources from the mixed signal. Binary masks produce separated sources with more distortion and less interference than soft masks. In this paper, we propose to use another DNN to combine the estimates of binary and soft masks to achieve the advantages and avoid the disadvantages of using each mask individually. We aim to achieve separated sources with low distortion and low interference between each other. Our experimental results show that combining the estimates of binary and soft masks using DNN achieves lower distortion than using each estimate individually and achieves as low interference as the binary mask.
引用
收藏
页码:3339 / 3343
页数:5
相关论文
共 50 条
  • [1] Discriminative Enhancement for Single Channel Audio Source Separation Using Deep Neural Networks
    Grais, Emad M.
    Roma, Gerard
    Simpson, Andrew J. R.
    Plumbley, Mark D.
    [J]. LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2017), 2017, 10169 : 236 - 246
  • [2] Two-Stage Single-Channel Audio Source Separation Using Deep Neural Networks
    Grais, Emad M.
    Roma, Gerard
    Simpson, Andrew J. R.
    Plumbley, Mark D.
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (09) : 1469 - 1479
  • [3] DEEP NEURAL NETWORKS FOR SINGLE CHANNEL SOURCE SEPARATION
    Grais, Emad M.
    Sen, Mehmet Umut
    Erdogan, Hakan
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [4] Single Channel Speech Source Separation Using Hierarchical Deep Neural Networks
    Noorani, Seyed Majid
    Seyedin, Sanaz
    [J]. 2020 28TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2020, : 466 - 470
  • [5] Monoaural Audio Source Separation Using Deep Convolutional Neural Networks
    Chandna, Pritish
    Miron, Marius
    Janer, Jordi
    Gomez, Emilia
    [J]. LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2017), 2017, 10169 : 258 - 266
  • [6] Ensemble System of Deep Neural Networks for Single-Channel Audio Separation
    Al-Kaltakchi, Musab T. S.
    Mohammad, Ahmad Saeed
    Woo, Wai Lok
    [J]. INFORMATION, 2023, 14 (07)
  • [7] Multichannel Audio Source Separation With Deep Neural Networks
    Nugraha, Aditya Arie
    Liutkus, Antoine
    Vincent, Emmanuel
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (09) : 1652 - 1664
  • [8] Referenceless Performance Evaluation of Audio Source Separation using Deep Neural Networks
    Grais, Emad M.
    Wierstorf, Hagen
    Ward, Dominic
    Mason, Russell
    Plumbley, Mark D.
    [J]. 2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [9] Towards Automated Single Channel Source Separation using Neural Networks
    Gang, Arpita
    Biyani, Pravesh
    Soni, Akshay
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3494 - 3498
  • [10] Single channel audio source separation
    Gao, Bin
    Woo, W.L.
    Dlay, S.S.
    [J]. WSEAS Transactions on Signal Processing, 2008, 4 (04): : 173 - 182