Towards Automated Single Channel Source Separation using Neural Networks

被引:1
|
作者
Gang, Arpita [1 ]
Biyani, Pravesh [1 ]
Soni, Akshay
机构
[1] IIIT Delhi, New Delhi, India
关键词
Single Channel Source separation; Hyper parameter; Neural Network; Speech Recognition;
D O I
10.21437/Interspeech.2018-2065
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many applications of single channel source separation (SCSS) including automatic speech recognition (ASR), hearing aids etc. require an estimation of only one source from a mixture of many sources. Treating this special case as a regular SCSS problem where in all constituent sources are given equal priority in terms of reconstruction may result in a suboptimal separation performance. In this paper, we tackle the one source separation problem by suitably modifying the orthodox SCSS framework and focus only on one source at a time. The proposed approach is a generic framework that can be applied to any existing SCSS algorithm, improves performance, and scales well when there are more than two sources in the mixture unlike most existing SCSS methods. Additionally, existing SCSS algorithms rely on fine hyper-parameter tuning hence making them difficult to use in practice. Our framework takes a step towards automatic tuning of the hyper-parameters thereby making our method better suited for the mixture to be separated and thus practically more useful. We test our framework on a neural network based algorithm and the results show an improved performance in terms of SDR and SAR.
引用
收藏
页码:3494 / 3498
页数:5
相关论文
共 50 条
  • [41] Towards Designing an Automated Classification of Lymphoma subtypes using Deep Neural Networks
    Tambe, Rucha
    Mahajan, Sarang
    Shah, Urmil
    Agrawal, Mohit
    Garware, Bhushan
    PROCEEDINGS OF THE 6TH ACM IKDD CODS AND 24TH COMAD, 2019, : 143 - 149
  • [42] Towards Automated Regulation of Jacobaea Vulgaris in Grassland using Deep Neural Networks
    Schauer, Moritz
    Hohl, Renke
    Vaupel, Dennis
    Bienhaus, Diethelm
    Ghobadi, Seyed Eghbal
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 702 - 711
  • [43] JOINT TRAINING OF DEEP NEURAL NETWORKS FOR MULTI-CHANNEL DEREVERBERATION AND SPEECH SOURCE SEPARATION
    Togami, Masahito
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3032 - 3036
  • [44] Blind Source Separation of Single Channel Mixture Using Tensorization and Tensor Diagonalization
    Phan, Anh-Huy
    Tichavsky, Petr
    Cichocki, Andrzej
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2017), 2017, 10169 : 36 - 46
  • [45] Single Channel Blind Source Separation Using Optimized Local Mean Decomposition
    Guo, Yina
    Ren, Xiaowen
    Sun, Chaoli
    Tian, Wenyan
    2017 13TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2017, : 2743 - 2748
  • [46] Single Channel Blind Source Separation using Dual Extended Kalman Filter
    Dutt, Rashi
    Mondal, Sayon
    Acharyya, Amit
    2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
  • [47] BLIND BOUNDED SOURCE SEPARATION USING NEURAL NETWORKS WITH LOCAL LEARNING RULES
    Erdogan, Alper T.
    Pehlevan, Cengiz
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3812 - 3816
  • [48] Referenceless Performance Evaluation of Audio Source Separation using Deep Neural Networks
    Grais, Emad M.
    Wierstorf, Hagen
    Ward, Dominic
    Mason, Russell
    Plumbley, Mark D.
    2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [49] LOW-LATENCY SOUND SOURCE SEPARATION USING DEEP NEURAL NETWORKS
    Naithani, Gaurav
    Parascandolo, Giambattista
    Barker, Tom
    Pontoppidan, Niels Henrik
    Virtanen, Tuomas
    2016 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2016, : 272 - 276
  • [50] Conditional Automated Channel Pruning for Deep Neural Networks
    Liu, Yixin
    Guo, Yong
    Guo, Jiaxin
    Jiang, Luoqian
    Chen, Jian
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1275 - 1279