Towards Automated Single Channel Source Separation using Neural Networks

被引:1
|
作者
Gang, Arpita [1 ]
Biyani, Pravesh [1 ]
Soni, Akshay
机构
[1] IIIT Delhi, New Delhi, India
关键词
Single Channel Source separation; Hyper parameter; Neural Network; Speech Recognition;
D O I
10.21437/Interspeech.2018-2065
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many applications of single channel source separation (SCSS) including automatic speech recognition (ASR), hearing aids etc. require an estimation of only one source from a mixture of many sources. Treating this special case as a regular SCSS problem where in all constituent sources are given equal priority in terms of reconstruction may result in a suboptimal separation performance. In this paper, we tackle the one source separation problem by suitably modifying the orthodox SCSS framework and focus only on one source at a time. The proposed approach is a generic framework that can be applied to any existing SCSS algorithm, improves performance, and scales well when there are more than two sources in the mixture unlike most existing SCSS methods. Additionally, existing SCSS algorithms rely on fine hyper-parameter tuning hence making them difficult to use in practice. Our framework takes a step towards automatic tuning of the hyper-parameters thereby making our method better suited for the mixture to be separated and thus practically more useful. We test our framework on a neural network based algorithm and the results show an improved performance in terms of SDR and SAR.
引用
收藏
页码:3494 / 3498
页数:5
相关论文
共 50 条
  • [31] Fetal Phonocardiogram Extraction Using Single Channel Blind Source Separation
    Samieinasab, Maryam
    Sameni, Reza
    2015 23RD IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2015, : 78 - 83
  • [32] LOWLATENCY SOUND SOURCE SEPARATION USING CONVOLUTIONAL RECURRENT NEURAL NETWORKS
    Naithani, Gaurav
    Barker, Tom
    Parascandolo, Giambattista
    Bramslow, Lars
    Pontoppidan, Niels Henrik
    Virtanen, Tuomas
    2017 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2017, : 71 - 75
  • [33] Monoaural Audio Source Separation Using Deep Convolutional Neural Networks
    Chandna, Pritish
    Miron, Marius
    Janer, Jordi
    Gomez, Emilia
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2017), 2017, 10169 : 258 - 266
  • [34] DESIGNING MULTICHANNEL SOURCE SEPARATION BASED ON SINGLE-CHANNEL SOURCE SEPARATION
    Lopez, A. Ramirez
    Ono, N.
    Remes, U.
    Palomaki, K.
    Kurimo, M.
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 469 - 473
  • [35] A Novel Approach for Single Channel Source Separation
    Lin, Yan-Bo
    Lee, Yuan-Shan
    Tuan Pham
    Tai, Tzu-Chiang
    Wang, Jia-Ching
    2016 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN (ICCE-TW), 2016, : 351 - 352
  • [36] REPRESENTATION MODELS IN SINGLE CHANNEL SOURCE SEPARATION
    Zoehrer, Matthias
    Pernkopf, Franz
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 713 - 717
  • [37] Hybrid neural networks for ISFET source separation
    Bermejo, S
    Bedoya, G
    Cabestany, J
    SMART SENSORS, ACTUATORS, AND MEMS, PTS 1 AND 2, 2003, 5116 : 109 - 119
  • [38] Automated Atrial Fibrillation Source Detection Using Shallow Convolutional Neural Networks
    Lira, Isac N.
    de Oliveira, Pedro Marinho R.
    Freitas Jr, Walter
    Zarzoso, Vicente
    2020 COMPUTING IN CARDIOLOGY, 2020,
  • [39] SINGLE-CHANNEL SPEECH SEPARATION WITH MEMORY-ENHANCED RECURRENT NEURAL NETWORKS
    Weninger, Felix
    Eyben, Florian
    Schuller, Bjoern
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [40] Towards Automated Quantification of the Tumour Microenvironment using Fully Convolutional Neural Networks
    Wilson, B. I.
    Magee, D. R.
    Grabsch, H. I.
    West, N. P.
    Quirke, P.
    JOURNAL OF PATHOLOGY, 2022, 256 : S13 - S13