Towards Automated Single Channel Source Separation using Neural Networks

Cited by: 1

Authors:

Gang, Arpita [1]

Biyani, Pravesh [1]

Soni, Akshay

Affiliations:

[1] IIIT Delhi, New Delhi, India

Source:

19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES | 2018

Keywords:

Single Channel Source separation; Hyper parameter; Neural Network; Speech Recognition;

DOI:

10.21437/Interspeech.2018-2065

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Many applications of single channel source separation (SCSS), including automatic speech recognition (ASR) and hearing aids, require an estimate of only one source from a mixture of many sources. Treating this special case as a regular SCSS problem, in which all constituent sources are given equal priority in terms of reconstruction, may result in suboptimal separation performance. In this paper, we tackle the one-source separation problem by suitably modifying the orthodox SCSS framework to focus on only one source at a time. The proposed approach is a generic framework that can be applied to any existing SCSS algorithm, improves performance, and, unlike most existing SCSS methods, scales well when there are more than two sources in the mixture. Additionally, existing SCSS algorithms rely on fine hyper-parameter tuning, which makes them difficult to use in practice. Our framework takes a step towards automatic tuning of the hyper-parameters, thereby making our method better suited to the mixture to be separated and thus more useful in practice. We test our framework on a neural-network-based algorithm, and the results show improved performance in terms of signal-to-distortion ratio (SDR) and signal-to-artifacts ratio (SAR).


Pages: 3494-3498

Page count: 5

