Optimization and Parallelization of Monaural Source Separation Algorithms in the openBliSSART Toolkit

被引：0

作者：

Felix Weninger

Björn Schuller

机构：

[1] Technische Universität München,Institute for Human

来源：

Journal of Signal Processing Systems | 2012年 / 69卷

关键词：

Audio source separation; Speech enhancement; Parallel computing;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

We describe the implementation of monaural audio source separation algorithms in our toolkit openBliSSART (Blind Source Separation for Audio Recognition Tasks). To our knowledge, it provides the first freely available C+ + implementation of Non-Negative Matrix Factorization (NMF) supporting the Compute Unified Device Architecture (CUDA) for fast parallel processing on graphics processing units (GPUs). Besides integrating parallel processing, openBliSSART introduces several numerical optimizations of commonly used monaural source separation algorithms that reduce both computation time and memory usage. By illustrating a variety of use-cases from audio effects in music processing to speech enhancement and feature extraction, we demonstrate the wide applicability of our application framework for a multiplicity of research and end-user applications. We evaluate the toolkit by benchmark results of the NMF algorithms and discuss the influence of their parameterization on source separation quality and real-time factor. In the result, the GPU parallelization in openBliSSART introduces double-digit speedups with respect to conventional CPU computation, enabling real-time processing on a desktop PC even for high matrix dimensions.

引用

页码：267 / 277

页数：10

共 50 条

[1] Optimization and Parallelization of Monaural Source Separation Algorithms in the openBliSSART Toolkit
Weninger, Felix
Schuller, Bjoern
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2012, 69 (03): : 267 - 277
[2] OPENBLISSART: DESIGN AND EVALUATION OF A RESEARCH TOOLKIT FOR BLIND SOURCE SEPARATION IN AUDIO RECOGNITION TASKS
Weninger, Felix
Lehmann, Alexander
Schuller, Bjoern
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 1625 - 1628
[3] Joint Optimization of Masks and Deep Recurrent Neural Networks for Monaural Source Separation
Huang, Po-Sen
Kim, Minje
Hasegawa-Johnson, Mark
Smaragdis, Paris
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (12) : 2136 - 2147
[4] Bayesian Factorization and Learning for Monaural Source Separation
Chien, Jen-Tzung
Yang, Po-Kai
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (01) : 185 - 195
[5] SUPERVISED MONAURAL SOURCE SEPARATION BASED ON AUTOENCODERS
Osako, Keiichi
Mitsufuji, Yuki
Singh, Rita
Raj, Bhiksha
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 11 - 15
[6] Monaural source separation using spectral cues
Pearlmutter, BA
Zador, AM
INDEPENDENT COMPONENT ANALYSIS AND BLIND SIGNAL SEPARATION, 2004, 3195 : 478 - 485
[7] Parallelization of the quantile function optimization algorithms
A. I. Kibzun
Automation and Remote Control, 2007, 68 : 799 - 810
[8] PARALLELIZATION IN LEXICOGRAPHIC DISCRETE OPTIMIZATION ALGORITHMS
SERGIENKO, IV
CHERVAK, YY
GRENDZHA, VI
CYBERNETICS, 1984, 20 (05): : 720 - 725
[9] Parallelization of the quantile function optimization algorithms
Kibzun, A. I.
AUTOMATION AND REMOTE CONTROL, 2007, 68 (05) : 799 - 810
[10] Spatial Dispersion Constrained NMF for Monaural Source Separation
Viet-Hang Duong
Lee, Yuan-Shan
Bach-Tung Pham
Mathulaprangsan, Seksan
Pham-The Bao
Wang, Jia-Ching
2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,

← 1 2 3 4 5 →