Optimization and Parallelization of Monaural Source Separation Algorithms in the openBliSSART Toolkit

被引:0
|
作者
Felix Weninger
Björn Schuller
机构
[1] Technische Universität München,Institute for Human
来源
关键词
Audio source separation; Speech enhancement; Parallel computing;
D O I
暂无
中图分类号
学科分类号
摘要
We describe the implementation of monaural audio source separation algorithms in our toolkit openBliSSART (Blind Source Separation for Audio Recognition Tasks). To our knowledge, it provides the first freely available C+ + implementation of Non-Negative Matrix Factorization (NMF) supporting the Compute Unified Device Architecture (CUDA) for fast parallel processing on graphics processing units (GPUs). Besides integrating parallel processing, openBliSSART introduces several numerical optimizations of commonly used monaural source separation algorithms that reduce both computation time and memory usage. By illustrating a variety of use-cases from audio effects in music processing to speech enhancement and feature extraction, we demonstrate the wide applicability of our application framework for a multiplicity of research and end-user applications. We evaluate the toolkit by benchmark results of the NMF algorithms and discuss the influence of their parameterization on source separation quality and real-time factor. In the result, the GPU parallelization in openBliSSART introduces double-digit speedups with respect to conventional CPU computation, enabling real-time processing on a desktop PC even for high matrix dimensions.
引用
收藏
页码:267 / 277
页数:10
相关论文
共 50 条
  • [1] Optimization and Parallelization of Monaural Source Separation Algorithms in the openBliSSART Toolkit
    Weninger, Felix
    Schuller, Bjoern
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2012, 69 (03): : 267 - 277
  • [2] OPENBLISSART: DESIGN AND EVALUATION OF A RESEARCH TOOLKIT FOR BLIND SOURCE SEPARATION IN AUDIO RECOGNITION TASKS
    Weninger, Felix
    Lehmann, Alexander
    Schuller, Bjoern
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 1625 - 1628
  • [3] Joint Optimization of Masks and Deep Recurrent Neural Networks for Monaural Source Separation
    Huang, Po-Sen
    Kim, Minje
    Hasegawa-Johnson, Mark
    Smaragdis, Paris
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (12) : 2136 - 2147
  • [4] Bayesian Factorization and Learning for Monaural Source Separation
    Chien, Jen-Tzung
    Yang, Po-Kai
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (01) : 185 - 195
  • [5] SUPERVISED MONAURAL SOURCE SEPARATION BASED ON AUTOENCODERS
    Osako, Keiichi
    Mitsufuji, Yuki
    Singh, Rita
    Raj, Bhiksha
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 11 - 15
  • [6] Monaural source separation using spectral cues
    Pearlmutter, BA
    Zador, AM
    INDEPENDENT COMPONENT ANALYSIS AND BLIND SIGNAL SEPARATION, 2004, 3195 : 478 - 485
  • [7] Parallelization of the quantile function optimization algorithms
    A. I. Kibzun
    Automation and Remote Control, 2007, 68 : 799 - 810
  • [8] PARALLELIZATION IN LEXICOGRAPHIC DISCRETE OPTIMIZATION ALGORITHMS
    SERGIENKO, IV
    CHERVAK, YY
    GRENDZHA, VI
    CYBERNETICS, 1984, 20 (05): : 720 - 725
  • [9] Parallelization of the quantile function optimization algorithms
    Kibzun, A. I.
    AUTOMATION AND REMOTE CONTROL, 2007, 68 (05) : 799 - 810
  • [10] Spatial Dispersion Constrained NMF for Monaural Source Separation
    Viet-Hang Duong
    Lee, Yuan-Shan
    Bach-Tung Pham
    Mathulaprangsan, Seksan
    Pham-The Bao
    Wang, Jia-Ching
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,