Optimization and Parallelization of Monaural Source Separation Algorithms in the openBliSSART Toolkit

被引:0
|
作者
Felix Weninger
Björn Schuller
机构
[1] Technische Universität München,Institute for Human
来源
关键词
Audio source separation; Speech enhancement; Parallel computing;
D O I
暂无
中图分类号
学科分类号
摘要
We describe the implementation of monaural audio source separation algorithms in our toolkit openBliSSART (Blind Source Separation for Audio Recognition Tasks). To our knowledge, it provides the first freely available C+ + implementation of Non-Negative Matrix Factorization (NMF) supporting the Compute Unified Device Architecture (CUDA) for fast parallel processing on graphics processing units (GPUs). Besides integrating parallel processing, openBliSSART introduces several numerical optimizations of commonly used monaural source separation algorithms that reduce both computation time and memory usage. By illustrating a variety of use-cases from audio effects in music processing to speech enhancement and feature extraction, we demonstrate the wide applicability of our application framework for a multiplicity of research and end-user applications. We evaluate the toolkit by benchmark results of the NMF algorithms and discuss the influence of their parameterization on source separation quality and real-time factor. In the result, the GPU parallelization in openBliSSART introduces double-digit speedups with respect to conventional CPU computation, enabling real-time processing on a desktop PC even for high matrix dimensions.
引用
收藏
页码:267 / 277
页数:10
相关论文
共 50 条
  • [31] On the Stability of Source Separation Algorithms
    Jean-François Cardoso
    Journal of VLSI signal processing systems for signal, image and video technology, 2000, 26 : 7 - 14
  • [32] On the stability of source separation algorithms
    Cardoso, JF
    JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2000, 26 (1-2): : 7 - 14
  • [33] INFORMED MONAURAL SOURCE SEPARATION OF MUSIC BASED ON CONVOLUTIONAL SPARSE CODING
    Jao, Ping-Keng
    Yang, Yi-Hsuan
    Wohlberg, Brendt
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 236 - 240
  • [34] A SI-SDR Loss Function based Monaural Source Separation
    Li, Shuai
    Liu, Hongqing
    Zhou, Yi
    Luo, Zhen
    PROCEEDINGS OF 2020 IEEE 15TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2020), 2020, : 356 - 360
  • [35] Monaural music source separation: Nonnegativity, sparseness, and shift-invariance
    Kim, M
    Choi, S
    INDEPENDENT COMPONENT ANALYSIS AND BLIND SIGNAL SEPARATION, PROCEEDINGS, 2006, 3889 : 617 - 624
  • [36] Monaural Source Separation Based on Adaptive Discriminative Criterion in Neural Networks
    Sun, Yang
    Zhu, Lei
    Chambers, Jonathon A.
    Naqvi, Syed Mohsen
    2017 22ND INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2017,
  • [37] ON THE LIMITATIONS OF BINAURAL REPRODUCTION OF MONAURAL BLIND SOURCE SEPARATION OUTPUT SIGNALS
    Reindl, Klaus
    Kellermann, Walter
    Zhang, Mengqiu
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 305 - 309
  • [38] NMF WITH SPECTRAL AND TEMPORAL CONTINUITY CRITERIA FOR MONAURAL SOUND SOURCE SEPARATION
    Becker, Julian M.
    Sohn, Christian
    Rohlfing, Christian
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 316 - 320
  • [39] Monaural Music Source Separation using a ResNet Latent Separator Network
    Brunner, Gino
    Naas, Nawel
    Palsson, Sveinn
    Richter, Oliver
    Wattenhofer, Roger
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1124 - 1131
  • [40] Stability analysis and optimization of time-domain convolutive source separation algorithms
    Charkani, N
    Deville, Y
    Herault, J
    FIRST IEEE SIGNAL PROCESSING WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS, 1997, : 73 - 76