Variational Bayesian Inference for Source Separation and Robust Feature Extraction

被引:9
|
作者
Adiloglu, Kamil [1 ]
Vincent, Emmanuel [2 ]
机构
[1] HorTech gGmbH, D-26129 Oldenburg, Germany
[2] Inria, F-54600 Villers Les Nancy, France
关键词
AUDIO SOURCE SEPARATION; UNCERTAINTY ESTIMATION; SPEECH; MODELS; BLIND; FACTORIZATION; COMPENSATION; RECOGNITION; INFORMATION; FRAMEWORK;
D O I
10.1109/TASLP.2016.2583794
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We consider the task of separating and classifying individual sound sources mixed together. The main challenge is to achieve robust classification despite residual distortion of the separated source signals. A promising paradigm is to estimate the uncertainty about the separated source signals and to propagate it through the subsequent feature extraction and classification stages. We argue that variational Bayesian (VB) inference offers a mathematically rigorous way of deriving uncertainty estimators, which contrasts with state-of-the-art estimators based on heuristics or on maximum likelihood (ML) estimation. We propose a general VB source separation algorithm, which makes it possible to jointly exploit spatial and spectral models of the sources. This algorithm achieves 6% and 5% relative error reduction compared to ML uncertainty estimation on the CHiME noise -robust speaker identification and speech recognition benchmarks, respectively, and it opens the way for more complex VB approximations of uncertainty.
引用
收藏
页码:1746 / 1758
页数:13
相关论文
共 50 条
  • [1] Variational and stochastic inference for Bayesian source separation
    Cemgil, A. Taylan
    Fevotte, Cedric
    Godsill, Simon J.
    [J]. DIGITAL SIGNAL PROCESSING, 2007, 17 (05) : 891 - 913
  • [2] Sparse Robust Dynamic Feature Extraction using Bayesian Inference
    Puli, Vamsi Krishna
    Chiplunkar, Ranjith
    Huang, Biao
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024, 71 (06) : 6201 - 6209
  • [3] Transfer Learning for Dynamic Feature Extraction Using Variational Bayesian Inference
    Xie, Junyao
    Huang, Biao
    Dubljevic, Stevan
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (11) : 5524 - 5535
  • [4] A GENERAL VARIATIONAL BAYESIAN FRAMEWORK FOR ROBUST FEATURE EXTRACTION IN MULTISOURCE RECORDINGS
    Adiloglu, Kamil
    Vincent, Emmanuel
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 273 - 276
  • [5] Application of variational Bayesian PCA for speech feature extraction
    Kwon, OW
    Lee, TW
    Chan, KL
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 825 - 828
  • [6] Variational Bayesian Inference for Robust Streaming Tensor Factorization and Completion
    Hawkins, Cole
    Zhang, Zheng
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 1446 - 1451
  • [7] Variational Bayesian method for temporally correlated source separation
    Huang, Qinghua
    Yang, Jie
    Zhou, Yue
    [J]. NEURAL INFORMATION PROCESSING, PT 1, PROCEEDINGS, 2006, 4232 : 1058 - 1067
  • [8] VARIATIONAL BAYESIAN MODEL AVERAGING FOR AUDIO SOURCE SEPARATION
    Jaureguiberry, Xabier
    Vincent, Emmanuel
    Richard, Gael
    [J]. 2014 IEEE WORKSHOP ON STATISTICAL SIGNAL PROCESSING (SSP), 2014, : 33 - 36
  • [9] Nonlinear blind source separation by variational Bayesian learning
    Valpola, H
    Oja, E
    Ilin, A
    Honkela, A
    Karhunen, J
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2003, E86A (03) : 532 - 541
  • [10] Rethinking the Invariant Feature Learning: Variational Bayesian Inference for Domain Generalization
    Liu, Xiaofeng
    Hu, Bo
    Jin, Linghao
    Han, Xu
    Xing, Fangxu
    Ouyang, Jinsong
    Lu, Jun
    El Fakhri, Georges
    Woo, Jonghye
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 881 - 887