Multichannel Audio Modeling with Elliptically Stable Tensor Decomposition

被引:4
|
作者
Fontaine, Mathieu [1 ]
Stoter, Fabian-Robert [2 ,3 ]
Liutkus, Antoine [2 ,3 ]
Simsekli, Umut [4 ]
Serizel, Romain [1 ]
Badeau, Roland [4 ]
机构
[1] Univ Lorraine, LORIA, CNRS, INRIA, F-54000 Nancy, France
[2] INRIA, Montpellier, France
[3] LIRMM, Montpellier, France
[4] Univ Paris Saclay, Telecom ParisTech, LTCI, Paris, France
关键词
SPEECH ENHANCEMENT; ALGORITHMS;
D O I
10.1007/978-3-319-93764-9_2
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper introduces a new method for multichannel speech enhancement based on a versatile modeling of the residual noise spectrogram. Such a model has already been presented before in the single channel case where the noise component is assumed to follow an alpha-stable distribution for each time-frequency bin, whereas the speech spectrogram, supposed to be more regular, is modeled as Gaussian. In this paper, we describe a multichannel extension of this model, as well as a Monte Carlo Expectation - Maximisation algorithm for parameter estimation. In particular, a multichannel extension of the Itakura-Saito nonnegative matrix factorization is exploited to estimate the spectral parameters for speech, and a Metropolis-Hastings algorithm is proposed to estimate the noise contribution. We evaluate the proposed method in a challenging multichannel denoising application and compare it to other state-of-the-art algorithms.
引用
收藏
页码:13 / 23
页数:11
相关论文
共 50 条
  • [1] MULTICHANNEL AUDIO SIGNAL COMPRESSION BASED ON TENSOR DECOMPOSITION
    Wang, Jing
    Xu, Chundong
    Xie, Xiang
    Kuang, Jingming
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 286 - 290
  • [2] A Novel Multichannel Audio Signal Compression Method Based on Tensor Representation and Decomposition
    Wang Jing
    Me Xiang
    Kuang Jingming
    [J]. CHINA COMMUNICATIONS, 2014, 11 (03) : 80 - 90
  • [3] Tensor Completion for Recovering Multichannel Audio Signal with Missing Data
    Lidong Yang
    Min Liu
    Jing Wang
    Xiang Xie
    Jingming Kuang
    [J]. China Communications, 2019, 16 (04) : 186 - 195
  • [4] Tensor Completion for Recovering Multichannel Audio Signal with Missing Data
    Yang, Lidong
    Liu, Min
    Wang, Jing
    Xie, Xiang
    Kuang, Jingming
    [J]. CHINA COMMUNICATIONS, 2019, 16 (04) : 186 - 195
  • [5] ALPHA-STABLE MULTICHANNEL AUDIO SOURCE SEPARATION
    Leglaive, Simon
    Simsekli, Umut
    Liutkus, Antoine
    Badeau, Roland
    Richard, Gael
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 576 - 580
  • [6] MULTICHANNEL AUDIO SOURCE SEPARATION WITH PROBABILISTIC REVERBERATION MODELING
    Leglaive, Simon
    Badeau, Roland
    Richard, Gael
    [J]. 2015 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2015,
  • [7] MULTICHANNEL AUDIO UPMIXING BASED ON NON-NEGATIVE TENSOR FACTORIZATION REPRESENTATION
    Nikunen, J.
    Virtanen, T.
    Vilermo, M.
    [J]. 2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2011, : 33 - 36
  • [8] Multi-channel audio recovery based on tensor decomposition
    Yang, Li-Dong
    Wang, Jing
    Zhao, Yi
    Xie, Xiang
    Kuang, Jing-Ming
    [J]. Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2015, 35 (11): : 1183 - 1188
  • [9] PROBABILISTIC LATENT TENSOR FACTORIZATION FRAMEWORK FOR AUDIO MODELING
    Cemgil, Ali Taylan
    Simsekli, Umut
    Subakan, Yusuf Cem
    [J]. 2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2011, : 137 - 140
  • [10] A tensor decomposition based multichannel linear prediction approach to speech dereverberation
    Zeng, Xiaojin
    He, Hongsen
    Chen, Jingdong
    Benesty, Jacob
    [J]. APPLIED ACOUSTICS, 2023, 214