Joint dereverberation and blind source separation using a hybrid autoregressive and convolutive transfer function-based model

被引:0
|
作者
Liu S. [1 ,2 ]
Yang F. [2 ,3 ]
Chen R. [4 ]
Yang J. [1 ,2 ]
机构
[1] Key Laboratory of Noise and Vibration Research, Institute of Acoustics, Chinese Academy of Sciences, Beijing
[2] University of Chinese Academy of Sciences, Beijing
[3] State Key Laboratory of Acoustics, Institute of Acoustics, Chinese Academy of Sciences, Beijing
[4] Tencent AI Lab, Beijing
基金
中国国家自然科学基金;
关键词
Autoregressive; Blind source separation; Convolutive transfer function; Dereverberation; Multichannel non-negative matrix factorization;
D O I
10.1016/j.apacoust.2024.110135
中图分类号
学科分类号
摘要
Most frequency-domain blind source separation (BSS) methods are based on the multiplicative narrowband assumption, which is not valid in long reverberation environments. In contrast, convolutive transfer function (CTF)-based BSS methods do not rely on the narrowband assumption, and the separation performance is significantly improved compared to the traditional algorithms in long reverberation environments. However, the CTF-based BSS methods and their variants, e.g., autoregressive (AR) BSS methods, introduce modeling errors to some extent, due to the truncation or approximation during the optimization process. To address this problem, we propose a frequency-domain BSS method employing a hybrid AR and CTF model, which can provide more precise representations of the early reflections and late reverberations. Furthermore, we utilize the Gaussian noise model to deal with the BSS problem in noisy reverberant environments. We formulate the objective function using the maximum log-likelihood criterion, and derive an efficient iterative algorithm for parameter estimation with the block coordinate descent (BCD) method. Experimental results show that the proposed method has a better separation performance than the existing methods in long reverberation environments. © 2024 Elsevier Ltd
引用
下载
收藏
相关论文
共 50 条
  • [41] AN IMROVEMENT IN USING HERMITIAN ANGLEIN CONVOLUTIVE SPEECH BLIND SOURCE SEPARATION
    Mahmoodian, Hamid
    Soltani, Atefeh
    Hashemi, Ali
    2013 INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTER AND COMPUTATION (ICECCO), 2013, : 368 - 371
  • [42] Multiresolution Convolutive Blind Source Separation Using Adaptive Lifting Scheme
    Hattay, Jamel
    Belaid, Samir
    Naanaa, Wady
    2013 IEEE 20TH INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS, AND SYSTEMS (ICECS), 2013, : 273 - 276
  • [43] A Hybrid Reverberation Model and Its Application to Joint Speech Dereverberation and Separation
    Liu, Tongzheng
    Lu, Zhihua
    da Costa, Joao Paulo J.
    Fei, Tai
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 3000 - 3014
  • [44] A Blind Source Separation Approach Based on IVA for Convolutive Speech Mixtures
    Jan, Tariqullah
    Zafar, Haseeb
    Khalil, Ruhulamin
    Ashraf, Majid
    2016 8TH COMPUTER SCIENCE AND ELECTRONIC ENGINEERING CONFERENCE (CEEC), 2016, : 140 - 145
  • [45] Joint source separation and dereverberation using constrained spectral divergence optimization
    Nathwani, Karan
    Hegde, Rajesh M.
    SIGNAL PROCESSING, 2015, 106 : 266 - 281
  • [46] Autoregressive Moving Average Jointly-Diagonalizable Spatial Covariance Analysis for Joint Source Separation and Dereverberation
    Sekiguchi, Kouhei
    Bando, Yoshiaki
    Nugraha, Aditya Arie
    Fontaine, Mathieu
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2368 - 2382
  • [47] AUDIO SOURCE SEPARATION BASED ON CONVOLUTIVE TRANSFER FUNCTION AND FREQUENCY-DOMAIN LASSO OPTIMIZATION
    Li, Xiaofei
    Girin, Laurent
    Horaud, Radu
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 541 - 545
  • [48] Non-orthogonal joint block diagonalization based on the LU or QR factorizations for convolutive blind source separation
    Zhang, Lei
    Cao, Yueyun
    Yang, Zichun
    Weng, Lei
    JOURNAL OF VIBROENGINEERING, 2017, 19 (05) : 3380 - 3394
  • [49] Multichannel Speech Separation and Enhancement Using the Convolutive Transfer Function
    Li, Xiaofei
    Girin, Laurent
    Gannot, Sharon
    Horaud, Radu
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (03) : 645 - 659
  • [50] FAJD blind source separation algorithm based on time-varying autoregressive model
    Ji C.
    Jin C.
    Zhang Y.
    Kongzhi yu Juece/Control and Decision, 2020, 35 (03): : 651 - 656