A Hybrid Reverberation Model and Its Application to Joint Speech Dereverberation and Separation

Cited by: 0
Authors
Liu, Tongzheng [1]
Lu, Zhihua [1]
da Costa, Joao Paulo J. [2]
Fei, Tai [3]
Affiliations
[1] Ningbo Univ, Coll Informat Sci & Engn, Ningbo 315211, Peoples R China
[2] Hamm Lippstadt Univ Appl Sci HSHL, Dept Lippstadt 2, D-59063 Hamm, Germany
[3] HELLA GmbH & Co KGaA, D-59552 Lippstadt, Germany
Funding
National Natural Science Foundation of China
Keywords
Reverberation model; dereverberation; speech separation; blind source separation; multichannel nonnegative matrix factorization; microphone array; BLIND SOURCE SEPARATION; NONNEGATIVE MATRIX FACTORIZATION; INDEPENDENT VECTOR EXTRACTION; NOISE-REDUCTION; ALGORITHMS; CANCELLATION; ENHANCEMENT; MIXTURES;
DOI
10.1109/TASLP.2023.3301227
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This article proposes a hybrid reverberation model by integrating two conventional models, namely, the multichannel linear prediction (MCLP) model and the spatial coherence model. The late reverberation is divided into two components. One component is modeled using an MCLP model, and the other is modeled using the spatial coherence model. In contrast with the conventional models, the proposed hybrid model increases modeling capacity, especially in the case of long reverberation time. In order to optimally estimate model parameters, joint speech dereverberation and separation is taken into account. The hybrid reverberation model is then used in conjunction with the multichannel nonnegative matrix factorization (MNMF). The method called Hybrid-FastMNMF is proposed by treating the reverberation component modeled by the spatial coherence model as a noise source and estimating its parameters similarly to speech sources. Furthermore, prior knowledge of the spatial coherence matrix is employed to whiten the observations, resulting in another method called Hybrid-FastMNMF-W. Experimental findings demonstrate the proposed methods' superior performance in terms of joint speech dereverberation and separation, and they further justify the efficiency of the proposed hybrid reverberation model.
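The abstract mentions whitening the observations using prior knowledge of the spatial coherence matrix. The following is a minimal sketch of that general idea, not the authors' method: it assumes an ideal spherically diffuse field (sinc-shaped coherence) and Cholesky-based whitening at a single frequency, neither of which is stated in the abstract. The function names, microphone geometry, and diagonal loading value are illustrative assumptions.

```python
import numpy as np


def diffuse_coherence(mic_positions, freq_hz, speed_of_sound=343.0):
    """Coherence matrix of an ideal spherically diffuse field at one frequency.

    Assumption: the abstract does not say which coherence model is used.
    """
    # Pairwise microphone distances in metres, shape (M, M).
    diffs = mic_positions[:, None, :] - mic_positions[None, :, :]
    dists = np.linalg.norm(diffs, axis=-1)
    # np.sinc(x) = sin(pi x) / (pi x), so this is sin(2 pi f d / c) / (2 pi f d / c).
    return np.sinc(2.0 * freq_hz * dists / speed_of_sound)


def whiten(observations, coherence, diag_load=1e-6):
    """Map (M, T) observations so a component with this coherence becomes spatially white."""
    num_mics = coherence.shape[0]
    # Diagonal loading (hypothetical value) keeps the factorisation stable.
    chol = np.linalg.cholesky(coherence + diag_load * np.eye(num_mics))
    # Solving chol @ y = x applies the inverse Cholesky factor without forming it.
    return np.linalg.solve(chol, observations)


# Hypothetical 3-microphone linear array with 5 cm spacing.
mics = np.array([[0.00, 0.0, 0.0], [0.05, 0.0, 0.0], [0.10, 0.0, 0.0]])
gamma = diffuse_coherence(mics, freq_hz=1000.0)

# Synthetic diffuse-like component whose spatial covariance is close to gamma.
rng = np.random.default_rng(0)
unit_noise = (rng.standard_normal((3, 20000))
              + 1j * rng.standard_normal((3, 20000))) / np.sqrt(2.0)
late = np.linalg.cholesky(gamma + 1e-6 * np.eye(3)) @ unit_noise

# After whitening, the sample covariance should be close to the identity.
white = whiten(late, gamma)
cov = white @ white.conj().T / white.shape[1]
```

After whitening, a component that follows the assumed coherence model has an approximately identity spatial covariance, so it can be treated as spatially white noise by a downstream separation stage.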
Pages: 3000-3014
Number of pages: 15