A Hybrid Reverberation Model and Its Application to Joint Speech Dereverberation and Separation

被引：0

作者：

Liu, Tongzheng ^{[1
]}

Lu, Zhihua ^{[1
]}

da Costa, Joao Paulo J. ^{[2
]}

Fei, Tai ^{[3
]}

机构：

[1] Ningbo Univ, Coll Informat Sci & Engn, Ningbo 315211, Peoples R China

[2] Hamm Lippstadt Univ Appl Sci HSHL, Dept Lippstadt 2, D-59063 Hamm, Germany

[3] HELLA GmbH & Co KGaA, D-59552 Lippstadt, Germany

来源：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2023年 / 31卷

基金：

中国国家自然科学基金;

关键词：

Reverberation model; dereverberation; speech separation; blind source separation; multichannel nonnegative matrix factorization; microphone array; BLIND SOURCE SEPARATION; NONNEGATIVE MATRIX FACTORIZATION; INDEPENDENT VECTOR EXTRACTION; NOISE-REDUCTION; ALGORITHMS; CANCELLATION; ENHANCEMENT; MIXTURES;

D O I：

10.1109/TASLP.2023.3301227

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This article proposes a hybrid reverberation model by integrating two conventional models, namely, the multichannel linear prediction (MCLP) model and the spatial coherence model. The late reverberation is divided into two components. One component is modeled using an MCLP model, and the other is modeled using the spatial coherence model. In contrast with the conventional models, the proposed hybrid model increases modeling capacity, especially in the case of long reverberation time. In order to optimally estimate model parameters, joint speech dereverberation and separation is taken into account. The hybrid reverberation model is then used in conjunction with the multichannel nonnegative matrix factorization (MNMF). The method called Hybrid-FastMNMF is proposed by treating the reverberation component modeled by the spatial coherence model as a noise source and estimating its parameters similarly to speech sources. Furthermore, prior knowledge of the spatial coherence matrix is employed to whiten the observations, resulting in another method called Hybrid-FastMNMF-W. Experimental findings demonstrate the proposed methods' superior performance in terms of joint speech dereverberation and separation, and they further justify the efficiency of the proposed hybrid reverberation model.

引用

下载

页码：3000 / 3014

页数：15

共 50 条

[41] A hybrid model for unsupervised single channel speech separation
Kumar, M. K. Prasanna
Kumaraswamy, R.
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (05) : 13241 - 13259
[42] Joint Dereverberation and Residual Echo Suppression of Speech Signals in Noisy Environments
Habets, Emanuel A. P.
Gannot, Sharon
Cohen, Israel
Sommen, Piet C. W.
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (08): : 1433 - 1451
[43] SINGLE CHANNEL JOINT SPEECH DEREVERBERATION AND DENOISING USING DEEP PRIORS
Raikar, Aditya
Basu, Sourya
Hegde, Rajesh M.
2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 216 - 220
[44] A Non-convolutive NMF Model for Speech Dereverberation
Mohanan, Nikhil
Velmurugan, Rajbabu
Rao, Preeti
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1324 - 1328
[45] A Late Reverberation Power Spectral Density Aware Approach to Speech Dereverberation Based on Deep Neural Networks
Qi, Yuanlei
Yang, Feiran
Yang, Jun
2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1700 - 1703
[46] Speech Recognition Using Blind Source Separation and Dereverberation Method for Mixed Sound of Speech and Music
Wang, Longbiao
Odani, Kyohei
Kai, Atsuhiko
Li, Weifeng
2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
[47] Effect of Reverberation on Neural Responses to Natural Speech in Rabbit Auditory Midbrain: No Evidence for a Neural Dereverberation Mechanism
Barzelay, Oded
David, Stephen
Delgutte, Bertrand
ENEURO, 2023, 10 (05)
[48] ON THE APPLICATION OF REVERBERATION SUPPRESSION TO ROBUST SPEECH RECOGNITION
Maas, Roland
Habets, Emanuel A. P.
Sehr, Armin
Kellermann, Walter
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 297 - 300
[49] Multichannel Equalization in the KLT and Frequency Domains With Application to Speech Dereverberation
Rashobh, Rajan S.
Khong, Andy W. H.
Liu, Di
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (03) : 634 - 646
[50] Multichannel equalization in the KLT and frequency domains with application to speech dereverberation
Rashobh, Rajan S.
Khong, Andy W. H.
Liu, Di
IEEE Transactions on Audio, Speech and Language Processing, 2014, 22 (03): : 634 - 646

← 1 2 3 4 5 →