A Hybrid Reverberation Model and Its Application to Joint Speech Dereverberation and Separation

被引:0
|
作者
Liu, Tongzheng [1 ]
Lu, Zhihua [1 ]
da Costa, Joao Paulo J. [2 ]
Fei, Tai [3 ]
机构
[1] Ningbo Univ, Coll Informat Sci & Engn, Ningbo 315211, Peoples R China
[2] Hamm Lippstadt Univ Appl Sci HSHL, Dept Lippstadt 2, D-59063 Hamm, Germany
[3] HELLA GmbH & Co KGaA, D-59552 Lippstadt, Germany
基金
中国国家自然科学基金;
关键词
Reverberation model; dereverberation; speech separation; blind source separation; multichannel nonnegative matrix factorization; microphone array; BLIND SOURCE SEPARATION; NONNEGATIVE MATRIX FACTORIZATION; INDEPENDENT VECTOR EXTRACTION; NOISE-REDUCTION; ALGORITHMS; CANCELLATION; ENHANCEMENT; MIXTURES;
D O I
10.1109/TASLP.2023.3301227
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This article proposes a hybrid reverberation model by integrating two conventional models, namely, the multichannel linear prediction (MCLP) model and the spatial coherence model. The late reverberation is divided into two components. One component is modeled using an MCLP model, and the other is modeled using the spatial coherence model. In contrast with the conventional models, the proposed hybrid model increases modeling capacity, especially in the case of long reverberation time. In order to optimally estimate model parameters, joint speech dereverberation and separation is taken into account. The hybrid reverberation model is then used in conjunction with the multichannel nonnegative matrix factorization (MNMF). The method called Hybrid-FastMNMF is proposed by treating the reverberation component modeled by the spatial coherence model as a noise source and estimating its parameters similarly to speech sources. Furthermore, prior knowledge of the spatial coherence matrix is employed to whiten the observations, resulting in another method called Hybrid-FastMNMF-W. Experimental findings demonstrate the proposed methods' superior performance in terms of joint speech dereverberation and separation, and they further justify the efficiency of the proposed hybrid reverberation model.
引用
收藏
页码:3000 / 3014
页数:15
相关论文
共 50 条
  • [1] JOINT BLIND DEREVERBERATION AND SEPARATION OF SPEECH MIXTURES
    Jan, Tariqullah
    Wang, Wenwu
    [J]. 2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 2343 - 2347
  • [2] Blind Separation and Dereverberation of Speech Mixtures by Joint Optimization
    Yoshioka, Takuya
    Nakatani, Tomohiro
    Miyoshi, Masato
    Okuno, Hiroshi G.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (01): : 69 - 84
  • [3] Multi-channel speech dereverberation based on a statistical model of late reverberation
    Habets, EAP
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 173 - 176
  • [4] Speech dereverberation based on blind estimation of a reverberation filter
    Zee, Min-Seon
    Park, Hyung-Min
    [J]. IEICE ELECTRONICS EXPRESS, 2009, 6 (20): : 1456 - 1461
  • [5] Special Issue on Dereverberation and Reverberation of Audio, Music, and Speech
    Spriet, Ann
    Goetze, Stefan
    van Waterschoot, Toon
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2017, 65 (1-2): : 6 - 7
  • [6] A NEW CASCADED SPECTRAL SUBTRACTION APPROACH FOR BINAURAL SPEECH DEREVERBERATION AND ITS APPLICATION IN SOURCE SEPARATION
    Khan, Muhammad Salman
    Naqvi, Syed Mohsen
    Chambers, Jonathon
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6566 - 6570
  • [7] Online blind source separation and dereverberation of speech based on a joint diagonalizability constraint
    Yu, Ho-Gun
    Kim, Do-Hui
    Song, Min-Hwan
    Park, Hyung-Min
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2021, 40 (05): : 503 - 514
  • [8] Joint Online Multichannel Acoustic Echo Cancellation, Speech Dereverberation and Source Separation
    Na, Yueyue
    Wang, Ziteng
    Liu, Zhang
    Tian, Biao
    Fu, Qiang
    [J]. INTERSPEECH 2021, 2021, : 1144 - 1148
  • [9] Computationally Efficient and Versatile Framework for Joint Optimization of Blind Speech Separation and Dereverberation
    Nakatani, Tomohiro
    Ikeshita, Rintaro
    Kinoshita, Keisuke
    Sawada, Hiroshi
    Araki, Shoko
    [J]. INTERSPEECH 2020, 2020, : 91 - 95
  • [10] Joint Multichannel Blind Speech Separation and Dereverberation: A Real-Time Algorithmic Implementation
    Rotili, Rudy
    De Simone, Claudio
    Perelli, Alessandro
    Cifani, Simone
    Squartini, Stefano
    [J]. ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, 2010, 93 : 85 - 93