MULTI-MICROPHONE COMPLEX SPECTRAL MAPPING FOR SPEECH DEREVERBERATION

被引:0
|
作者
Wang, Zhong-Qiu [1 ]
Wang, DeLiang [1 ,2 ]
机构
[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
[2] Ohio State Univ, Ctr Cognit & Brain Sci, Columbus, OH 43210 USA
基金
美国国家科学基金会;
关键词
Beamforming; complex spectral mapping; speech dereverberation; microphone array processing; deep learning; SEPARATION; LOCALIZATION; ENHANCEMENT; RECOGNITION; NETWORKS; MASKING; NOISY;
D O I
10.1109/icassp40776.2020.9053610
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This study proposes a multi-microphone complex spectral mapping approach for speech dereverberation on a fixed array geometry. In the proposed approach, a deep neural network (DNN) is trained to predict the real and imaginary (RI) components of direct sound from the stacked reverberant (and noisy) RI components of multiple microphones. We also investigate the integration of multi-microphone complex spectral mapping with beamforming and post-filtering. Experimental results on multi-channel speech dereverberation demonstrate the effectiveness of the proposed approach.
引用
收藏
页码:486 / 490
页数:5
相关论文
共 50 条
  • [41] The design and collection of COSINE, a multi-microphone in situ speech corpus recorded in noisy environments
    Stupakov, Alex
    Hanusa, Evan
    Vijaywargi, Deepak
    Fox, Dieter
    Bilmes, Jeff
    COMPUTER SPEECH AND LANGUAGE, 2012, 26 (01): : 52 - 66
  • [42] MAXIMUM LIKELIHOOD BASED NOISE COVARIANCE MATRIX ESTIMATION FOR MULTI-MICROPHONE SPEECH ENHANCEMENT
    Kjems, Ulrik
    Jensen, Jesper
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 295 - 299
  • [43] Dual-microphone speech dereverberation in a noisy environment
    Habets, Emanuel A. P.
    Gannot, Sharon
    Cohen, Israel
    2006 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2006, : 651 - 655
  • [44] MWF-based speech dereverberation with a local microphone array and an external microphone
    Ali, Randall
    van Waterschoot, Toon
    Moonen, Marc
    2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [45] TIME-VARYING RESIDUAL NOISE FEATURE MODEL ESTIMATION FOR MULTI-MICROPHONE SPEECH RECOGNITION
    Yoshioka, Takuya
    Ternon, Emmanuel Y. J.
    Nakatani, Tomohiro
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4913 - 4916
  • [46] MULTI-MICROPHONE INTERFERENCE SUPPRESSION USING THE PRINCIPAL SUBSPACE MODIFICATION AND ITS APPLICATION TO SPEECH RECOGNITION
    Kim, Gibak
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5508 - 5511
  • [47] MULTI-MICROPHONE SIGNAL-PROCESSING TECHNIQUE TO REMOVE ROOM REVERBERATION FROM SPEECH SIGNALS
    ALLEN, JB
    BERKLEY, DA
    BLAUERT, J
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1977, 62 (04): : 912 - 915
  • [48] TIME-VARYING RESIDUAL NOISE FEATURE MODEL ESTIMATION FOR MULTI-MICROPHONE SPEECH RECOGNITION
    Yoshioka, Takuya
    Ternon, Emmanuel Y. J.
    Nakatani, Tomohiro
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4913 - 4916
  • [49] On the Improvement of Modulation Features Using Multi-Microphone Energy Tracking for Robust Distant Speech Recognition
    Rodomagoulakis, Isidoros
    Maragos, Petros
    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 558 - 562
  • [50] State Space Microphone Array Nonlinear Acoustic Echo Cancellation Using Multi-Microphone Near-End Speech Covariance
    Park, Jihwan
    Chang, Joon-Hyuk
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (10) : 1520 - 1534