ONLINE SPEECH DEREVERBERATION USING RLS-WPE BASED ON A FULL SPATIAL CORRELATION MATRIX INTEGRATED IN A SPEECH ENHANCEMENT SYSTEM

Cited by: 0
Authors
Kim, Jun Hyung [1]
Park, Jazzson [2]
Ahn, Minwook [2]
Lee, Yeonbok [2]
Kim, Wonsub [2]
Park, Hyung-Min [1]
Affiliations
[1] Sogang Univ, Dept Elect Engn, Seoul, South Korea
[2] SK Telecom, SW R&D Ctr, Seoul, South Korea
Keywords
Dereverberation; weighted prediction error; recursive least squares; online processing; speech enhancement
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Reverberation can significantly degrade the performance of automatic speech recognition. Although the weighted prediction error (WPE) method has provided impressive results, most conventional WPE methods are based on batch processing. An efficient WPE method based on frame-by-frame adaptation of the prediction filter in the recursive least squares (RLS) framework has been proposed, but it oversimplifies the spatial correlation matrix to a scaled identity matrix. In this paper, we derive a generalized online RLS-WPE method that allows a full spatial correlation matrix. Experimental results showed that the method using a diagonal matrix achieved better performance than the one using a scaled identity matrix, and that employing a full matrix further improved the performance. Furthermore, we integrated the algorithm into a speech enhancement system, including stereo adaptive echo cancellation and minimum variance distortionless response (MVDR) beamforming, implemented on a Xilinx Zynq UltraScale+ MPSoC.
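For orientation, the frame-by-frame RLS adaptation mentioned in the abstract can be sketched for a single frequency bin as below. This is a minimal illustration of the conventional scaled-identity variant only, not the full-matrix derivation of the paper, which this record does not reproduce. The function name `rls_wpe_bin`, the parameter values (`taps`, `delay`, `alpha`, `eps`), and the short-window power estimate are all assumptions chosen for the example.

```python
import numpy as np

def rls_wpe_bin(Y, taps=4, delay=2, alpha=0.99, eps=1e-10):
    """Online RLS-WPE sketch for one frequency bin (scaled-identity variant).

    Y: (T, M) complex STFT frames for M microphones.
    Returns the (T, M) dereverberated frames.
    All parameter defaults are illustrative assumptions.
    """
    Y = np.asarray(Y, dtype=complex)
    T, M = Y.shape
    K = taps * M
    G = np.zeros((K, M), dtype=complex)   # prediction filter
    P = np.eye(K, dtype=complex)          # inverse weighted correlation matrix
    # hist[i] holds the frame observed i + 1 steps ago
    hist = np.zeros((delay + taps - 1, M), dtype=complex)
    X = np.empty_like(Y)
    for t in range(T):
        y = Y[t]
        # Stack the frames delayed by `delay`, ..., `delay + taps - 1`
        y_tilde = hist[delay - 1:].reshape(-1)
        # A priori prediction error: subtract the predicted late reverberation
        x = y - G.conj().T @ y_tilde
        # Assumed power estimate: mean over channels and a short frame window
        lam = max(np.mean(np.abs(np.vstack([y[None, :], hist])) ** 2), eps)
        # RLS gain vector
        Py = P @ y_tilde
        k = Py / (alpha * lam + np.vdot(y_tilde, Py).real)
        # Update the inverse correlation matrix and the prediction filter
        P = (P - np.outer(k, y_tilde.conj() @ P)) / alpha
        G = G + np.outer(k, x.conj())
        X[t] = x
        # Push the current frame into the history buffer
        hist = np.roll(hist, 1, axis=0)
        hist[0] = y
    return X
```

In this variant a single scalar power `lam` normalizes all channels, which corresponds to the scaled identity matrix that the abstract describes as an oversimplification; the diagonal and full-matrix versions would replace that scalar with a per-channel or full spatial weighting.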
Pages: 36-40
Page count: 5