FRAME-ONLINE DNN-WPE DEREVERBERATION

被引:0
|
作者
Heymann, Jahn [1 ]
Drude, Lukas [1 ]
Haeb-Umbach, Reinhold [1 ]
Kinoshita, Keisuke [2 ]
Nakatani, Tomohiro [2 ]
机构
[1] Paderborn Univ, Dept Commun Engn, Paderborn, Germany
[2] NTT Corp, NTT Commun Sci Labs, Kyoto, Japan
关键词
speech recognition; online speech enhancement; dereverberation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Signal dereverberation using the weighted prediction error (WPE) method has been proven to be an effective means to raise the accuracy of far-field speech recognition. But in its original formulation, WPE requires multiple iterations over a sufficiently long utterance, rendering it unsuitable for online low-latency applications. Recently, two methods have been proposed to overcome this limitation. One utilizes a neural network to estimate the power spectral density (PSD) of the target signal and works in a block-online fashion. The other method relies on a rather simple PSD estimation which smoothes the observed PSD and utilizes a recursive formulation which enables it to work on a frame-by-frame basis. In this paper, we integrate a deep neural network (DNN) based estimator into the recursive frame-online formulation. We evaluate the performance of the recursive system with different PSD estimators in comparison to the block-online and of-fline variant on two distinct corpora. The REVERB challenge data, where the signal is mainly deteriorated by reverberation, and a database which combines WSJ and VoiceHome to also consider (directed) noise sources. The results show that although smoothing works surprisingly well, the more sophisticated DNN based estimator shows promising improvements and shortens the performance gap between online and of-fline processing.
引用
收藏
页码:466 / 470
页数:5
相关论文
共 50 条
  • [1] Speech dereverberation and source separation using DNN-WPE and LWPR-PCA
    Jasmine J. C. Sheeja
    B. Sankaragomathi
    [J]. Neural Computing and Applications, 2023, 35 : 7339 - 7356
  • [2] Speech dereverberation and source separation using DNN-WPE and LWPR-PCA
    Sheeja, Jasmine J. C.
    Sankaragomathi, B.
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (10): : 7339 - 7356
  • [3] Neural network-based spectrum estimation for online WPE dereverberation
    Kinoshita, Keisuke
    Delcroix, Marc
    Kwon, Haeyong
    Mori, Takuma
    Nakatani, Tomohiro
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 384 - 388
  • [4] DNN-FREE LOW-LATENCY ADAPTIVE SPEECH ENHANCEMENT BASED ON FRAME-ONLINE BEAMFORMING POWERED BY BLOCK-ONLINE FASTMNMF
    Nugraha, Aditya Arie
    Sekiguchi, Kouhei
    Fontaine, Mathieu
    Bando, Yoshiaki
    Yoshii, Kazuyoshi
    [J]. 2022 INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC 2022), 2022,
  • [5] Improving Frame-Online Neural Speech Enhancement With Overlapped-Frame Prediction
    Wang, Zhong-Qiu
    Watanabe, Shinji
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1422 - 1426
  • [6] Robust Speech Dereverberation Based on WPE and Deep Learning
    Li, Hao
    Zhang, Xueliang
    Gao, Guanglai
    [J]. 2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 52 - 56
  • [7] IMPORTANCE OF SWITCH OPTIMIZATION CRITERION IN SWITCHING WPE DEREVERBERATION
    Kamo, Naoyuki
    Ikeshita, Rintaro
    Kinoshita, Keisuke
    Nakatani, Tomohiro
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 176 - 180
  • [8] JOINT OPTIMIZATION OF NEURAL NETWORK-BASED WPE DEREVERBERATION AND ACOUSTIC MODEL FOR ROBUST ONLINE ASR
    Heymann, Jahn
    Drude, Lukas
    Haeb-Umbach, Reinhold
    Kinoshita, Keisuke
    Nakatani, Tomohiro
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6655 - 6659
  • [9] AN UNSUPERVISED LEARNING APPROACH TO NEURAL-NET-SUPPORTED WPE DEREVERBERATION
    Petkov, Petko N.
    Tsiaras, Vasileios
    Doddipatla, Rama
    Stylianou, Yannis
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5761 - 5765
  • [10] ONLINE SPEECH DEREVERBERATION USING RLS-WPE BASED ON A FULL SPATIAL CORRELATION MATRIX INTEGRATED IN A SPEECH ENHANCEMENT SYSTEM
    Kim, Jun Hyung
    Park, Jazzson
    Ahn, Minwook
    Lee, Yeonbok
    Kim, Wonsub
    Park, Hyung-Min
    [J]. 2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 36 - 40