Neural network-based spectrum estimation for online WPE dereverberation

被引:66
|
作者
Kinoshita, Keisuke [1 ]
Delcroix, Marc [1 ]
Kwon, Haeyong [1 ]
Mori, Takuma [1 ]
Nakatani, Tomohiro [1 ]
机构
[1] NTT Corp, NTT Commun Sci Labs, Tokyo, Japan
关键词
dereverberation; neural network; spectrum estimation; inverse filtering; WPE;
D O I
10.21437/Interspeech.2017-733
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel speech dereverberation framework that utilizes deep neural network (DNN)-based spectrum estimation to construct linear inverse filters. The proposed derevcrberation framework is based on the state-of-the-art inverse filter estimation algorithm called weighted prediction error (WPE) algorithm, which is known to effectively reduce reverberation and greatly boost the ASR performance in various conditions. In WPE, the accuracy of the inverse filter estimation, and thus the deverberation performance, is largely dependent on the estimation of the power spectral density (PSD) of the target signal. Therefore, the conventional WPE iteratively performs the inverse filter estimation, actual dereverberation and the PSD estimation to gradually improve the PSD estimate. However, while such iterative procedure works well when sufficiently long acoustically-stationary observed signals arc available, WPE's performance degrades when the duration of observed/accessible data is short, which typically is the case for real-time applications using online block-batch processing with small batches. To solve this problem, we incorporate the DNN-based spectrum estimator into the framework of WPE, because a DNN can estimate the PSD robustly even from very short observed data. We experimentally show that the proposed framework outperforms the conventional WPE, and improves the ASR performance in real noisy reverberant environments in both single-channel and multichannel cases.
引用
收藏
页码:384 / 388
页数:5
相关论文
共 50 条
  • [1] JOINT OPTIMIZATION OF NEURAL NETWORK-BASED WPE DEREVERBERATION AND ACOUSTIC MODEL FOR ROBUST ONLINE ASR
    Heymann, Jahn
    Drude, Lukas
    Haeb-Umbach, Reinhold
    Kinoshita, Keisuke
    Nakatani, Tomohiro
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6655 - 6659
  • [2] FRAME-ONLINE DNN-WPE DEREVERBERATION
    Heymann, Jahn
    Drude, Lukas
    Haeb-Umbach, Reinhold
    Kinoshita, Keisuke
    Nakatani, Tomohiro
    [J]. 2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 466 - 470
  • [3] A Neural Network-based Online Estimation of Stochastic Inertia in a Power System
    Kisinga, Daniel Angelo
    Makolo, Peter
    Trodden, Paul
    [J]. 2022 IEEE PES/IAS POWERAFRICA CONFERENCE, 2022, : 283 - 287
  • [4] Neural Network-based Estimation of the MMSE
    Diaz, Mario
    Kairouz, Peter
    Liao, Jiachun
    Sankar, Lalitha
    [J]. 2021 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2021, : 1023 - 1028
  • [5] Robust Speech Dereverberation Based on WPE and Deep Learning
    Li, Hao
    Zhang, Xueliang
    Gao, Guanglai
    [J]. 2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 52 - 56
  • [6] Neural Network-Based Estimation for OFDM Channels
    Cheng, Chia-Hsin
    Huang, Yung-Fa
    Huang, Yao-Hung
    Chen, Hsing-Chung
    Yao, Tsung-Yu
    [J]. 2015 IEEE 29th International Conference on Advanced Information Networking and Applications (IEEE AINA 2015), 2015, : 600 - 604
  • [7] Neural network-based ATM QoS estimation
    Sheng, WB
    Rueda, J
    Blight, D
    [J]. IEEE WESCANEX 97 COMMUNICATIONS, POWER AND COMPUTING CONFERENCE PROCEEDINGS, 1997, : 1 - 6
  • [8] Performance estimation of a neural network-based controller
    Schumann, Johann
    Liu, Yan
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 2, PROCEEDINGS, 2006, 3972 : 981 - 990
  • [9] AN UNSUPERVISED LEARNING APPROACH TO NEURAL-NET-SUPPORTED WPE DEREVERBERATION
    Petkov, Petko N.
    Tsiaras, Vasileios
    Doddipatla, Rama
    Stylianou, Yannis
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5761 - 5765
  • [10] Neural network-based construction of online prediction intervals
    Hadjicharalambous, Myrianthi
    Polycarpou, Marios M.
    Panayiotou, Christos G.
    [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (11): : 6715 - 6733