FRAME-ONLINE DNN-WPE DEREVERBERATION

被引：0

作者：

Heymann, Jahn ^{[1
]}

Drude, Lukas ^{[1
]}

Haeb-Umbach, Reinhold ^{[1
]}

Kinoshita, Keisuke ^{[2
]}

Nakatani, Tomohiro ^{[2
]}

机构：

[1] Paderborn Univ, Dept Commun Engn, Paderborn, Germany

[2] NTT Corp, NTT Commun Sci Labs, Kyoto, Japan

来源：

2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC) | 2018年

关键词：

speech recognition; online speech enhancement; dereverberation;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Signal dereverberation using the weighted prediction error (WPE) method has been proven to be an effective means to raise the accuracy of far-field speech recognition. But in its original formulation, WPE requires multiple iterations over a sufficiently long utterance, rendering it unsuitable for online low-latency applications. Recently, two methods have been proposed to overcome this limitation. One utilizes a neural network to estimate the power spectral density (PSD) of the target signal and works in a block-online fashion. The other method relies on a rather simple PSD estimation which smoothes the observed PSD and utilizes a recursive formulation which enables it to work on a frame-by-frame basis. In this paper, we integrate a deep neural network (DNN) based estimator into the recursive frame-online formulation. We evaluate the performance of the recursive system with different PSD estimators in comparison to the block-online and of-fline variant on two distinct corpora. The REVERB challenge data, where the signal is mainly deteriorated by reverberation, and a database which combines WSJ and VoiceHome to also consider (directed) noise sources. The results show that although smoothing works surprisingly well, the more sophisticated DNN based estimator shows promising improvements and shortens the performance gap between online and of-fline processing.

引用

页码：466 / 470

页数：5

共 50 条

[1] Speech dereverberation and source separation using DNN-WPE and LWPR-PCA
Jasmine J. C. Sheeja
B. Sankaragomathi
[J]. Neural Computing and Applications, 2023, 35 : 7339 - 7356
[2] Speech dereverberation and source separation using DNN-WPE and LWPR-PCA
Sheeja, Jasmine J. C.
Sankaragomathi, B.
[J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (10): : 7339 - 7356
[3] Neural network-based spectrum estimation for online WPE dereverberation
Kinoshita, Keisuke
Delcroix, Marc
Kwon, Haeyong
Mori, Takuma
Nakatani, Tomohiro
[J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 384 - 388
[4] DNN-FREE LOW-LATENCY ADAPTIVE SPEECH ENHANCEMENT BASED ON FRAME-ONLINE BEAMFORMING POWERED BY BLOCK-ONLINE FASTMNMF
Nugraha, Aditya Arie
Sekiguchi, Kouhei
Fontaine, Mathieu
Bando, Yoshiaki
Yoshii, Kazuyoshi
[J]. 2022 INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC 2022), 2022,
[5] Improving Frame-Online Neural Speech Enhancement With Overlapped-Frame Prediction
Wang, Zhong-Qiu
Watanabe, Shinji
[J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1422 - 1426
[6] Robust Speech Dereverberation Based on WPE and Deep Learning
Li, Hao
Zhang, Xueliang
Gao, Guanglai
[J]. 2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 52 - 56
[7] IMPORTANCE OF SWITCH OPTIMIZATION CRITERION IN SWITCHING WPE DEREVERBERATION
Kamo, Naoyuki
Ikeshita, Rintaro
Kinoshita, Keisuke
Nakatani, Tomohiro
[J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 176 - 180
[8] JOINT OPTIMIZATION OF NEURAL NETWORK-BASED WPE DEREVERBERATION AND ACOUSTIC MODEL FOR ROBUST ONLINE ASR
Heymann, Jahn
Drude, Lukas
Haeb-Umbach, Reinhold
Kinoshita, Keisuke
Nakatani, Tomohiro
[J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6655 - 6659
[9] AN UNSUPERVISED LEARNING APPROACH TO NEURAL-NET-SUPPORTED WPE DEREVERBERATION
Petkov, Petko N.
Tsiaras, Vasileios
Doddipatla, Rama
Stylianou, Yannis
[J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5761 - 5765
[10] ONLINE SPEECH DEREVERBERATION USING RLS-WPE BASED ON A FULL SPATIAL CORRELATION MATRIX INTEGRATED IN A SPEECH ENHANCEMENT SYSTEM
Kim, Jun Hyung
Park, Jazzson
Ahn, Minwook
Lee, Yeonbok
Kim, Wonsub
Park, Hyung-Min
[J]. 2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 36 - 40

← 1 2 3 4 5 →