FRAME-ONLINE DNN-WPE DEREVERBERATION

Authors
Heymann, Jahn [1]
Drude, Lukas [1]
Haeb-Umbach, Reinhold [1]
Kinoshita, Keisuke [2]
Nakatani, Tomohiro [2]
Affiliations
[1] Paderborn Univ, Dept Commun Engn, Paderborn, Germany
[2] NTT Corp, NTT Commun Sci Labs, Kyoto, Japan
Keywords
speech recognition; online speech enhancement; dereverberation
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Signal dereverberation using the weighted prediction error (WPE) method has proven to be an effective means of raising the accuracy of far-field speech recognition. In its original formulation, however, WPE requires multiple iterations over a sufficiently long utterance, rendering it unsuitable for online low-latency applications. Recently, two methods have been proposed to overcome this limitation. One uses a neural network to estimate the power spectral density (PSD) of the target signal and works in a block-online fashion. The other relies on a rather simple PSD estimate that smooths the observed PSD and uses a recursive formulation that enables it to work on a frame-by-frame basis. In this paper, we integrate a deep neural network (DNN) based estimator into the recursive frame-online formulation. We evaluate the performance of the recursive system with different PSD estimators in comparison to the block-online and offline variants on two distinct corpora: the REVERB challenge data, where the signal is mainly deteriorated by reverberation, and a database that combines WSJ and VoiceHome to also consider (directed) noise sources. The results show that although smoothing works surprisingly well, the more sophisticated DNN-based estimator shows promising improvements and narrows the performance gap between online and offline processing.
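To illustrate the frame-by-frame recursion the abstract refers to, the following is a minimal NumPy sketch of a recursive (RLS-style) WPE update for a single frequency bin, using a smoothed observed power as the PSD estimate. It is not the authors' implementation: the function name `recursive_wpe_bin`, the parameter values (`taps`, `delay`, `alpha`, `psd_context`), and the identity initialization of the inverse covariance are all assumptions chosen for illustration.

```python
import numpy as np

def recursive_wpe_bin(Y, taps=5, delay=2, alpha=0.99, psd_context=2, eps=1e-10):
    """Frame-online WPE sketch for ONE frequency bin (hypothetical helper).

    Y: complex array of shape (T, D), T frames and D microphones.
    Returns the dereverberated frames, shape (T, D).
    """
    Y = np.asarray(Y)
    T, D = Y.shape
    G = np.zeros((taps * D, D), dtype=complex)   # prediction filter
    R_inv = np.eye(taps * D, dtype=complex)      # inverse covariance (assumed init)
    X = Y.astype(complex)                        # early frames pass through unchanged

    for t in range(delay + taps - 1, T):
        # Stack the delayed frames y[t-delay], ..., y[t-delay-taps+1].
        y_tilde = Y[t - delay - taps + 1 : t - delay + 1][::-1].reshape(-1)

        # PSD estimate by smoothing the observed power over recent frames
        # and channels. A DNN-based estimator would replace this line.
        lam = max(np.mean(np.abs(Y[max(0, t - psd_context) : t + 1]) ** 2), eps)

        # Subtract the predicted late reverberation using the previous filter.
        x = Y[t] - G.conj().T @ y_tilde

        # Recursive least-squares update with forgetting factor alpha.
        num = R_inv @ y_tilde
        k = num / (alpha * lam + np.vdot(y_tilde, num))   # Kalman gain
        R_inv = (R_inv - np.outer(k, y_tilde.conj()) @ R_inv) / alpha
        G = G + np.outer(k, x.conj())

        X[t] = x
    return X
```

In a full system this update would run independently for every STFT frequency bin. Swapping the `lam` line for the output of a neural network is where a DNN-based PSD estimator would plug into the same recursion.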
Pages: 466-470
Page count: 5