Linear Prediction-based Dereverberation with Very Deep Convolutional Neural Networks for Reverberant Speech Recognition

被引:0
|
作者
Park, Sunchan [1 ]
Jeong, Yongwon [1 ]
Kim, Min Sik [1 ]
Kim, Hyung Soon [1 ]
机构
[1] Pusan Natl Univ, Dept Elect Engn, Busan, South Korea
关键词
convolutional neural network; dereverberation; reverberant speech recognition; weighted prediction error;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Convolutional neural networks (CNNs) have been shown to improve classification tasks such as automatic speech recognition (ASR). Furthermore, the CNN with very deep architecture lowered the word error rate (WER) in reverberant and noisy environments. However, DNN-based ASR systems still perform poorly in unseen reverberant conditions. In this paper, we use the weighted prediction error (WPE)-based preprocessing for dereverberation. In our experiments on the ASR task of the REVERB Challenge 2014, the WPE-based processing with eight channels reduced the WER by 20% for the real-condition data using CNN acoustic models with 10 layers.
引用
收藏
页码:310 / 311
页数:2
相关论文
共 50 条
  • [31] Plant identification based on very deep convolutional neural networks
    Heyan Zhu
    Qinglin Liu
    Yuankai Qi
    Xinyuan Huang
    Feng Jiang
    Shengping Zhang
    [J]. Multimedia Tools and Applications, 2018, 77 : 29779 - 29797
  • [32] A Reverberation-Time-Aware Approach to Speech Dereverberation Based on Deep Neural Networks
    Wu, Bo
    Li, Kehuang
    Yang, Minglei
    Lee, Chin-Hui
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (01) : 102 - 111
  • [33] Mongolian Speech Recognition Based on Deep Neural Networks
    Zhang, Hui
    Bao, Feilong
    Gao, Guanglai
    [J]. CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA (CCL 2015), 2015, 9427 : 180 - 188
  • [34] Birdsong recognition using prediction-based recurrent neural fuzzy networks
    Juang, Chia-Feng
    Chen, Tai-Mou
    [J]. NEUROCOMPUTING, 2007, 71 (1-3) : 121 - 130
  • [35] Reverberant speech recognition combining deep neural networks and deep autoencoders augmented with a phone-class feature
    Mimura, Masato
    Sakai, Shinsuke
    Kawahara, Tatsuya
    [J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2015,
  • [36] Reverberant speech recognition combining deep neural networks and deep autoencoders augmented with a phone-class feature
    Masato Mimura
    Shinsuke Sakai
    Tatsuya Kawahara
    [J]. EURASIP Journal on Advances in Signal Processing, 2015
  • [37] Speech dereverberation for enhancement and recognition using dynamic features constrained deep neural networks and feature adaptation
    Xiong Xiao
    Shengkui Zhao
    Duc Hoang Ha Nguyen
    Xionghu Zhong
    Douglas L. Jones
    Eng Siong Chng
    Haizhou Li
    [J]. EURASIP Journal on Advances in Signal Processing, 2016
  • [38] Convolutional Neural Networks for Distant Speech Recognition
    Swietojanski, Pawel
    Ghoshal, Arnab
    Renals, Steve
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (09) : 1120 - 1124
  • [39] Continuous speech recognition by convolutional neural networks
    Zhang, Qing-Qing
    Liu, Yong
    Pan, Jie-Lin
    Yan, Yong-Hong
    [J]. Gongcheng Kexue Xuebao/Chinese Journal of Engineering, 2015, 37 (09): : 1212 - 1217
  • [40] AN ANALYSIS OF CONVOLUTIONAL NEURAL NETWORKS FOR SPEECH RECOGNITION
    Huang, Jui-Ting
    Li, Jinyu
    Gong, Yifan
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4989 - 4993