Dereverberation based on Wavelet Packet Filtering for Robust Automatic Speech Recognition

Authors
Gomez, Randy [1 ]
Kawahara, Tatsuya [1 ]
Affiliations
[1] Kyoto Univ, ACCMS, Sakyo Ku, Kyoto 6068501, Japan
Keywords
Speech recognition; Robustness; Dereverberation; Wavelet packets
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper describes a multiple-resolution signal analysis that suppresses the late reflection of reverberation for robust automatic speech recognition (ASR). Wavelet packet tree (WPT) decomposition offers a finer resolution for discriminating the late reflection subspace from the speech subspace. By selecting an appropriate wavelet basis in the WPT for speech and for late reflection, we can effectively estimate the Wiener gain directly from the observed reverberant data. Moreover, the selection procedure is performed in accordance with the likelihood of the acoustic model used by the speech recognizer. Dereverberation is realized by filtering the wavelet packet coefficients with the Wiener gain to suppress the effects of the late reflection. Experimental evaluations with large vocabulary continuous speech recognition (LVCSR) in real reverberant conditions show that the proposed method outperforms conventional wavelet-based methods and other dereverberation techniques.
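
The filtering step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a Haar wavelet, a full packet tree of fixed depth, one gain per subband, and a late-reflection power estimate per subband that is simply passed in (the paper instead derives its estimates from basis selection guided by acoustic model likelihood). The names `wpt_decompose`, `wiener_gain`, and `dereverberate` are hypothetical.

```python
import math

SQRT2 = math.sqrt(2.0)

def haar_split(x):
    """One orthonormal Haar analysis step: (approximation, detail)."""
    approx = [(x[i] + x[i + 1]) / SQRT2 for i in range(0, len(x), 2)]
    detail = [(x[i] - x[i + 1]) / SQRT2 for i in range(0, len(x), 2)]
    return approx, detail

def haar_merge(approx, detail):
    """Inverse of haar_split: interleave the reconstructed sample pairs."""
    out = []
    for a, d in zip(approx, detail):
        out.append((a + d) / SQRT2)
        out.append((a - d) / SQRT2)
    return out

def wpt_decompose(x, depth):
    """Full wavelet packet tree: every node is split at every level.
    len(x) must be divisible by 2**depth."""
    nodes = [list(x)]
    for _ in range(depth):
        split = []
        for node in nodes:
            split.extend(haar_split(node))
        nodes = split
    return nodes

def wpt_reconstruct(nodes):
    """Merge sibling subbands pairwise until a single signal remains."""
    while len(nodes) > 1:
        nodes = [haar_merge(nodes[i], nodes[i + 1])
                 for i in range(0, len(nodes), 2)]
    return nodes[0]

def wiener_gain(observed_power, late_power):
    """Gain S / (S + R), with speech power S estimated as max(observed - R, 0).
    R is an assumed, externally supplied late-reflection power estimate."""
    speech_power = max(observed_power - late_power, 0.0)
    denom = speech_power + late_power
    return speech_power / denom if denom > 0.0 else 0.0

def dereverberate(x, depth, late_power_per_band):
    """Scale each subband's coefficients by its Wiener gain, then reconstruct."""
    bands = wpt_decompose(x, depth)
    filtered = []
    for coeffs, late_power in zip(bands, late_power_per_band):
        observed_power = sum(c * c for c in coeffs) / len(coeffs)
        gain = wiener_gain(observed_power, late_power)
        filtered.append([gain * c for c in coeffs])
    return wpt_reconstruct(filtered)

# Toy signal of length 8, depth 2 -> 4 subbands of 2 coefficients each.
signal = [1.0, 2.0, 3.0, 4.0, 0.5, -1.0, 2.5, 0.0]
unchanged = dereverberate(signal, 2, [0.0] * 4)   # zero late power: gain is 1
suppressed = dereverberate(signal, 2, [0.1] * 4)  # assumed late power per band
```

With a zero late-reflection estimate every gain is 1 and the signal is reconstructed unchanged; any positive estimate gives gains in [0, 1], so the output energy cannot exceed the input energy because the Haar transform is orthonormal.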
Pages: 1242-1245
Number of pages: 4