LASSO-BASED REVERBERATION SUPPRESSION IN AUTOMATIC SPEECH RECOGNITION

被引:0
|
作者
Zhang, Xuewei [1 ,3 ]
Lin, Yiye [1 ,4 ]
Wang, Dong [1 ,2 ]
机构
[1] Tsinghua Univ, Res Inst Informat Technol, Ctr Speech & Language Technol, Beijing, Peoples R China
[2] Tsinghua Natl Lab Informat Sci & Technol, Beijing, Peoples R China
[3] Shenyang Architectural Univ, Shenyang, Liaoning, Peoples R China
[4] Beijing Inst Technol, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
far-field speech recognition; reverberation suppression; linear sparse prediction model; Lasso;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Far-field automatic speech recognition (ASR) is challenging, mainly attributed to the high reverberation in the recordings. A novel linear sparse prediction model has been proposed to estimate and suppress reverberation. This model considers reverberation as a mixture of early and late reflections of the direct signal and estimates the late reflection with Lasso. It has been demonstrated that this approach is promising in improving perceptual intelligibility, however it is unknown if the improvement can be propagated to ASR tasks. This paper applies the Lasso-based dereverberation approach to far-field speech recognition, and shows that it can deliver significant performance improvement for ASR based on deep neural networks (DNN). Particularly, we demonstrated that an utterance-based Lasso is sufficient to obtain good performance, which is important for applying the Lasso-based dereverberation to real-time ASR systems.
引用
收藏
页码:5034 / 5037
页数:4
相关论文
共 50 条
  • [1] ON THE APPLICATION OF REVERBERATION SUPPRESSION TO ROBUST SPEECH RECOGNITION
    Maas, Roland
    Habets, Emanuel A. P.
    Sehr, Armin
    Kellermann, Walter
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 297 - 300
  • [2] IMPROVING ROBUSTNESS AGAINST REVERBERATION FOR AUTOMATIC SPEECH RECOGNITION
    Mitra, Vikramjit
    Van Hout, Julien
    Wang, Wen
    Graciarena, Martin
    McLaren, Mitchell
    Franco, Horacio
    Vergyri, Dimitra
    [J]. 2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 525 - 532
  • [3] Late Reverberation Reduction and Blind Reverberation Time Measurement for Automatic Speech Recognition
    Prodeus, Arkadiy
    [J]. 2017 IEEE FIRST UKRAINE CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (UKRCON), 2017, : 634 - 639
  • [4] Minimum based noise suppression for improved automatic speech recognition
    Fernández, J
    Meyer, C
    Fischer, A
    [J]. PROCEEDINGS OF THE SIXTH IASTED INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, 2004, : 243 - 248
  • [5] Spectral Subtraction for Reverberation Reduction Applied to Automatic Speech Recognition
    Pacheco, Fernando S.
    Seara, Rui
    [J]. PROCEEDINGS OF THE IEEE INTERNATIONAL TELECOMMUNICATIONS SYMPOSIUM, VOLS 1 AND 2, 2006, : 795 - 800
  • [6] Coherence-based phonemic analysis on the effect of reverberation to practical automatic speech recognition
    Nam, Hyeonuk
    Park, Yong-Hwa
    [J]. APPLIED ACOUSTICS, 2025, 227
  • [7] On Adaptive LASSO-based Sparse Time-Varying Complex AR Speech Analysis
    Funaki, Keiichi
    [J]. 2021 12TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2021,
  • [8] LEVERAGING AUTOMATIC SPEECH RECOGNITION IN COCHLEAR IMPLANTS FOR IMPROVED SPEECH INTELLIGIBILITY UNDER REVERBERATION
    Hazrati, Oldooz
    Ghaffarzadegan, Shabnam
    Hansen, John H. L.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5093 - 5097
  • [9] A Lasso-based Collaborative Filtering Recommendation Model
    Hiep Xuan Huynh
    Vien Quang Dam
    Long Van Nguyen
    Nghia Quoc Phan
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (04) : 509 - 514
  • [10] Sensitivity of Automatic Speech Recognition to Excessive Noise and Late Reverberation Reduction
    Prodeus, Arkadiy
    [J]. 2016 IEEE 36TH INTERNATIONAL CONFERENCE ON ELECTRONICS AND NANOTECHNOLOGY (ELNANO), 2016, : 347 - 352