Optimization of Dereverberation Parameters based on Likelihood of Speech Recognizer

被引:0
|
作者
Gomez, Randy [1 ]
Kawahara, Tatsuya [1 ]
机构
[1] Kyoto Univ, ACCMS, Sakyo Ku, Kyoto 6068501, Japan
关键词
Dereverberation; Robust ASR;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech recognition under reverberant condition is a difficult task. Most dereverberation techniques used to address this problem enhance the reverberant waveform independent from that of the speech recognizer. In this paper, we improve the conventional Spectral Subtraction-based (SS) dereverberation technique. In our proposed approach, the dereverberation parameters are optimized to improve the likelihood of the acoustic model. The system is capable of adaptively fine-tuning these parameters jointly with acoustic model training. Additional optimization is also implemented during decoding of the test utterances. We have evaluated using real reverberant data and experimental results show that the proposed method significantly improves the recognition performance over the conventional approach.
引用
收藏
页码:1259 / 1262
页数:4
相关论文
共 50 条
  • [41] BAYESIAN LEARNING FOR SPEECH DEREVERBERATION
    Chien, Jen-Tzung
    Chang, You-Cheng
    [J]. 2016 IEEE 26TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2016,
  • [42] Dual microphone speech dereverberation
    Abutalebi, Hamid Reza
    Faghani, Farhad
    [J]. 2007 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1-3, 2007, : 847 - 850
  • [43] A SOM based 2500 - Isolated - Farsi - Word speech recognizer
    Shirazi, J
    Menhaj, MB
    [J]. ARTIFICIAL NEURAL NETWORKS: BIOLOGICAL INSPIRATIONS - ICANN 2005, PT 1, PROCEEDINGS, 2005, 3696 : 589 - 595
  • [44] SPEECH DEREVERBERATION BASED ON INTEGRATED DEEP AND ENSEMBLE LEARNING ALGORITHM
    Lee, Wei-Jen
    Wang, Syu-Siang
    Chen, Fei
    Lu, Xugang
    Chien, Shao-Yi
    Tsao, Yu
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5454 - 5458
  • [45] Speech Enhancement and Dereverberation With Diffusion-Based Generative Models
    Richter, Julius
    Welker, Simon
    Lemercier, Jean-Marie
    Lay, Bunlong
    Gerkmann, Timo
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 2351 - 2364
  • [46] Blind dereverberation of monaural speech signals based on harmonic structure
    Nakatani, Tomohiro
    Miyoshi, Masato
    Kinoshita, Keisuke
    [J]. Systems and Computers in Japan, 2006, 37 (06): : 1 - 12
  • [47] Applications of Virtual-Evidence based Speech Recognizer Training
    Subramanya, Amarnag
    Bilmes, Jeff
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2562 - 2565
  • [48] Deep Learning-Based Amplitude Fusion for Speech Dereverberation
    Liu, Chunlei
    Wang, Longbiao
    Dang, Jianwu
    [J]. DISCRETE DYNAMICS IN NATURE AND SOCIETY, 2020, 2020
  • [49] Fast estimation of a precise dereverberation filter based on speech harmonicity
    Kinoshita, K
    Nakatani, T
    Miyoshi, M
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1073 - 1076
  • [50] Speech dereverberation based on probabilistic models of source and room acoustics
    Nakatani, Tomohiro
    Juang, Biing-Hwang
    Kinoshita, Keisuke
    Miyoshi, Masato
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 821 - 824