Robust technologies towards automatic speech recognition in car noise environments

被引:0
|
作者
Ding, Pei [1 ]
He, Lei [1 ]
Yan, Xiang [1 ]
Zhao, Rui [1 ]
Hao, Jie [1 ]
机构
[1] Toshiba Res & Dev Ctr, Beijing, Peoples R China
关键词
robust speech recognition; in-car noise; speech enhancement; spectrum smoothing; immunity learning;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents the research on robust automatic speech recognition (ASR) in car noise environments. In the front-end design, speech enhancement technologies are used to suppress the background noise in frequency domain, and then spectrum smoothing is implemented both in time and frequency index to compensate those spectrum components distorted by noise over-reduction. In acoustic model training, we propose to use an immunity teaming scheme, in which pre-recorded car noises are artificially added to clean training utterances with different signal-to-noise ratios (SNR) to imitate the in-car environments. After analyzing the SNR and, noise spectrum of real in-car utterances, we further refine the immunity training set by adjusting the distribution of SNR and increasing the proportion of training noises that has a similar characteristic. Evaluation results of isolated phrase recognition show that the ASR system with proposed technologies achieves the average error rate reduction (ERR) of 90.68% and 79.08% for artificial car noisy speech and real in-car speech respectively, when compared with the baseline system in which no robust technology is used.
引用
收藏
页码:776 / +
页数:2
相关论文
共 50 条
  • [41] Novel frequency masking curves for noise-robust automatic speech recognition
    Chen, Chia-Ping
    Yeh, Ja-Zang
    Wu, Bo-Feng
    JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2013, 36 (06) : 696 - 703
  • [42] Exemplar-Based Sparse Representations for Noise Robust Automatic Speech Recognition
    Gemmeke, Jort F.
    Virtanen, Tuomas
    Hurmalainen, Antti
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (07): : 2067 - 2080
  • [43] Sparse coding of the modulation spectrum for noise-robust automatic speech recognition
    Ahmadi, Sara
    Ahadi, Seyed Mohammad
    Cranen, Bert
    Boves, Lou
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2014, : 1 - 20
  • [44] A MODULATION FEATURE SET FOR ROBUST AUTOMATIC SPEECH RECOGNITION IN ADDITIVE NOISE AND REVERBERATION
    Liu, Xiaoyu
    Sadeghian, Roozbeh
    Zahorian, Stephen A.
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5230 - 5234
  • [45] Application of noise robust MDT speech recognition on the SPEECON and SpeechDat-Car databases
    Gemmeke, J. F.
    Wang, Y.
    Van Segbroeck, M.
    Cranen, B.
    Van Hamme, H.
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1227 - +
  • [46] ROBUST WORD RECOGNITION IN ADVERSE CAR ENVIRONMENTS
    NAKAMURA, S
    AKABANE, T
    HAMAGUCHI, S
    KITOH, A
    SHARP TECHNICAL JOURNAL, 1993, (57): : 9 - 14
  • [47] Robust word recognition in adverse car environments
    Nakamura, Satoshi
    Akabane, Toshio
    Hamaguchi, Seiji
    Kitoh, Atsunori
    Shapu Giho/Sharp Technical Journal, 1993, (57): : 9 - 14
  • [48] Noise robust automatic speech recognition with adaptive quantile based noise estimation and speech band emphasizing filter bank
    Bonde, CS
    Graversen, C
    Gregersen, AG
    Ngo, KH
    Normark, K
    Purup, M
    Thorsen, T
    Lindberg, B
    NONLINEAR ANALYSES AND ALGORITHMS FOR SPEECH PROCESSING, 2005, 3817 : 291 - 302
  • [49] Towards Noise Robust Speech Emotion Recognition Using Dynamic Layer Customization
    Wilf, Alex
    Provost, Emily Mower
    2021 9TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2021,
  • [50] Towards inclusive automatic speech recognition
    Feng, Siyuan
    Halpern, Bence Mark
    Kudina, Olya
    Scharenborg, Odette
    COMPUTER SPEECH AND LANGUAGE, 2024, 84