Robust technologies towards automatic speech recognition in car noise environments

被引:0
|
作者
Ding, Pei [1 ]
He, Lei [1 ]
Yan, Xiang [1 ]
Zhao, Rui [1 ]
Hao, Jie [1 ]
机构
[1] Toshiba Res & Dev Ctr, Beijing, Peoples R China
关键词
robust speech recognition; in-car noise; speech enhancement; spectrum smoothing; immunity learning;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents the research on robust automatic speech recognition (ASR) in car noise environments. In the front-end design, speech enhancement technologies are used to suppress the background noise in frequency domain, and then spectrum smoothing is implemented both in time and frequency index to compensate those spectrum components distorted by noise over-reduction. In acoustic model training, we propose to use an immunity teaming scheme, in which pre-recorded car noises are artificially added to clean training utterances with different signal-to-noise ratios (SNR) to imitate the in-car environments. After analyzing the SNR and, noise spectrum of real in-car utterances, we further refine the immunity training set by adjusting the distribution of SNR and increasing the proportion of training noises that has a similar characteristic. Evaluation results of isolated phrase recognition show that the ASR system with proposed technologies achieves the average error rate reduction (ERR) of 90.68% and 79.08% for artificial car noisy speech and real in-car speech respectively, when compared with the baseline system in which no robust technology is used.
引用
收藏
页码:776 / +
页数:2
相关论文
共 50 条
  • [1] Robust Automatic Speech Recognition for Accented Mandarin in Car Environments
    Pei Ding
    Lei He
    Xiang Yan
    Jie Hao
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2542 - 2545
  • [2] Robust speech recognition in car environments
    Shozakai, M
    Nakamura, S
    Shikano, K
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 269 - 272
  • [3] Two-channel noise reduction for robust speech recognition in car environments
    Jeong, S.
    Yang, H.
    Hahn, M.
    ELECTRONICS LETTERS, 2008, 44 (17) : 1042 - U54
  • [4] Robust speech recognition for car environment noise
    Kokubo, H
    Amano, A
    Hataoka, N
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2002, 85 (11): : 65 - 73
  • [5] Robust automatic speech recognition in the presence of impulsive noise
    Potamitis, I
    Fakotakis, N
    Kokkinakis, G
    ELECTRONICS LETTERS, 2001, 37 (12) : 799 - 800
  • [6] An overview of noise-robust automatic speech recognition
    Li, Jinyu
    Deng, Li
    Gong, Yifan
    Haeb-Umbach, Reinhold
    IEEE Transactions on Audio, Speech and Language Processing, 2014, 22 (04): : 745 - 777
  • [7] Noise robust automatic speech recognition: review and analysis
    Dua M.
    Akanksha
    Dua S.
    International Journal of Speech Technology, 2023, 26 (02) : 475 - 519
  • [8] Environmental Noise Analysis for Robust Automatic Speech Recognition
    Kishore, N. Sai Bala
    Venkata, M. Rao
    Nagamani, M.
    ADVANCED COMPUTER AND COMMUNICATION ENGINEERING TECHNOLOGY, 2015, 315
  • [9] An Overview of Noise-Robust Automatic Speech Recognition
    Li, Jinyu
    Deng, Li
    Gong, Yifan
    Haeb-Umbach, Reinhold
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (04) : 745 - 777
  • [10] Noise Adaptive Training for Robust Automatic Speech Recognition
    Kalinli, Ozlem
    Seltzer, Michael L.
    Droppo, Jasha
    Acero, Alex
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (08): : 1889 - 1901