Static and dynamic spectral features: Their noise robustness and optimal weights for ASR

Cited by: 7
Authors
Yang, Chen
Soong, Frank K.
Lee, Tan
Affiliations
[1] Chinese Univ Hong Kong, Dept Elect Engn, Shatin, Hong Kong, Peoples R China
[2] Microsoft Res Asia, Beijing 100080, Peoples R China
Keywords
discriminative training; dynamic features; exponential weighting; noise robustness
DOI
10.1109/TASL.2006.885932
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
In this paper, we investigate the relative noise robustness of dynamic and static spectral features in speech recognition. It is found that the dynamic cepstrum is more robust to additive noise than its static counterpart. The results are consistent across different types of noise and over a wide range of noise levels. To exploit this unequal robustness, we propose a simple yet effective strategy of exponentially weighting the likelihoods that are contributed by the static and dynamic features during the decoding process. The optimal weights are discriminatively trained with a small amount of development data. This method is evaluated on two speaker-independent, connected digit databases, one in English (Aurora 2) and the other in Cantonese (CUDIGIT). For various types of noise at different signal-to-noise ratios (SNRs), the average relative word error rate reductions attained with the discriminatively trained weights are 36.6% and 41.9% for Aurora 2 and CUDIGIT, respectively. Noticeable performance improvement can be observed even when there is channel distortion. The proposed approach is appealing for practical applications because: 1) noise estimation is not required, 2) model adaptation is not required, 3) only minor modification of the decoding process is needed, and 4) only a few feature weights need to be trained.
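The exponential weighting described in the abstract can be illustrated with a minimal sketch. Raising each stream's likelihood to a power is the same as scaling its log-likelihood, so the decoder only needs a weighted sum of the static and dynamic log-likelihoods. Everything below is an assumption made for illustration: the function names, the single diagonal Gaussian per stream, and the weight values are hypothetical and are not taken from the paper, which trains its weights discriminatively on development data.

```python
import numpy as np


def diag_gauss_loglik(x, mean, var):
    """Log-density of x under a diagonal-covariance Gaussian (illustrative model)."""
    x, mean, var = (np.asarray(a, dtype=float) for a in (x, mean, var))
    return float(-0.5 * np.sum(np.log(2.0 * np.pi * var) + (x - mean) ** 2 / var))


def weighted_loglik(static_ll, dynamic_ll, w_static=1.0, w_dynamic=1.0):
    """Exponentially weighted stream likelihood, computed in the log domain.

    log(p_static**w_static * p_dynamic**w_dynamic)
        = w_static * log p_static + w_dynamic * log p_dynamic
    """
    return w_static * static_ll + w_dynamic * dynamic_ll


# Toy observation split into a static part and a dynamic (delta) part.
static_obs = [1.0, -0.5]
dynamic_obs = [0.2, 0.1]

# Hypothetical unit-variance, zero-mean model for both streams.
ll_s = diag_gauss_loglik(static_obs, [0.0, 0.0], [1.0, 1.0])
ll_d = diag_gauss_loglik(dynamic_obs, [0.0, 0.0], [1.0, 1.0])

# Equal weights reproduce the usual unweighted score.
baseline = weighted_loglik(ll_s, ll_d)

# Down-weighting the static stream (hypothetical value 0.5) reduces the
# influence of the less noise-robust features on the decoding score.
reweighted = weighted_loglik(ll_s, ll_d, w_static=0.5, w_dynamic=1.0)

print(baseline, reweighted)
```

With both weights set to 1 the score equals the ordinary joint log-likelihood, which is why only a minor change to the decoder is needed: the per-stream scores are already available and only their combination changes.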
Pages: 1087-1097
Page count: 11