On noise robustness of dynamic and static features for continuous Cantonese digit recognition

被引:0
|
作者
Yang, C [1 ]
Soong, FK [1 ]
Lee, T [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Elect Engn, Hong Kong, Hong Kong, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It has been shown previously that augmented spectral features (static and dynamic cepstra) are effective for improving ASR performance in a clean environment. In this paper we investigate the noise robustness of static and dynamic cepstral features, in a speaker independent, continuous recognition task by using a noise-added, Cantonese digit database (CUDigit). We found that the dynamic cepstrum is more robust to additive, background noise than its static counterpart. The results are consistent across different types of noise and under various SNRs. Exponential weights which can exploit the unequal robustness of two features are optimally trained in a development set. A relative word error rate reduction of 41.9%, mainly on a significant reduction of insertions, is obtained on the test data under various noise and SNR conditions.
引用
收藏
页码:277 / 280
页数:4
相关论文
共 50 条
  • [1] Clock Drawing Test Digit Recognition Using Static and Dynamic Features
    Harbi, Zainab
    Hicks, Yulia
    Setchi, Rossitza
    [J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS: PROCEEDINGS OF THE 20TH INTERNATIONAL CONFERENCE KES-2016, 2016, 96 : 1221 - 1230
  • [2] Static and dynamic spectral features: Their noise robustness and optimal weights for ASR
    Chen, Y
    Soong, FK
    Lee, T
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 241 - 244
  • [3] Static and dynamic spectral features: Their noise robustness and optimal weights for ASR
    Yang, Chen
    Soong, Frank K.
    Lee, Tan
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (03): : 1087 - 1097
  • [4] Continuous Digit Recognition in Noise: Reservoirs can do an excellent job!
    Jalalvand, Azarakhsh
    Triefenbach, Fabian
    Martens, Jean-Pierre
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1802 - 1805
  • [5] Noise and speaker robustness in a Persian continuous speech recognition system
    Veisi, Hadi
    Sameti, Hossein
    [J]. 2007 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1-3, 2007, : 73 - 76
  • [6] Noise Robust Continuous Digit Recognition with Reservoir-based Acoustic Models
    Jalalvand, Azarakhsh
    Demuynck, Kris
    Martens, Jean-Pierre
    [J]. 2013 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS SYSTEMS (ISPACS), 2013, : 204 - 209
  • [7] Noise Robustness of Spectrum Delta (SpD) Features in Malay Vowel Recognition
    Azmi, Mohd Yusof Shahrul
    Idayu, M. Nor
    Roshidi, D.
    Yaakob, A. R.
    Yaacob, Sazali
    [J]. COMPUTER APPLICATIONS FOR COMMUNICATION, NETWORKING, AND DIGITAL CONTENTS, 2012, 350 : 270 - +
  • [8] Noise robustness of texture features
    Tan, TN
    [J]. IMAGE AND VISION COMPUTING, 1997, 15 (11) : 815 - 817
  • [9] Digit recognition using trispectral features
    Chandran, V
    Slomka, S
    Gollogly, M
    Elgar, S
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 3065 - 3068
  • [10] Indonesian Sign Language Recognition by Using the Static and Dynamic Features
    Khotimah, Wijayanti Nurul
    Suciati, Nanik
    Benedict, Ignatius
    [J]. 2018 INTERNATIONAL SEMINAR ON INTELLIGENT TECHNOLOGY AND ITS APPLICATIONS (ISITIA 2018), 2018, : 293 - 298