Jointly Optimized Discriminative Features for Speech Recognition

被引:0
|
作者
Ng, Tim [1 ]
Zhang, Bing [1 ]
Long Nguyen [1 ]
机构
[1] Raytheon BBN Technol, Cambridge, MA 02138 USA
关键词
Multi-Layer Perceptrons; Region Dependent Transform; discriminative training; Mandarin speech recognition;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the past decade, methods to extract long-term acoustic features for speech recognition using Multi-Layer Perceptrons have been proposed. These features have been proved to be good complementary features in some feature augmentations and/or through system combination. Usually, conventional linear dimension reduction algorithms, e.g. Linear Discriminative Analysis, are not applied on the combined features. In this paper, Region Dependent Transform is applied to jointly optimize the feature combination under a discriminative training criterion. When compared to a conventional augmentation, 3% to 6% relative character error rate reduction for Mandarin speech recognition has been achieved using Region Dependent Transform.
引用
收藏
页码:2626 / 2629
页数:4
相关论文
共 50 条
  • [31] Generalized Discriminative Feature Transformation for Speech Recognition
    Hsiao, Roger
    Schultz, Tanja
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 672 - 675
  • [32] Discriminative Analysis of Distortion Sequences in Speech Recognition
    Chang, Pao-Chung
    Chen, Sin-Horng
    Juang, Biing-Hwang
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1993, 1 (03): : 326 - 333
  • [33] Discriminative training of language models for speech recognition
    Kuo, KHJ
    Fosler-Lussier, E
    Jiang, H
    Lee, CH
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 325 - 328
  • [34] PHONOLOGICAL FEATURES IN DISCRIMINATIVE CLASSIFICATION OF DYSARTHRIC SPEECH
    Rudzicz, Frank
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4605 - 4608
  • [35] Attention-based latent features for jointly trained end-to-end automatic speech recognition with modified speech enhancement
    Yang, Da-Hee
    Chang, Joon-Hyuk
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (03) : 202 - 210
  • [36] Optimized Discriminative LBP Patterns for Infrared Face Recognition
    Wang, Zhengzi
    Xie, Zhihua
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, 2015, : 446 - 449
  • [37] Extracting discriminative color features for face recognition
    Liu, Chengjun
    [J]. PATTERN RECOGNITION LETTERS, 2011, 32 (14) : 1796 - 1804
  • [38] Learning emotion-discriminative and domain-invariant features for domain adaptation in speech emotion recognition
    Mao, Qirong
    Xu, Guopeng
    Xue, Wentao
    Gou, Jianping
    Zhan, Yongzhao
    [J]. SPEECH COMMUNICATION, 2017, 93 : 1 - 10
  • [39] Towards more discriminative features for texture recognition
    Cerkezi, Llukman
    Topal, Cihan
    [J]. PATTERN RECOGNITION, 2020, 107 (107)
  • [40] Learning Discriminative Hierarchical Features for Object Recognition
    Zuo, Zhen
    Wang, Gang
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (09) : 1159 - 1163