Jointly Optimized Discriminative Features for Speech Recognition

Cited by: 0
Authors
Ng, Tim [1]
Zhang, Bing [1]
Nguyen, Long [1]
Affiliations
[1] Raytheon BBN Technologies, Cambridge, MA 02138, USA
Keywords
Multi-Layer Perceptrons; Region Dependent Transform; discriminative training; Mandarin speech recognition
DOI
Not available
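Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809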
Abstract
Over the past decade, methods have been proposed for extracting long-term acoustic features for speech recognition using Multi-Layer Perceptrons. These features have proved to be good complementary features when used in feature augmentation and/or system combination. Usually, conventional linear dimension reduction algorithms, e.g. Linear Discriminant Analysis, are not applied to the combined features. In this paper, the Region Dependent Transform is applied to jointly optimize the feature combination under a discriminative training criterion. Compared with conventional augmentation, the Region Dependent Transform achieves a 3% to 6% relative reduction in character error rate for Mandarin speech recognition.
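The abstract does not spell out how the Region Dependent Transform combines the feature streams. As a rough illustration only, the sketch below assumes the commonly described form of the technique: a Gaussian mixture model divides the feature space into regions, each region has its own linear projection, and the output is the posterior-weighted sum of the projected features. All function and parameter names are hypothetical, the choice to compute region posteriors on the concatenated features is an assumption, and the discriminative training of the projections is not shown.

```python
import math

def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian evaluated at x."""
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )

def region_posteriors(x, weights, means, variances):
    """Posterior probability of each GMM component (region) given x."""
    logs = [
        math.log(w) + log_gauss_diag(x, m, v)
        for w, m, v in zip(weights, means, variances)
    ]
    top = max(logs)  # subtract the maximum for numerical stability
    exps = [math.exp(value - top) for value in logs]
    total = sum(exps)
    return [e / total for e in exps]

def matvec(matrix, x):
    """Multiply a row-major matrix (list of rows) by a vector."""
    return [sum(a * xi for a, xi in zip(row, x)) for row in matrix]

def rdt_forward(cepstral, mlp, weights, means, variances, transforms, biases):
    """Project one frame of concatenated features (assumed form, see lead-in).

    cepstral, mlp: the two feature streams for a single frame.
    weights, means, variances: GMM parameters defining the regions.
    transforms, biases: one projection matrix and offset per region.
    """
    x = list(cepstral) + list(mlp)  # simple concatenation of the streams
    post = region_posteriors(x, weights, means, variances)
    out_dim = len(biases[0])
    y = [0.0] * out_dim
    for p, matrix, offset in zip(post, transforms, biases):
        projected = matvec(matrix, x)
        for k in range(out_dim):
            y[k] += p * (projected[k] + offset[k])
    return y
```

With a single region this reduces to one global linear projection of the concatenated features, which is why the region-dependent form can be read as a generalization of a conventional linear dimension reduction step.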
Pages: 2626-2629
Number of pages: 4