Jointly Optimized Discriminative Features for Speech Recognition

被引：0

作者：

Ng, Tim ^{[1
]}

Zhang, Bing ^{[1
]}

Long Nguyen ^{[1
]}

机构：

[1] Raytheon BBN Technol, Cambridge, MA 02138 USA

来源：

11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4 | 2010年

关键词：

Multi-Layer Perceptrons; Region Dependent Transform; discriminative training; Mandarin speech recognition;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In the past decade, methods to extract long-term acoustic features for speech recognition using Multi-Layer Perceptrons have been proposed. These features have been proved to be good complementary features in some feature augmentations and/or through system combination. Usually, conventional linear dimension reduction algorithms, e.g. Linear Discriminative Analysis, are not applied on the combined features. In this paper, Region Dependent Transform is applied to jointly optimize the feature combination under a discriminative training criterion. When compared to a conventional augmentation, 3% to 6% relative character error rate reduction for Mandarin speech recognition has been achieved using Region Dependent Transform.

引用

页码：2626 / 2629

页数：4

共 50 条

[31] Generalized Discriminative Feature Transformation for Speech Recognition
Hsiao, Roger
Schultz, Tanja
[J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 672 - 675
[32] Discriminative Analysis of Distortion Sequences in Speech Recognition
Chang, Pao-Chung
Chen, Sin-Horng
Juang, Biing-Hwang
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1993, 1 (03): : 326 - 333
[33] Discriminative training of language models for speech recognition
Kuo, KHJ
Fosler-Lussier, E
Jiang, H
Lee, CH
[J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 325 - 328
[34] PHONOLOGICAL FEATURES IN DISCRIMINATIVE CLASSIFICATION OF DYSARTHRIC SPEECH
Rudzicz, Frank
[J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4605 - 4608
[35] Attention-based latent features for jointly trained end-to-end automatic speech recognition with modified speech enhancement
Yang, Da-Hee
Chang, Joon-Hyuk
[J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (03) : 202 - 210
[36] Optimized Discriminative LBP Patterns for Infrared Face Recognition
Wang, Zhengzi
Xie, Zhihua
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, 2015, : 446 - 449
[37] Extracting discriminative color features for face recognition
Liu, Chengjun
[J]. PATTERN RECOGNITION LETTERS, 2011, 32 (14) : 1796 - 1804
[38] Learning emotion-discriminative and domain-invariant features for domain adaptation in speech emotion recognition
Mao, Qirong
Xu, Guopeng
Xue, Wentao
Gou, Jianping
Zhan, Yongzhao
[J]. SPEECH COMMUNICATION, 2017, 93 : 1 - 10
[39] Towards more discriminative features for texture recognition
Cerkezi, Llukman
Topal, Cihan
[J]. PATTERN RECOGNITION, 2020, 107 (107)
[40] Learning Discriminative Hierarchical Features for Object Recognition
Zuo, Zhen
Wang, Gang
[J]. IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (09) : 1159 - 1163

← 1 2 3 4 5 →