Jointly Optimized Discriminative Features for Speech Recognition

Cited by: 0

Authors:

Ng, Tim [1]

Zhang, Bing [1]

Long Nguyen [1]

Affiliation:

[1] Raytheon BBN Technol, Cambridge, MA 02138 USA

Source:

11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4 | 2010

Keywords:

Multi-Layer Perceptrons; Region Dependent Transform; discriminative training; Mandarin speech recognition

DOI:

Not available

Chinese Library Classification:

TM [Electrical engineering]; TN [Electronics and communication technology]

Discipline classification codes:

0808; 0809

Abstract:

In the past decade, methods to extract long-term acoustic features for speech recognition using Multi-Layer Perceptrons have been proposed. These features have proven to be good complementary features when used in feature augmentation and/or system combination. Usually, conventional linear dimension reduction algorithms, e.g. Linear Discriminant Analysis, are not applied to the combined features. In this paper, Region Dependent Transform is applied to jointly optimize the feature combination under a discriminative training criterion. Compared to conventional augmentation, a 3% to 6% relative character error rate reduction for Mandarin speech recognition has been achieved using Region Dependent Transform.


Pages: 2626-2629

Page count: 4
