Discriminative Kernel-Based Phoneme Sequence Recognition

Cited by: 0

Authors:

Keshet, Joseph [1]

Shalev-Shwartz, Shai [1]

Bengio, Samy [2]

Singer, Yoram [1,3]

Chazan, Dan [4]

Affiliations:

[1] Hebrew Univ Jerusalem, Sch Comp Sci & Engn, Jerusalem, Israel

[2] IDIAP Res Inst, Martigny, Switzerland

[3] Google Inc, Mountain View, CA USA

[4] Technion, Dept Elect Engn, Haifa, Israel

Source:

INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006

Keywords:

speech recognition; phoneme recognition; acoustic modeling; support vector machines

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

We describe a new method for phoneme sequence recognition from a speech utterance that is not based on hidden Markov models (HMMs). In contrast to HMM-based approaches, our method uses a discriminative kernel-based training procedure in which the learning process is tailored to the goal of minimizing the Levenshtein distance between the predicted phoneme sequence and the correct sequence. The phoneme sequence predictor is devised by mapping the speech utterance, together with a proposed phoneme sequence, to a vector space endowed with an inner product that is realized by a Mercer kernel. Building on large-margin techniques for predicting whole sequences, we devise a learning algorithm that reduces to separating the correct phoneme sequence from all other sequences. We describe an iterative algorithm for learning the phoneme sequence recognizer and an efficient implementation of it. We present encouraging initial experimental results on the TIMIT corpus and compare the proposed method to an HMM-based approach.
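The abstract names the Levenshtein distance between the predicted and the correct phoneme sequence as the quantity the training procedure targets. As a minimal sketch of that evaluation measure only (not the authors' learning algorithm or implementation), the standard dynamic-programming edit distance over phoneme labels can be written as follows. The function name `levenshtein` and the example phoneme labels are illustrative assumptions, not taken from the paper.

```python
def levenshtein(predicted, reference):
    """Edit distance between two sequences of phoneme labels.

    Counts the minimum number of insertions, deletions, and
    substitutions needed to turn `predicted` into `reference`.
    """
    # prev[j] holds the distance between the first i-1 predicted
    # labels and the first j reference labels.
    prev = list(range(len(reference) + 1))
    for i, p in enumerate(predicted, start=1):
        curr = [i]  # turning i predicted labels into an empty sequence
        for j, r in enumerate(reference, start=1):
            substitution = prev[j - 1] + (0 if p == r else 1)
            deletion = prev[j] + 1       # drop a predicted label
            insertion = curr[j - 1] + 1  # add a missing reference label
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1]


# Hypothetical phoneme sequences, chosen only for illustration.
reference = ["sil", "k", "ae", "t", "sil"]
predicted = ["sil", "k", "eh", "t", "s", "sil"]
distance = levenshtein(predicted, reference)
```

Here the predicted sequence differs from the reference by one substituted label and one extra label, so `distance` counts two edits. The function works on any sequences of comparable items, including plain strings.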


Pages: 593 / +

Page count: 2
