RECURRENT SUPPORT VECTOR MACHINES FOR SPEECH RECOGNITION

Cited by: 0

Authors:

Zhang, Shi-Xiong [1]

Zhao, Rui [1]

Liu, Chaojun [1]

Li, Jinyu [1]

Gong, Yifan [1]

Affiliations:

[1] Microsoft Corp, One Microsoft Way, Redmond, WA 98052 USA

Source:

2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS | 2016

Keywords:

Deep learning; LSTM; SVM; maximum margin; sequence training

DOI:

Not available

Chinese Library Classification:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

Recurrent Neural Networks (RNNs) using the Long Short-Term Memory (LSTM) architecture have demonstrated state-of-the-art performance on speech recognition. Most deep RNNs use the softmax activation function in the last layer for classification. This paper illustrates small but consistent advantages of replacing the softmax layer in an RNN with Support Vector Machines (SVMs). The parameters of the RNNs and SVMs are jointly learned using a sequence-level max-margin criterion instead of cross-entropy. The resulting model is termed the Recurrent SVM. Conventional SVMs need a predefined feature space and have no internal states to deal with arbitrary long-term dependencies in sequences. The proposed recurrent SVM uses LSTMs to learn the feature space and to capture temporal dependencies, while using the SVM (in the last layer) for sequence classification. The model is evaluated on the Windows Phone task for large vocabulary continuous speech recognition.
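The contrast between a softmax output layer trained with cross-entropy and an SVM output layer trained with a max-margin objective can be illustrated with a minimal sketch. This is not the paper's method: it works on a single frame rather than a whole sequence, it uses a Crammer-Singer style multiclass hinge loss as a stand-in for the sequence-level criterion, and the function names and the example scores are assumptions made purely for illustration.

```python
import math


def softmax_cross_entropy(scores, target):
    """Cross-entropy of a softmax over `scores` against class index `target`."""
    m = max(scores)  # subtract the max before exponentiating, for stability
    log_z = m + math.log(sum(math.exp(s - m) for s in scores))
    return log_z - scores[target]


def multiclass_hinge(scores, target, margin=1.0):
    """Multiclass hinge loss (Crammer-Singer style, frame-level stand-in).

    Only the strongest competing class is penalized, and only when it comes
    within `margin` of the target class score.
    """
    best_rival = max(s for k, s in enumerate(scores) if k != target)
    return max(0.0, margin + best_rival - scores[target])


# Hypothetical last-layer scores for three classes at one frame; in a
# recurrent model these would be linear functions of the LSTM output.
scores = [2.5, 0.3, 1.0]
ce = softmax_cross_entropy(scores, target=0)
hinge = multiclass_hinge(scores, target=0)
print(f"cross-entropy: {ce:.4f}, hinge: {hinge:.4f}")
```

In this sketch the hinge loss drops to exactly zero once the target class beats every rival by the margin, whereas the cross-entropy stays positive and keeps pushing the scores apart. That difference is the usual motivation for margin-based output layers.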


Pages: 5885-5889

Page count: 5
