RECURRENT SUPPORT VECTOR MACHINES FOR SPEECH RECOGNITION

Cited by: 0
Authors
Zhang, Shi-Xiong [1]
Zhao, Rui [1]
Liu, Chaojun [1]
Li, Jinyu [1]
Gong, Yifan [1]
Affiliations
[1] Microsoft Corp, One Microsoft Way, Redmond, WA 98052 USA
Keywords
Deep learning; LSTM; SVM; maximum margin; sequence training
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Recurrent Neural Networks (RNNs) using the Long Short-Term Memory (LSTM) architecture have demonstrated state-of-the-art performance on speech recognition. Most deep RNNs use the softmax activation function in the last layer for classification. This paper illustrates small but consistent advantages of replacing the softmax layer in an RNN with Support Vector Machines (SVMs). The parameters of the RNNs and SVMs are jointly learned using a sequence-level max-margin criterion instead of cross-entropy. The resulting model is termed the Recurrent SVM. Conventional SVMs require a predefined feature space and have no internal states to deal with arbitrary long-term dependencies in sequences. The proposed Recurrent SVM uses LSTMs to learn the feature space and to capture temporal dependencies, while using the SVM (in the last layer) for sequence classification. The model is evaluated on the Windows phone task for large vocabulary continuous speech recognition.
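The contrast between a softmax output layer trained with cross-entropy and a max-margin (SVM-style) output layer can be illustrated with a minimal sketch. It is a simplification under stated assumptions: it scores a single frame with the Crammer-Singer multiclass hinge loss, whereas the abstract describes a sequence-level max-margin criterion. The function names, the margin of 1.0, and the example scores are hypothetical and are not taken from the paper.

```python
import math


def softmax_cross_entropy(scores, label):
    """Cross-entropy of a softmax over raw class scores for one frame."""
    m = max(scores)  # subtract the max for numerical stability
    log_z = m + math.log(sum(math.exp(s - m) for s in scores))
    return log_z - scores[label]


def multiclass_hinge(scores, label, margin=1.0):
    """Crammer-Singer multiclass hinge loss for one frame (assumed form)."""
    # Highest score among the competing (incorrect) classes.
    worst = max(s for k, s in enumerate(scores) if k != label)
    # Zero loss once the correct class wins by at least `margin`.
    return max(0.0, margin + worst - scores[label])


# Hypothetical last-layer scores for three classes; in a recurrent model
# these would be computed from the LSTM hidden state at one time step.
scores = [2.0, 0.5, -1.0]

ce_correct = softmax_cross_entropy(scores, 0)
hinge_correct = multiclass_hinge(scores, 0)
hinge_wrong = multiclass_hinge(scores, 1)
```

In this sketch the hinge loss is exactly zero when the correct class already leads by the margin, while the cross-entropy stays positive for any finite scores. That is the usual qualitative difference between the two criteria.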
Pages: 5885-5889
Number of pages: 5