Convolutional support vector machines for speech recognition

被引:14
|
作者
Passricha, Vishal [1 ]
Aggarwal, Rajesh Kumar [1 ]
机构
[1] Natl Inst Technol, Comp Engn Dept, Kurukshetra, Haryana, India
关键词
ASR; CNN; SVM; Maximum margin; CSVM; NEURAL-NETWORKS; FEATURES; MODELS;
D O I
10.1007/s10772-018-09584-4
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Convolutional neural networks (CNNs) have demonstrated the state-of-the-art performances on automatic speech recognition. Softmax activation function for prediction and minimizing the cross-entropy loss is employed by most of the CNNs. This paper proposes a new deep architecture in which two heterogeneous classification techniques named as CNN and support vector machines (SVMs) are combined together. In this proposed model, features are learned using convolution layer and classified by SVMs. The last layer of CNN i.e. softmax layer is replaced by SVMs to efficiently deal with high dimensional features. This model should be interpreted as a special form of structured SVM and named as convolutional support vector machine (CSVM). Instead of training each component separately, the parameters of CNN and SVMs are jointly trained using frame level max-margin, sequence level max-margin, and state-level minimum Bayes risk criterion. The performance of CSVM is checked on TIMIT and Wall Street Journal datasets for phone recognition. By incorporating the features of both CNN and SVMs, CSVM improves the result by 13.33% and 2.31% over baseline CNN and segmental recurrent neural networks respectively.
引用
收藏
页码:601 / 609
页数:9
相关论文
共 50 条
  • [41] Efficient Speech Emotion Recognition Using Binary Support Vector Machines & Multiclass SVM
    Kanth, N. Ratna
    Saraswathi, S.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2015, : 542 - 547
  • [42] Multiple regression using support vector machines for recognition of speech in a moving car environment
    Lee, W
    Sekhar, CC
    Takeda, K
    Itakura, F
    [J]. ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING: COMPUTATIONAL INTELLIGENCE FOR THE E-AGE, 2002, : 904 - 908
  • [43] Kernel Support Vector Machines and Convolutional Neural Networks
    Jiang, Shihao
    Hartley, Richard
    Fernando, Basura
    [J]. 2018 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2018, : 560 - 566
  • [44] Iris recognition using support vector machines
    Wang, Y
    Han, JQ
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2004, PT 1, 2004, 3173 : 622 - 628
  • [45] BODY TYPE RECOGNITION BY SUPPORT VECTOR MACHINES
    Wang, Yuxiu
    Liu, Hao
    Li, Xiaojiu
    [J]. ITC&DC: 5TH INTERNATIONAL TEXTILE, CLOTHING & DESIGN CONFERENCE 2010, BOOK OF PROCEEDINGS: MAGIC WORLD OF TEXTILES, 2010, : 758 - 762
  • [46] A tutorial on Support Vector Machines for pattern recognition
    Burges, CJC
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) : 121 - 167
  • [47] Support vector machines for Thai phoneme recognition
    Thubthong, N
    Kijsirikul, B
    [J]. INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2001, 9 (06) : 803 - 813
  • [48] Support Vector Machines for Traffic Signs Recognition
    Shi, Min
    Wu, Haifeng
    Fleyeh, Hasan
    [J]. 2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 3820 - +
  • [49] A Tutorial on Support Vector Machines for Pattern Recognition
    Christopher J.C. Burges
    [J]. Data Mining and Knowledge Discovery, 1998, 2 : 121 - 167
  • [50] Support vector machines for olfactory signals recognition
    Distante, C
    Ancona, N
    Siciliano, P
    [J]. SENSORS AND ACTUATORS B-CHEMICAL, 2003, 88 (01): : 30 - 39