Structured Support Vector Machines for Noise Robust Continuous Speech Recognition

Cited by: 0
Authors
Zhang, Shi-Xiong [1]
Gales, M. J. F. [1]
Affiliations
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
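Keywords
speech recognition; structural SVMs; optimal alignment; large margin; log linear model
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract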
Discriminative models are an interesting alternative to generative models for speech recognition. This paper examines one form of these models, structured support vector machines (SVMs), for noise-robust speech recognition. One important aspect of structured SVMs is the form of the joint feature space. In this work, features based on generative models are used, which allows model-based compensation schemes to be applied to yield robust joint features. However, these features require the segmentation of frames into words, or sub-words, to be specified. In previous work, this segmentation was obtained using generative models. Here, the segmentations are refined using the parameters of the structured SVM. A Viterbi-like scheme for obtaining "optimal" segmentations is described, along with modifications to the training algorithm that allow them to be used efficiently. The performance of the approach is evaluated on a noise-corrupted continuous digit task: AURORA 2.
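The abstract does not spell out the Viterbi-like segmentation scheme, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes the word sequence is known and that the score of a segmentation is a sum of per-segment scores; the function names `best_segmentation` and `toy_score` and the toy scoring rule are hypothetical. In the paper's setting, the per-segment score would presumably come from the structured SVM parameters applied to generative-model features.

```python
from typing import Callable, List, Sequence, Tuple

Segment = Tuple[str, int, int]  # (label, start frame, end frame), end exclusive


def best_segmentation(
    num_frames: int,
    labels: Sequence[str],
    segment_score: Callable[[str, int, int], float],
) -> Tuple[float, List[Segment]]:
    """Dynamic-programming search for the highest-scoring split of the
    frames [0, num_frames) into len(labels) contiguous, non-empty segments."""
    neg_inf = float("-inf")
    k_max = len(labels)
    # best[k][t]: best score for aligning the first k labels to the first t frames
    best = [[neg_inf] * (num_frames + 1) for _ in range(k_max + 1)]
    back = [[-1] * (num_frames + 1) for _ in range(k_max + 1)]
    best[0][0] = 0.0
    for k in range(1, k_max + 1):
        for t in range(k, num_frames + 1):
            # s is the start frame of the k-th segment
            for s in range(k - 1, t):
                if best[k - 1][s] == neg_inf:
                    continue
                cand = best[k - 1][s] + segment_score(labels[k - 1], s, t)
                if cand > best[k][t]:
                    best[k][t] = cand
                    back[k][t] = s
    if best[k_max][num_frames] == neg_inf:
        raise ValueError("no valid segmentation for these labels and frames")
    # Trace the stored boundaries back from the final frame.
    segments: List[Segment] = []
    t = num_frames
    for k in range(k_max, 0, -1):
        s = back[k][t]
        segments.append((labels[k - 1], s, t))
        t = s
    segments.reverse()
    return best[k_max][num_frames], segments


# Toy data (an assumption for illustration): 1-D frames and one mean per label.
frames = [0.0, 0.0, 0.0, 1.0, 1.0, 2.0]
means = {"a": 0.0, "b": 1.0, "c": 2.0}


def toy_score(label: str, start: int, end: int) -> float:
    # Negative squared distance to the label mean stands in for a model score.
    return -sum((x - means[label]) ** 2 for x in frames[start:end])


score, segments = best_segmentation(len(frames), ["a", "b", "c"], toy_score)
print(score, segments)
```

The search costs on the order of K·T² segment-score evaluations for K labels and T frames, which is why a practical system would likely restrict segment lengths or prune the search.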
Pages: 996-999
Page count: 4
Related papers
50 in total
  • [1] INFINITE STRUCTURED SUPPORT VECTOR MACHINES FOR SPEECH RECOGNITION
    Yang, J.
    van Dalen, R. C.
    Zhang, S. -X.
    Gales, M. J. F.
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014
  • [2] STRUCTURED DISCRIMINATIVE MODELS FOR NOISE ROBUST CONTINUOUS SPEECH RECOGNITION
    Ragni, A.
    Gales, M. J. F.
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4788 - 4791
  • [3] Tone recognition of continuous Cantonese speech based on support vector machines
    Peng, G
    Wang, WSY
    [J]. SPEECH COMMUNICATION, 2005, 45 (01) : 49 - 62
  • [4] Robust Noisy Speech Recognition Using Deep Neural Support Vector Machines
    Amami, Rimah
    Ben Ayed, Dorra
    [J]. DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE, 2019, 800 : 300 - 307
  • [5] Lattice segmentation and support vector machines for large vocabulary continuous speech recognition
    Venkataramani, V
    Byrne, W
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 817 - 820
  • [6] Speech Recognition using Support Vector Machines
    Aida-zade, Kamil
    Xocayev, Anar
    Rustamov, Samir
    [J]. 2016 IEEE 10TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT), 2016, : 108 - 111
  • [7] Convolutional support vector machines for speech recognition
    Passricha, Vishal
    Aggarwal, Rajesh Kumar
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (03) : 601 - 609
  • [8] RECURRENT SUPPORT VECTOR MACHINES FOR SPEECH RECOGNITION
    Zhang, Shi-Xiong
    Zhao, Rui
    Liu, Chaojun
    Li, Jinyu
    Gong, Yifan
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5885 - 5889
  • [9] Applications of support vector machines to speech recognition
    Ganapathiraju, A
    Hamaker, JE
    Picone, J
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (08) : 2348 - 2355
  • [10] Infinite Support Vector Machines in Speech Recognition
    Yang, Jingzhou
    van Dalen, Rogier C.
    Gales, Mark
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3302 - 3306