Structured Support Vector Machines for Noise Robust Continuous Speech Recognition

Cited by: 0
Authors
Zhang, Shi-Xiong [1]
Gales, M. J. F. [1]
Affiliations
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
Keywords
speech recognition; structural SVMs; optimal alignment; large margin; log linear model
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The use of discriminative models is an interesting alternative to generative models for speech recognition. This paper examines one form of these models, structured support vector machines (SVMs), for noise robust speech recognition. One important aspect of structured SVMs is the form of the joint feature space. In this work features based on generative models are used, which allows model-based compensation schemes to be applied to yield robust joint features. However, these features require the segmentation of frames into words, or sub-words, to be specified. In previous work this segmentation was obtained using generative models. Here the segmentations are refined using the parameters of the structured SVM. A Viterbi-like scheme for obtaining "optimal" segmentations, and modifications to the training algorithm to allow them to be efficiently used, are described. The performance of the approach is evaluated on a noise corrupted continuous digit task: AURORA 2.
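The abstract mentions a Viterbi-like search for "optimal" segmentations without giving details. The following is a minimal sketch of that general idea under stated assumptions, not the authors' algorithm: it splits a frame sequence into a fixed number of consecutive, non-empty segments so that the sum of per-segment scores is maximal. The names `best_segmentation`, `segment_score`, and `toy_score` are hypothetical. In a structured SVM the segment score would presumably be a weighted combination of generative-model features; here a toy squared-error score stands in for it.

```python
import math
from typing import Callable, List, Tuple


def best_segmentation(
    num_frames: int,
    num_segments: int,
    segment_score: Callable[[int, int, int], float],
) -> Tuple[float, List[Tuple[int, int]]]:
    """Dynamic-programming search over segment boundaries.

    segment_score(n, start, end) scores assigning frames [start, end)
    to the n-th unit (word or sub-word) of a fixed label sequence.
    Returns the best total score and the (start, end) pair per segment.
    """
    if num_segments < 1 or num_frames < num_segments:
        raise ValueError("need at least one frame per segment")

    neg_inf = -math.inf
    # best[n][t]: best score for covering frames [0, t) with the first n segments
    best = [[neg_inf] * (num_frames + 1) for _ in range(num_segments + 1)]
    back = [[-1] * (num_frames + 1) for _ in range(num_segments + 1)]
    best[0][0] = 0.0

    for n in range(1, num_segments + 1):
        for t in range(n, num_frames + 1):
            for s in range(n - 1, t):  # s is where segment n starts
                if best[n - 1][s] == neg_inf:
                    continue
                cand = best[n - 1][s] + segment_score(n - 1, s, t)
                if cand > best[n][t]:
                    best[n][t] = cand
                    back[n][t] = s

    # Trace the stored start points back from the final frame.
    bounds: List[Tuple[int, int]] = []
    t = num_frames
    for n in range(num_segments, 0, -1):
        s = back[n][t]
        bounds.append((s, t))
        t = s
    bounds.reverse()
    return best[num_segments][num_frames], bounds


# Toy data (assumption): 1-D "frames" and one target value per unit.
frames = [0.1, 0.2, 0.9, 1.0, 1.1]
targets = [0.0, 1.0]


def toy_score(n: int, start: int, end: int) -> float:
    # Negative squared error of the frames in [start, end) against target n.
    return -sum((x - targets[n]) ** 2 for x in frames[start:end])


total, bounds = best_segmentation(len(frames), len(targets), toy_score)
print(total, bounds)
```

The search costs on the order of N·T² score evaluations for N segments and T frames; a practical system would likely restrict segment durations or reuse cached scores.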
Pages: 996-999
Number of pages: 4
Related papers
50 in total
  • [41] FACIAL EXPRESSION RECOGNITION WITH ROBUST COVARIANCE ESTIMATION AND SUPPORT VECTOR MACHINES
    Vretos, N.
    Tefas, A.
    Pitas, I.
    [J]. 2012 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2012,
  • [42] ON NOISE ESTIMATION FOR ROBUST SPEECH RECOGNITION USING VECTOR TAYLOR SERIES
    Zhao, Yong
    Juang, Biing-Hwang
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4290 - 4293
  • [43] A recursive feature vector normalization approach for robust speech recognition in noise
    Viikki, O
    Bye, D
    Laurila, K
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 733 - 736
  • [44] Psychoacoustic Model Compensation for Robust Continuous Speech Recognition in Additive Noise
    Das, Biswajit
    Panda, Ashish
    [J]. 2015 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2015, : 511 - 515
  • [45] Support vector machines for Segmental Minimum Bayes Risk decoding of continuous speech
    Venkataramani, V
    Chakrabartty, S
    Byrne, W
    [J]. ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 13 - 18
  • [46] Support vector machines employing cross-correlation for emotional speech recognition
    Chandaka, Suryannarayana
    Chatterjee, Amitava
    Munshi, Sugata
    [J]. MEASUREMENT, 2009, 42 (04) : 611 - 618
  • [47] Speech Recognition using Wavelet Packets, Neural Networks and Support Vector Machines
    Kulkarni, Purva
    Kulkarni, Saili
    Mulange, Sucheta
    Dand, Aneri
    Cheeran, Alice N.
    [J]. 2014 INTERNATIONAL CONFERENCE ON SIGNAL PROPAGATION AND COMPUTER TECHNOLOGY (ICSPCT 2014), 2014, : 451 - 455
  • [48] Speech Emotion Recognition Based on Fuzzy Least Squares Support Vector Machines
    Zhang, Shiqing
    [J]. 2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 1299 - 1302
  • [49] Implicit State-Tying for Support Vector Machines Based Speech Recognition
    Bolanos, Daniel
    Ward, Wayne
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 924 - 927
  • [50] RECOGNITION OF CONTINUOUS COMPLEX SPEECH BY MACHINES
    LEVINSON, SE
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1990, 87 (01) : 422 - 423