Structured Support Vector Machines for Noise Robust Continuous Speech Recognition

Citations: 0
Authors
Zhang, Shi-Xiong [1]
Gales, M. J. F. [1]
Affiliations
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
Keywords
speech recognition; structural SVMs; optimal alignment; large margin; log linear model
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The use of discriminative models is an interesting alternative to generative models for speech recognition. This paper examines one form of these models, structured support vector machines (SVMs), for noise robust speech recognition. One important aspect of structured SVMs is the form of the joint feature space. In this work features based on generative models are used, which allows model-based compensation schemes to be applied to yield robust joint features. However, these features require the segmentation of frames into words, or sub-words, to be specified. In previous work this segmentation was obtained using generative models. Here the segmentations are refined using the parameters of the structured SVM. A Viterbi-like scheme for obtaining "optimal" segmentations, and modifications to the training algorithm to allow them to be efficiently used, are described. The performance of the approach is evaluated on a noise corrupted continuous digit task: AURORA 2.
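The "Viterbi-like" search over segmentations mentioned in the abstract can be illustrated with a small dynamic program. The sketch below is not the authors' implementation. It assumes the label sequence is fixed and that the total score decomposes into a sum of per-segment scores; the function `best_segmentation`, the callback `segment_score`, and the toy data are all hypothetical names introduced for illustration.

```python
from typing import Callable, List, Tuple

NEG_INF = float("-inf")


def best_segmentation(
    num_frames: int,
    num_segments: int,
    segment_score: Callable[[int, int, int], float],
) -> Tuple[float, List[Tuple[int, int]]]:
    """Split frames [0, num_frames) into num_segments contiguous, non-empty
    segments so that the sum of segment_score(k, start, end) is maximal.

    segment_score(k, start, end) is a hypothetical callback scoring the
    k-th label against frames start..end-1.
    """
    if not 1 <= num_segments <= num_frames:
        raise ValueError("need 1 <= num_segments <= num_frames")

    # best[k][t]: best score covering the first t frames with k segments.
    best = [[NEG_INF] * (num_frames + 1) for _ in range(num_segments + 1)]
    back = [[-1] * (num_frames + 1) for _ in range(num_segments + 1)]
    best[0][0] = 0.0

    for k in range(1, num_segments + 1):
        for end in range(k, num_frames + 1):
            for start in range(k - 1, end):
                if best[k - 1][start] == NEG_INF:
                    continue
                cand = best[k - 1][start] + segment_score(k - 1, start, end)
                if cand > best[k][end]:
                    best[k][end] = cand
                    back[k][end] = start

    # Trace the stored boundaries back from the final frame.
    segments: List[Tuple[int, int]] = []
    end = num_frames
    for k in range(num_segments, 0, -1):
        start = back[k][end]
        segments.append((start, end))
        end = start
    segments.reverse()
    return best[num_segments][num_frames], segments


# Toy data (assumed): one scalar "feature" per frame, one target per label.
frames = [0.1, 0.2, 0.9, 1.1, 1.0]
targets = [0.0, 1.0]


def toy_score(k: int, start: int, end: int) -> float:
    # Negative squared error between the frames and the k-th label's target.
    return -sum((x - targets[k]) ** 2 for x in frames[start:end])


score, segments = best_segmentation(len(frames), len(targets), toy_score)
print(segments)
```

In this toy setting the search places the boundary between the second and third frames, where the values jump from near 0 to near 1. The triple loop costs on the order of K·T² score evaluations for K labels and T frames; a practical system would typically bound the segment length or prune the search.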
Pages: 996-999
Page count: 4