Task independent minimum confusibility training for continuous speech recognition

被引:0
|
作者
Nogueiras-Rodriguez, A [1 ]
Marino, JB [1 ]
机构
[1] Univ Politecn Catalunya, Barcelona, Spain
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, a task independent discriminative training framework for subword units based continuous speech recognition is presented. Instead of aiming at the optimisation of any task independent figure, say the phone classification or recognition rates, we focus our attention to the reduction of the number of errors committed by the system when a task is defined. This consideration leads to the use of a segmental approach based on the minimisation of the confusibility over short chains of subword units. Using this framework, a reduction of 32% in the string error rate may be achieved in the recognition of unknown length digit strings using task independent phone like units.
引用
收藏
页码:477 / 480
页数:4
相关论文
共 50 条
  • [1] Acoustic training system for speaker independent continuous arabic speech recognition system
    Nofal, M
    Abdel-Raheem, E
    El Henawy, H
    Kader, NA
    [J]. Proceedings of the Fourth IEEE International Symposium on Signal Processing and Information Technology, 2004, : 200 - 203
  • [2] Towards task-independent speech recognition
    Lefevre, F
    Gauvain, JL
    Lamel, L
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 521 - 524
  • [3] Minimum classification error training of landmark models for real-time continuous speech recognition
    McDermott, E
    Hazen, TJ
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 937 - 940
  • [4] Lattice segmentation and minimum Bayes risk discriminative training for large vocabulary continuous speech recognition
    Doumpiotis, V
    Byrne, W
    [J]. SPEECH COMMUNICATION, 2006, 48 (02) : 142 - 160
  • [5] Genericity and portability for task-independent speech recognition
    Lefevre, F
    Gauvain, JL
    Lamel, L
    [J]. COMPUTER SPEECH AND LANGUAGE, 2005, 19 (03): : 345 - 363
  • [6] Exploration of an Independent Training Framework for Speech Emotion Recognition
    Zhong, Shunming
    Yu, Baoxian
    Zhang, Han
    [J]. IEEE ACCESS, 2020, 8 : 222533 - 222543
  • [7] Minimum Bayes error feature selection for continuous speech recognition
    Saon, G
    Padmanabhan, M
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 13, 2001, 13 : 800 - 806
  • [8] PROTOTYPE-BASED MINIMUM ERROR TRAINING FOR SPEECH RECOGNITION
    MCDERMOTT, E
    KATAGIRI, S
    [J]. APPLIED INTELLIGENCE, 1994, 4 (03) : 245 - 256
  • [9] BOOKS ON TAPE AS TRAINING DATA FOR CONTINUOUS SPEECH RECOGNITION
    BOULIANNE, G
    KENNY, P
    LENNIG, M
    OSHAUGHNESSY, D
    MERMELSTEIN, P
    [J]. SPEECH COMMUNICATION, 1994, 14 (01) : 61 - 70
  • [10] TASK ADAPTATION IN SYLLABLE TRIGRAM MODELS FOR CONTINUOUS SPEECH RECOGNITION
    MATSUNAGA, S
    YAMADA, T
    SHIKANO, K
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1993, E76D (01) : 38 - 43