Vocabulary independent discriminative utterance verification for nonkeyword rejection in subword based speech recognition

被引:87
|
作者
Sukkar, RA
Lee, CH
机构
[1] Lucent Technologies, Bell Laboratories, Naperville, IL 60566
来源
关键词
D O I
10.1109/89.544527
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
An integral part of any deployable speech recognition system is the capability. to detect if the input speech does not contain any of the words in the recognizer vocabulary set. This capability, which is called utterance verification (or keyword recognition and nonkeyword rejection), is therefore becoming increasingly important as speech recognition systems continue to migrate from the laboratory to actual applications, In this paper we present a framework and a method for vocabulary independent utterance verification in subword-based speech recognition, The verification process is cast as a statistical hypothesis test, where vocabulary independence is accomplished through a two-stage verification process: subword-level verification followed by string-level verification, A verification function is defined and discriminatively trained to perform subword-level verification, String-level verification is accomplished by defining and evaluating an overall string-level log likelihood ratio that is a function of the subword-level verification scores, Experimental results show that this vocabulary-independent discriminative utterance verification method significantly outperforms a baseline method commonly. used in wordspotting tasks.
引用
收藏
页码:420 / 429
页数:10
相关论文
共 50 条
  • [41] Improving Discriminative Training for Robust Acoustic Models in Large Vocabulary Continuous Speech Recognition
    Pylkkonen, Janne
    Kurimo, Mikko
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1210 - 1213
  • [42] Discriminative training for large-vocabulary speech recognition using minimum classification error
    McDermott, Erik
    Hazen, Timothy J.
    Le Roux, Jonathan
    Nakamura, Atsushi
    Katagiri, Shigeru
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01): : 203 - 223
  • [43] Domain Corpus Independent Vocabulary Generation for Embedded Continuous Speech Recognition
    Lim, Minkyu
    Kim, Kwang-Ho
    Kim, Ji-Hwan
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2009, 55 (03) : 1631 - 1636
  • [44] Multilingual phone models for vocabulary-independent speech recognition tasks
    Köhler, J
    [J]. SPEECH COMMUNICATION, 2001, 35 (1-2) : 21 - 30
  • [45] ON LARGE-VOCABULARY SPEAKER-INDEPENDENT CONTINUOUS SPEECH RECOGNITION
    LEE, KF
    [J]. SPEECH COMMUNICATION, 1988, 7 (04) : 375 - 379
  • [46] Reliable unseen model prediction for vocabulary-independent speech recognition
    Kim, S
    Kim, H
    [J]. AI 2004: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 3339 : 599 - 609
  • [47] Chain-based Discriminative Autoencoders for Speech Recognition
    Lee, Hung-Shin
    Huang, Pin-Tuan
    Cheng, Yao-Fei
    Wang, Hsin-Min
    [J]. INTERSPEECH 2022, 2022, : 2078 - 2082
  • [48] A discriminative training framework using N-best speech recognition transcriptions and scores for spoken utterance classification
    Yaman, Sibel
    Deng, Li
    Yu, Dong
    Wang, Ye-Yi
    Acero, Alex
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 5 - +
  • [49] Lattice segmentation and minimum Bayes risk discriminative training for large vocabulary continuous speech recognition
    Doumpiotis, V
    Byrne, W
    [J]. SPEECH COMMUNICATION, 2006, 48 (02) : 142 - 160
  • [50] Discriminative training for large vocabulary telephone-based name recognition
    McDermott, E
    Biem, A
    Tenpaku, S
    Katagiri, S
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 3739 - 3742