Vocabulary independent discriminative utterance verification for nonkeyword rejection in subword based speech recognition

被引:87
|
作者
Sukkar, RA
Lee, CH
机构
[1] Lucent Technologies, Bell Laboratories, Naperville, IL 60566
来源
关键词
D O I
10.1109/89.544527
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
An integral part of any deployable speech recognition system is the capability. to detect if the input speech does not contain any of the words in the recognizer vocabulary set. This capability, which is called utterance verification (or keyword recognition and nonkeyword rejection), is therefore becoming increasingly important as speech recognition systems continue to migrate from the laboratory to actual applications, In this paper we present a framework and a method for vocabulary independent utterance verification in subword-based speech recognition, The verification process is cast as a statistical hypothesis test, where vocabulary independence is accomplished through a two-stage verification process: subword-level verification followed by string-level verification, A verification function is defined and discriminatively trained to perform subword-level verification, String-level verification is accomplished by defining and evaluating an overall string-level log likelihood ratio that is a function of the subword-level verification scores, Experimental results show that this vocabulary-independent discriminative utterance verification method significantly outperforms a baseline method commonly. used in wordspotting tasks.
引用
收藏
页码:420 / 429
页数:10
相关论文
共 50 条
  • [21] Analysis of HMM Temporal Evolution for Automatic Speech Recognition and Utterance Verification
    Casar, Marta
    Fonollosa, Jose A. R.
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 613 - 616
  • [22] Subword unit based speech recognition in car environments
    Fischer, A
    Stahl, V
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 257 - 260
  • [23] Discriminative training of decoding graphs for large vocabulary continuous speech recognition
    Kuo, Hong-Kwang Jeff
    Kingsbury, Brian
    Zweig, Geoffrey
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 45 - +
  • [24] Combining LVCSR and Vocabulary-Independent Ranked Utterance Retrieval for Robust Speech Search
    Olsson, J. Scott
    Oard, Douglas W.
    [J]. PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 91 - 98
  • [25] Improved discriminative training techniques for large vocabulary continuous speech recognition
    Povey, D
    Woodland, PC
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 45 - 48
  • [26] Discriminative training based on the criterion of least phone competing tokens for large vocabulary speech recognition
    Liu, B
    Jiang, H
    Zhou, JL
    Wang, RH
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 117 - 120
  • [27] DSP-based large vocabulary speaker-independent speech recognition
    Hirayama, H
    Yoshida, K
    Koga, S
    Hattori, H
    [J]. NEC RESEARCH & DEVELOPMENT, 1996, 37 (04): : 528 - 534
  • [28] Speaker verification through large vocabulary continuous speech recognition
    Newman, M
    Gillick, L
    Ito, Y
    McAllaster, D
    Peskin, B
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2419 - 2422
  • [29] Improved subword modeling for WFST-based speech recognition
    Smit, Peter
    Virpioja, Sami
    Kurimo, Mikko
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2551 - 2555
  • [30] DISCRIMINATIVE TRAINING OF HIERARCHICAL ACOUSTIC MODELS FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION
    Chang, Hung-An
    Glass, James R.
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4481 - 4484