Vocabulary independent discriminative utterance verification for nonkeyword rejection in subword based speech recognition

被引：87

作者：

Sukkar, RA

Lee, CH

机构：

[1] Lucent Technologies, Bell Laboratories, Naperville, IL 60566

来源：

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1996年 / 4卷 / 06期

关键词：

D O I：

10.1109/89.544527

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

An integral part of any deployable speech recognition system is the capability. to detect if the input speech does not contain any of the words in the recognizer vocabulary set. This capability, which is called utterance verification (or keyword recognition and nonkeyword rejection), is therefore becoming increasingly important as speech recognition systems continue to migrate from the laboratory to actual applications, In this paper we present a framework and a method for vocabulary independent utterance verification in subword-based speech recognition, The verification process is cast as a statistical hypothesis test, where vocabulary independence is accomplished through a two-stage verification process: subword-level verification followed by string-level verification, A verification function is defined and discriminatively trained to perform subword-level verification, String-level verification is accomplished by defining and evaluating an overall string-level log likelihood ratio that is a function of the subword-level verification scores, Experimental results show that this vocabulary-independent discriminative utterance verification method significantly outperforms a baseline method commonly. used in wordspotting tasks.

引用

页码：420 / 429

页数：10

共 50 条

[41] Improving Discriminative Training for Robust Acoustic Models in Large Vocabulary Continuous Speech Recognition
Pylkkonen, Janne
Kurimo, Mikko
[J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1210 - 1213
[42] Discriminative training for large-vocabulary speech recognition using minimum classification error
McDermott, Erik
Hazen, Timothy J.
Le Roux, Jonathan
Nakamura, Atsushi
Katagiri, Shigeru
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01): : 203 - 223
[43] Domain Corpus Independent Vocabulary Generation for Embedded Continuous Speech Recognition
Lim, Minkyu
Kim, Kwang-Ho
Kim, Ji-Hwan
[J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2009, 55 (03) : 1631 - 1636
[44] Multilingual phone models for vocabulary-independent speech recognition tasks
Köhler, J
[J]. SPEECH COMMUNICATION, 2001, 35 (1-2) : 21 - 30
[45] ON LARGE-VOCABULARY SPEAKER-INDEPENDENT CONTINUOUS SPEECH RECOGNITION
LEE, KF
[J]. SPEECH COMMUNICATION, 1988, 7 (04) : 375 - 379
[46] Reliable unseen model prediction for vocabulary-independent speech recognition
Kim, S
Kim, H
[J]. AI 2004: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 3339 : 599 - 609
[47] Chain-based Discriminative Autoencoders for Speech Recognition
Lee, Hung-Shin
Huang, Pin-Tuan
Cheng, Yao-Fei
Wang, Hsin-Min
[J]. INTERSPEECH 2022, 2022, : 2078 - 2082
[48] A discriminative training framework using N-best speech recognition transcriptions and scores for spoken utterance classification
Yaman, Sibel
Deng, Li
Yu, Dong
Wang, Ye-Yi
Acero, Alex
[J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 5 - +
[49] Lattice segmentation and minimum Bayes risk discriminative training for large vocabulary continuous speech recognition
Doumpiotis, V
Byrne, W
[J]. SPEECH COMMUNICATION, 2006, 48 (02) : 142 - 160
[50] Discriminative training for large vocabulary telephone-based name recognition
McDermott, E
Biem, A
Tenpaku, S
Katagiri, S
[J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 3739 - 3742

← 1 2 3 4 5 →