A fast vocabulary independent algorithm for spotting words in speech

被引：0

作者：

Dharanipragada, S ^{[1
]}

Roukos, S ^{[1
]}

机构：

[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Heights, NY 10598 USA

来源：

PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6 | 1998年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In applications such as audio-indexing, spoken message retrieval and video-browsing, it is necessary to have the ability to detect spoken words that are outside the vocabulary of the speech recognizer used in these systems, in large amounts of speech at speeds many times faster than real-time. In this paper we present a fast, vocabulary independent, algorithm for spotting words in speech. The algorithm consists of a preprocessing stage and a coarse-to-detailed search strategy for spotting a word/phone sequence in speech. The preprocessing method provides a phone-level representation of the speech that can be searched efficiently. The coarse search, consisting of phone-ngram matching, identifies regions of speech as putative word hits. The detailed acoustic match is then conducted only at the putative hits identified in the coarse match. This gives us the desired accuracy and speed in wordspotting. Overall, the algorithm has a speed of execution that is 2400 times faster than real-time.

引用

页码：233 / 236

页数：4

共 50 条

[21] Keyword-dependent monaural speech enhancement for open-vocabulary keyword spotting
Liu, Zuozhen
Wu, Chou
Li, Ta
Zhao, Qingwei
Shengxue Xuebao/Acta Acustica, 2023, 48 (02): : 415 - 424
[22] Phoneme-to-grapheme conversion for out-of-vocabulary words in large vocabulary speech recognition
Decadt, B
Duchateau, J
Daelemans, W
Wambacq, P
ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 413 - 416
[23] A fast hierarchical search algorithm for discriminative keyword spotting
Tabibian, Shima
Akbari, Ahmad
Nasersharif, Babak
INFORMATION SCIENCES, 2016, 336 : 45 - 59
[24] Language Independent and Unsupervised Acoustic Models for Speech Recognition and Keyword Spotting
Knill, Kate M.
Gales, Mark J. F.
Ragni, Anton
Rath, Shakti P.
15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 16 - 20
[25] Improved lattice-based speech keyword spotting algorithm
Department of Electronic Engineer, Tsinghua University, Beijing
100084, China
Qinghua Daxue Xuebao, 5 (508-513): : 508 - 513
[26] SYNTHESIS OF NEW WORDS FOR IMPROVED DYSARTHRIC SPEECH RECOGNITION ON AN EXPANDED VOCABULARY
Harvill, John
Issa, Dias
Hasegawa-Johnson, Mark
Yoo, Changdong
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6428 - 6432
[27] Detection Of Pronunciation Out Of Vocabulary Words From Speech Recognition System
Degtyarev, Vladimir M.
Gusev, Mikhail N.
EUROCON 2009: INTERNATIONAL IEEE CONFERENCE DEVOTED TO THE 150 ANNIVERSARY OF ALEXANDER S. POPOV, VOLS 1- 4, PROCEEDINGS, 2009, : 1723 - 1728
[28] Compound words in large-vocabulary German speech recognition systems
Berton, A
Fetter, P
RegelBrietzmann, P
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1165 - 1168
[29] A Russian Keyword Spotting System Based on Large Vocabulary Continuous Speech Recognition and Linguistic Knowledge
Smirnov, Valentin
Ignatov, Dmitry
Gusev, Michael
Farkhadov, Mais
Rumyantseva, Natalia
Farkhadova, Mukhabbat
JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2016, 2016
[30] Fast Algorithm for Partial Covers in Words
Kociumaka, Tomasz
Pissis, Solon P.
Radoszewski, Jakub
Rytter, Wojciech
Walen, Tomasz
ALGORITHMICA, 2015, 73 (01) : 217 - 233

← 1 2 3 4 5 →