A fast vocabulary independent algorithm for spotting words in speech

被引：0

作者：

Dharanipragada, S ^{[1
]}

Roukos, S ^{[1
]}

机构：

[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Heights, NY 10598 USA

来源：

PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6 | 1998年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In applications such as audio-indexing, spoken message retrieval and video-browsing, it is necessary to have the ability to detect spoken words that are outside the vocabulary of the speech recognizer used in these systems, in large amounts of speech at speeds many times faster than real-time. In this paper we present a fast, vocabulary independent, algorithm for spotting words in speech. The algorithm consists of a preprocessing stage and a coarse-to-detailed search strategy for spotting a word/phone sequence in speech. The preprocessing method provides a phone-level representation of the speech that can be searched efficiently. The coarse search, consisting of phone-ngram matching, identifies regions of speech as putative word hits. The detailed acoustic match is then conducted only at the putative hits identified in the coarse match. This gives us the desired accuracy and speed in wordspotting. Overall, the algorithm has a speed of execution that is 2400 times faster than real-time.

引用

页码：233 / 236

页数：4

共 50 条

[41] Fast speech adversarial example generation for keyword spotting system with conditional GAN
Wang, Donghua
Dong, Li
Wang, Rangding
Yan, Diqun
COMPUTER COMMUNICATIONS, 2021, 179 (179) : 145 - 156
[42] An Algorithm for Automatic Words Extraction From a Stream of Phones in Dictionary-Based Large Vocabulary Continuous Speech Recognition Systems
Biagetti, Giorgio
Crippa, Paolo
Falaschetti, Laura
Orcioni, Simone
Turchetti, Claudio
2015 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2015, : 18 - 23
[43] Modelling Semantic Context of OOV Words in Large Vocabulary Continuous Speech Recognition
Sheikh, Imran
Fohr, Dominique
Illina, Irina
Linares, Georges
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (03) : 598 - 610
[44] Domain Corpus Independent Vocabulary Generation for Embedded Continuous Speech Recognition
Lim, Minkyu
Kim, Kwang-Ho
Kim, Ji-Hwan
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2009, 55 (03) : 1631 - 1636
[45] HarkMan──A Vocabulary-Independent Keyword Spotter for Spontaneous Chinese Speech
郑方
徐明星
牟晓隆
武健
吴文虎
方棣棠
Journal of Computer Science and Technology, 1999, (01) : 18 - 26
[46] ON LARGE-VOCABULARY SPEAKER-INDEPENDENT CONTINUOUS SPEECH RECOGNITION
LEE, KF
SPEECH COMMUNICATION, 1988, 7 (04) : 375 - 379
[47] Multilingual phone models for vocabulary-independent speech recognition tasks
Köhler, J
SPEECH COMMUNICATION, 2001, 35 (1-2) : 21 - 30
[48] Reliable unseen model prediction for vocabulary-independent speech recognition
Kim, S
Kim, H
AI 2004: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 3339 : 599 - 609
[49] An adaptive and fast speech detection algorithm
Burileanu, D
Pascalin, L
Burileanu, C
Puchiu, M
TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2000, 1902 : 177 - 182
[50] HarkMan—A vocabulary-independent keyword spotter for spontaneous Chinese speech
Fang Zheng
Mingxing Xu
Xiaolong Mou
Jian Wu
Wenhu Wu
Ditang Fang
Journal of Computer Science and Technology, 1999, 14 (1) : 18 - 26

← 1 2 3 4 5 →