A fast vocabulary independent algorithm for spotting words in speech

被引:0
|
作者
Dharanipragada, S [1 ]
Roukos, S [1 ]
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Heights, NY 10598 USA
来源
PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6 | 1998年
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In applications such as audio-indexing, spoken message retrieval and video-browsing, it is necessary to have the ability to detect spoken words that are outside the vocabulary of the speech recognizer used in these systems, in large amounts of speech at speeds many times faster than real-time. In this paper we present a fast, vocabulary independent, algorithm for spotting words in speech. The algorithm consists of a preprocessing stage and a coarse-to-detailed search strategy for spotting a word/phone sequence in speech. The preprocessing method provides a phone-level representation of the speech that can be searched efficiently. The coarse search, consisting of phone-ngram matching, identifies regions of speech as putative word hits. The detailed acoustic match is then conducted only at the putative hits identified in the coarse match. This gives us the desired accuracy and speed in wordspotting. Overall, the algorithm has a speed of execution that is 2400 times faster than real-time.
引用
收藏
页码:233 / 236
页数:4
相关论文
共 50 条
  • [21] Keyword-dependent monaural speech enhancement for open-vocabulary keyword spotting
    Liu, Zuozhen
    Wu, Chou
    Li, Ta
    Zhao, Qingwei
    Shengxue Xuebao/Acta Acustica, 2023, 48 (02): : 415 - 424
  • [22] Phoneme-to-grapheme conversion for out-of-vocabulary words in large vocabulary speech recognition
    Decadt, B
    Duchateau, J
    Daelemans, W
    Wambacq, P
    ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 413 - 416
  • [23] A fast hierarchical search algorithm for discriminative keyword spotting
    Tabibian, Shima
    Akbari, Ahmad
    Nasersharif, Babak
    INFORMATION SCIENCES, 2016, 336 : 45 - 59
  • [24] Language Independent and Unsupervised Acoustic Models for Speech Recognition and Keyword Spotting
    Knill, Kate M.
    Gales, Mark J. F.
    Ragni, Anton
    Rath, Shakti P.
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 16 - 20
  • [25] Improved lattice-based speech keyword spotting algorithm
    Department of Electronic Engineer, Tsinghua University, Beijing
    100084, China
    Qinghua Daxue Xuebao, 5 (508-513): : 508 - 513
  • [26] SYNTHESIS OF NEW WORDS FOR IMPROVED DYSARTHRIC SPEECH RECOGNITION ON AN EXPANDED VOCABULARY
    Harvill, John
    Issa, Dias
    Hasegawa-Johnson, Mark
    Yoo, Changdong
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6428 - 6432
  • [27] Detection Of Pronunciation Out Of Vocabulary Words From Speech Recognition System
    Degtyarev, Vladimir M.
    Gusev, Mikhail N.
    EUROCON 2009: INTERNATIONAL IEEE CONFERENCE DEVOTED TO THE 150 ANNIVERSARY OF ALEXANDER S. POPOV, VOLS 1- 4, PROCEEDINGS, 2009, : 1723 - 1728
  • [28] Compound words in large-vocabulary German speech recognition systems
    Berton, A
    Fetter, P
    RegelBrietzmann, P
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1165 - 1168
  • [29] A Russian Keyword Spotting System Based on Large Vocabulary Continuous Speech Recognition and Linguistic Knowledge
    Smirnov, Valentin
    Ignatov, Dmitry
    Gusev, Michael
    Farkhadov, Mais
    Rumyantseva, Natalia
    Farkhadova, Mukhabbat
    JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2016, 2016
  • [30] Fast Algorithm for Partial Covers in Words
    Kociumaka, Tomasz
    Pissis, Solon P.
    Radoszewski, Jakub
    Rytter, Wojciech
    Walen, Tomasz
    ALGORITHMICA, 2015, 73 (01) : 217 - 233