A fast vocabulary independent algorithm for spotting words in speech

被引:0
|
作者
Dharanipragada, S [1 ]
Roukos, S [1 ]
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Heights, NY 10598 USA
来源
PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6 | 1998年
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In applications such as audio-indexing, spoken message retrieval and video-browsing, it is necessary to have the ability to detect spoken words that are outside the vocabulary of the speech recognizer used in these systems, in large amounts of speech at speeds many times faster than real-time. In this paper we present a fast, vocabulary independent, algorithm for spotting words in speech. The algorithm consists of a preprocessing stage and a coarse-to-detailed search strategy for spotting a word/phone sequence in speech. The preprocessing method provides a phone-level representation of the speech that can be searched efficiently. The coarse search, consisting of phone-ngram matching, identifies regions of speech as putative word hits. The detailed acoustic match is then conducted only at the putative hits identified in the coarse match. This gives us the desired accuracy and speed in wordspotting. Overall, the algorithm has a speed of execution that is 2400 times faster than real-time.
引用
收藏
页码:233 / 236
页数:4
相关论文
共 50 条
  • [41] Fast speech adversarial example generation for keyword spotting system with conditional GAN
    Wang, Donghua
    Dong, Li
    Wang, Rangding
    Yan, Diqun
    COMPUTER COMMUNICATIONS, 2021, 179 (179) : 145 - 156
  • [42] An Algorithm for Automatic Words Extraction From a Stream of Phones in Dictionary-Based Large Vocabulary Continuous Speech Recognition Systems
    Biagetti, Giorgio
    Crippa, Paolo
    Falaschetti, Laura
    Orcioni, Simone
    Turchetti, Claudio
    2015 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2015, : 18 - 23
  • [43] Modelling Semantic Context of OOV Words in Large Vocabulary Continuous Speech Recognition
    Sheikh, Imran
    Fohr, Dominique
    Illina, Irina
    Linares, Georges
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (03) : 598 - 610
  • [44] Domain Corpus Independent Vocabulary Generation for Embedded Continuous Speech Recognition
    Lim, Minkyu
    Kim, Kwang-Ho
    Kim, Ji-Hwan
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2009, 55 (03) : 1631 - 1636
  • [45] HarkMan──A Vocabulary-Independent Keyword Spotter for Spontaneous Chinese Speech
    郑方
    徐明星
    牟晓隆
    武健
    吴文虎
    方棣棠
    Journal of Computer Science and Technology, 1999, (01) : 18 - 26
  • [47] Multilingual phone models for vocabulary-independent speech recognition tasks
    Köhler, J
    SPEECH COMMUNICATION, 2001, 35 (1-2) : 21 - 30
  • [48] Reliable unseen model prediction for vocabulary-independent speech recognition
    Kim, S
    Kim, H
    AI 2004: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 3339 : 599 - 609
  • [49] An adaptive and fast speech detection algorithm
    Burileanu, D
    Pascalin, L
    Burileanu, C
    Puchiu, M
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2000, 1902 : 177 - 182
  • [50] HarkMan—A vocabulary-independent keyword spotter for spontaneous Chinese speech
    Fang Zheng
    Mingxing Xu
    Xiaolong Mou
    Jian Wu
    Wenhu Wu
    Ditang Fang
    Journal of Computer Science and Technology, 1999, 14 (1) : 18 - 26