New word detection in audio-indexing

被引:0
|
作者
Dharanipragada, S [1 ]
Roukos, S [1 ]
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Heights, NY 10598 USA
关键词
D O I
10.1109/ASRU.1997.659135
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For an Audio-Indexing system that uses a speech recognizer with a fixed vocabulary to be practical one needs the ability to detect out of vocabulary or new words at query time. In this paper we present a fast, vocabulary independent, algorithm for spotting words in speech. The algorithm consists of a preprocessing stage and a coarse-to-detailed search strategy for spotting a word/phone sequence in speech. The preprocessing method provides a phone-level representation of the speech that can be searched efficiently. The coarse search, consisting of phone-ngram matching, identifies regions of speech as putative word hits. The detailed acoustic match is then conducted only at the putative hits identified in the coarse match. This gives us the desired speed in wordspotting.
引用
收藏
页码:551 / 557
页数:7
相关论文
共 50 条
  • [1] Robust audio indexing for Dutch spoken word collections
    Ordelman, R
    de Jong, F
    Huijbregts, M
    van Leeuwen, D
    HUMANITIES, COMPUTERS AND CULTURAL HERITAGE, 2005, : 215 - 223
  • [2] Overlapping statistical word indexing: A new indexing method for Japanese text
    Ogawa, Y
    Matsuda, T
    PROCEEDINGS OF THE 20TH ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 1997, : 226 - 234
  • [3] Word spotting: A new approach to indexing handwriting
    Manmatha, R
    Han, CF
    Riseman, EM
    1996 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1996, : 631 - 637
  • [4] AUDIO INDEXING FOR EFFICIENCY
    RAHMLOW, HF
    PEDRICK, L
    EDUCATIONAL TECHNOLOGY, 1978, 18 (01) : 52 - 54
  • [5] Audio Indexing for YouTube
    Al Laham, Mohamad Nour
    Ayass, Imad
    Ghareeb, Majd
    El-Bazzal, Zouhair
    Raad, Mohamad
    2015 FIFTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION AND COMMUNICATION TECHNOLOGY AND ITS APPLICATIONS (DICTAP), 2015, : 111 - 114
  • [6] Indexing and Retrieval of Audio: A Survey
    Goujun Lu
    Multimedia Tools and Applications, 2001, 15 : 269 - 290
  • [7] State of the art in audio indexing
    Carré, M
    Philippe, P
    ANNALS OF TELECOMMUNICATIONS, 2000, 55 (9-10) : 507 - 525
  • [8] Audio characterization for video indexing
    Patel, NV
    Sethi, IK
    STORAGE AND RETRIEVAL FOR STILL IMAGE AND VIDEO DATABASES IV, 1996, 2670 : 373 - 384
  • [9] Indexing and retrieval of audio: A survey
    Lu, GJ
    MULTIMEDIA TOOLS AND APPLICATIONS, 2001, 15 (03) : 269 - 290
  • [10] State of the art in audio indexing
    Carre, Matthieu
    Philippe, Pierrick
    Annales des Telecommunications/Annals of Telecommunications, 2000, 55 (9-10): : 507 - 525