New word detection in audio-indexing

被引:0
|
作者
Dharanipragada, S [1 ]
Roukos, S [1 ]
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Heights, NY 10598 USA
关键词
D O I
10.1109/ASRU.1997.659135
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For an Audio-Indexing system that uses a speech recognizer with a fixed vocabulary to be practical one needs the ability to detect out of vocabulary or new words at query time. In this paper we present a fast, vocabulary independent, algorithm for spotting words in speech. The algorithm consists of a preprocessing stage and a coarse-to-detailed search strategy for spotting a word/phone sequence in speech. The preprocessing method provides a phone-level representation of the speech that can be searched efficiently. The coarse search, consisting of phone-ngram matching, identifies regions of speech as putative word hits. The detailed acoustic match is then conducted only at the putative hits identified in the coarse match. This gives us the desired speed in wordspotting.
引用
收藏
页码:551 / 557
页数:7
相关论文
共 50 条
  • [21] A new indexing method based on word proximity for Chinese text retrieval
    Du, L
    Sun, YF
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2000, 15 (03) : 280 - 286
  • [22] Audio indexing:: primary components retrieval -: Robust classification in audio documents
    Pinquier, Julien
    Andre-Obrecht, Regine
    MULTIMEDIA TOOLS AND APPLICATIONS, 2006, 30 (03) : 313 - 330
  • [23] An audio-scene cut detection method using fuzzy c-means algorithm for audio-visual indexing
    Nitanda, N
    Haseyama, M
    Kitajima, H
    2004 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL 2, PROCEEDINGS, 2004, : 89 - 92
  • [24] Wavelet-based indexing of audio data in audio/multimedia databases
    Subramanya, SR
    Youssef, A
    INTERNATIONAL WORKSHOP ON MULTI-MEDIA DATABASE MANAGEMENT SYSTEMS- PROCEEDINGS, 1998, : 46 - 53
  • [25] Indexing audio-visual sequences by joint audio and video processing
    Saraceno, C
    Leonardi, R
    VSMM98: FUTUREFUSION - APPLICATION REALITIES FOR THE VIRTUAL AGE, VOLS 1 AND 2, 1998, : 686 - 691
  • [26] CueVideo: Automated video/audio indexing and browsing
    Amir, A
    Srinivasan, S
    Ponceleon, D
    Petkovic, D
    SIGIR'99: PROCEEDINGS OF 22ND INTERNATIONAL CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 1999, : 326 - 326
  • [27] A GENERIC CLASSIFICATION SYSTEM FOR MULTI-CHANNEL AUDIO INDEXING: APPLICATION TO SPEECH AND MUSIC DETECTION
    Benaroya, Elie-Laurent
    Peeters, Geoffroy
    2013 14TH INTERNATIONAL WORKSHOP ON IMAGE ANALYSIS FOR MULTIMEDIA INTERACTIVE SERVICES (WIAMIS), 2013,
  • [28] Robustness evaluation of the basic descriptors for audio indexing
    Essafi, Hassane
    Sayah, Salima
    Ouddan, Mohamed Amine
    12TH INTERNATIONAL MULTI-MEDIA MODELLING CONFERENCE PROCEEDINGS, 2006, : 369 - 376
  • [29] Using audio description for indexing moving images
    Turner, JM
    Colinet, EL
    KNOWLEDGE ORGANIZATION, 2004, 31 (04): : 222 - 230
  • [30] Audio indexing for efficient music information retrieval
    Karydis, I
    Nanopoulos, A
    Papadopoulos, AN
    Manolopoulos, Y
    11TH INTERNATIONAL MULTIMEDIA MODELLING CONFERENCE, PROCEEDINGS, 2005, : 22 - 29