Quick audio retrieval using active search

被引:0
|
作者
Smith, G [1 ]
Murase, H [1 ]
Kashino, K [1 ]
机构
[1] NTT Corp, Basic Res Labs, Atsugi, Kanagawa 24301, Japan
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper discusses a method to search quickly through broadcast audio data to detect and locate known sounds using reference templates, based on the active search algorithm and histogram modeling of zero-crossing features. Active search reduces the number of candidate matches between reference and test template by up to 36 times compared to exhaustive search, while still remaining optimal. Computation is further reduced by using computationally inexpensive zero-crossing features. The method is robust against white noise addition down to 20dB signal-to-noise ratios and digitization noise.
引用
收藏
页码:3777 / 3780
页数:4
相关论文
共 50 条
  • [1] Time-series active search for quick retrieval of audio and video
    Kashino, K
    Smith, G
    Murase, H
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 2993 - 2996
  • [2] Quick audio retrieval using multiple feature vectors
    Kim, KM
    Kim, SY
    Jeon, JK
    Park, KS
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2006, 52 (01) : 200 - 205
  • [3] Novel quick audio retrieval method based on similarity
    Wu J.
    Ren G.
    Li P.
    Information Technology Journal, 2010, 9 (01) : 164 - 168
  • [4] Quick audio retrieval based on histogram feature sequences
    Kashino, Kunio
    Smith, Gavin
    Murase, Hiroshi
    Journal of the Acoustical Society of Japan (E) (English translation of Nippon Onkyo Gakkaishi), 2000, 21 (04): : 217 - 219
  • [5] Very quick audio searching: Introducing global pruning to the Time-Series Active Search
    Kimura, A
    Kashino, K
    Kurozumi, T
    Murase, H
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 1429 - 1432
  • [6] A quick specific audio retrieval algorithm based on general prediction
    Yao, Jincao
    Wan, Wanggen
    Yu, Xiaoqing
    Chang, Liaoyu
    Li, Changlian
    2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 1180 - 1184
  • [7] Feature fluctuation absorption for a quick audio retrieval from long recordings
    Kashino, K
    Kurozumi, T
    Murase, H
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS: IMAGE, SPEECH AND SIGNAL PROCESSING, 2000, : 98 - 101
  • [8] CROSS MODAL AUDIO SEARCH AND RETRIEVAL WITH JOINT EMBEDDINGS BASED ON TEXT AND AUDIO
    Elizalde, Benjamin
    Zarar, Shuayb
    Raj, Bhiksha
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 4095 - 4099
  • [9] Content-based classification, search, and retrieval of audio
    Wold, E
    Blum, T
    Keislar, D
    Wheaton, J
    IEEE MULTIMEDIA, 1996, 3 (03) : 27 - 36
  • [10] A method for direct audio search with applications to indexing and retrieval
    Johnson, SE
    Woodland, PC
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1427 - 1430