A method for direct audio search with applications to indexing and retrieval

被引:0
|
作者
Johnson, SE [1 ]
Woodland, PC [1 ]
机构
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A technique for searching audio data to find an exact match for a given piece of cue-audio is described. The method uses a cepstral parameterisation of the audio and a covariance-based distance metric to quickly locate direct repeats. Results on data from ABC news broadcasts show that the method can successfully locate matches several hundred times faster than real-time and requires less than a second of cue-audio. By applying the match recursively to the data, repeated sections of audio, which nearly always correspond to non-news items such as commercials and theme-music, can be identified. Experiments show that the application of the technique can also lead to improved information retrieval using automatically transcribed broadcast data.
引用
收藏
页码:1427 / 1430
页数:4
相关论文
共 50 条
  • [21] An Indexing Method of Mathematical Expression Retrieval
    Tian, Xuedong
    Yang, Songqiang
    Li, Xinfu
    Yang, Fang
    2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 574 - 578
  • [22] Indexing of plasma waveforms for accelerating search and retrieval of their subsequences
    Hochin, Teruhisa
    Yamauchi, Yoshihiro
    Nakanishi, Hideya
    Kojima, Mamoru
    Nomiya, Hiroki
    FUSION ENGINEERING AND DESIGN, 2010, 85 (05) : 649 - 654
  • [23] An indexing, browsing, search and retrieval system for audiovisual libraries
    Hunter, J
    Newmarch, J
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, PROCEEDINGS, 1999, 1696 : 76 - 91
  • [24] Combining audio and video for video sequence indexing applications
    Albiol, A
    Torres, L
    Delp, EJ
    IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, 2002, : A353 - A356
  • [25] Multi-scale audio indexing for translingual spoken document retrieval
    Wang, HM
    Meng, H
    Schone, P
    Chen, B
    Lo, WK
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 605 - 608
  • [26] Robust audio indexing and keyword retrieval optimized for the rescue operation domain
    Schneider, Daniel
    Winkler, Thomas
    Loeffler, Jobst
    Schon, Jochen
    MOBILE RESPONSE, 2007, 4458 : 135 - 142
  • [27] A vector-based approach to broadcast audio database indexing and retrieval
    Wang, Lei
    Li, Haizhou
    Chng, Eng Siong
    2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 512 - 515
  • [28] Content-based indexing and retrieval of audio data using wavelets
    Li, GH
    Khokhar, AA
    2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 885 - 888
  • [29] CROSS MODAL AUDIO SEARCH AND RETRIEVAL WITH JOINT EMBEDDINGS BASED ON TEXT AND AUDIO
    Elizalde, Benjamin
    Zarar, Shuayb
    Raj, Bhiksha
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 4095 - 4099
  • [30] Content-based classification, search, and retrieval of audio
    Wold, E
    Blum, T
    Keislar, D
    Wheaton, J
    IEEE MULTIMEDIA, 1996, 3 (03) : 27 - 36