Efficient Advertisement Discovery for Audio Podcast Content Using Candidate Segmentation

Authors
MN Nguyen
Qi Tian
Ping Xue
Affiliations
[1] Nanyang Technological University, School of Electrical and Electronic Engineering
[2] Institute for Infocomm Research
Keywords
Search Area; False Acceptance Rate; False Rejection Rate; Search Speed; Candidate Segment
DOI
Not available
Abstract
Audio podcasting is now widely used by online sites such as newspapers, web portals, and journals to deliver audio content to users through download or subscription. A single podcast story typically runs 1 to 30 minutes, and multiple audio advertisements (ads), each 5 to 30 seconds long, are often inserted and repeated at different locations within it. Automatically detecting these inserted ads is challenging because of the complexity of the search algorithms. Drawing on knowledge of the typical structure of podcast content, this paper proposes a novel, efficient advertisement discovery approach for large audio podcast collections. The proposed approach significantly improves search speed while maintaining sufficient accuracy. The acceleration comes from candidate segmentation and a sampling technique, which together reduce both the search area and the number of frames to be matched. The approach has been tested on a variety of podcast content collected from the MIT Technology Review, Scientific American, and Singapore Podcast websites. Experimental results show that the proposed algorithm achieves a detection rate of 97.5% with significant computational savings compared with existing state-of-the-art methods.