Efficient and Robust Detection of Duplicate Videos in a Large Database

Cited by: 23
Authors
Sarkar, Anindya [1]
Singh, Vishwarkarma [2]
Ghosh, Pratim [1]
Manjunath, Bangalore S. [1]
Singh, Ambuj [2]
Affiliations
[1] Univ Calif Santa Barbara, Dept Elect & Comp Engn, Santa Barbara, CA 93106 USA
[2] Univ Calif Santa Barbara, Dept Comp Sci, Santa Barbara, CA 93106 USA
Funding
U.S. National Science Foundation;
Keywords
Color layout descriptor (CLD); duplicate detection; nonmetric distance; vector quantization (VQ); video fingerprinting; HISTOGRAM;
DOI
10.1109/TCSVT.2010.2046056
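Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code
0808; 0809;
Abstract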
We present an efficient and accurate method for duplicate video detection in a large database using video fingerprints. We have empirically chosen the color layout descriptor, a compact and robust frame-based descriptor, to create fingerprints, which are further encoded by vector quantization (VQ). We propose a new nonmetric distance measure to find the similarity between the query and a database video fingerprint, and we experimentally show its superior performance over other distance measures for accurate duplicate detection. Existing indexing techniques cannot support efficient search over high-dimensional data under a nonmetric distance measure. Therefore, we develop novel search algorithms based on precomputed distances and new dataset pruning techniques, yielding practical retrieval times. We perform experiments with a database of 38 000 videos, worth 1600 h of content. For individual queries with an average duration of 60 s (about 50% of the average database video length), the duplicate video is retrieved in 0.032 s on a 2.33 GHz Intel Xeon CPU, with a very high accuracy of 97.5%.
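The abstract gives only an outline of the pipeline, so the following is a minimal illustrative sketch rather than the authors' method. It assumes that VQ encoding means assigning each frame descriptor to its nearest codebook vector and summarizing a video as a normalized histogram of codeword indices. The function names, the tiny two-dimensional "descriptors", and the codebook are all hypothetical. The L1 distance and the linear scan are plain stand-ins: the paper's nonmetric distance and its pruning-based search are not specified in this record and are not reproduced here.

```python
def nearest_codeword(vec, codebook):
    """Return the index of the codebook vector closest to vec (squared Euclidean)."""
    best_idx, best_dist = 0, float("inf")
    for idx, code in enumerate(codebook):
        dist = sum((a - b) ** 2 for a, b in zip(vec, code))
        if dist < best_dist:
            best_idx, best_dist = idx, dist
    return best_idx


def vq_fingerprint(frame_descriptors, codebook):
    """Encode a video as a normalized histogram of codeword indices (an assumption)."""
    hist = [0.0] * len(codebook)
    for vec in frame_descriptors:
        hist[nearest_codeword(vec, codebook)] += 1.0
    total = sum(hist)
    return [h / total for h in hist] if total else hist


def l1_distance(p, q):
    """Placeholder distance; NOT the nonmetric measure proposed in the paper."""
    return sum(abs(a - b) for a, b in zip(p, q))


def best_match(query_fp, database_fps):
    """Linear scan (no pruning): return (index, distance) of the closest fingerprint."""
    scored = ((i, l1_distance(query_fp, fp)) for i, fp in enumerate(database_fps))
    return min(scored, key=lambda pair: pair[1])


# Hypothetical toy data: 2-D vectors stand in for real frame descriptors.
codebook = [(0, 0), (10, 10), (0, 10)]
video_a = [(1, 0), (0, 1), (9, 10), (10, 9)]
video_b = [(0, 9), (1, 10), (0, 11), (10, 10)]
database = [vq_fingerprint(v, codebook) for v in (video_a, video_b)]

# A shorter, slightly perturbed excerpt of video_a used as the query.
query = vq_fingerprint([(0.5, 0.2), (9.5, 9.8)], codebook)
match_index, match_distance = best_match(query, database)
print(match_index, match_distance)
```

In this toy setup the query maps to the same histogram as `video_a`, so the scan returns index 0. A real system would need a learned codebook, actual color layout descriptors, and a search strategy that avoids comparing the query against every database entry.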
Pages: 870-885
Number of pages: 16