Learning spatial-temporal features for video copy detection by the combination of CNN and RNN

被引:28
|
作者
Hu, Yaocong [1 ,2 ,3 ]
Lu, Xiaobo [1 ,2 ,3 ]
机构
[1] Southeast Univ, Coll Automat, Nanjing 210096, Jiangsu, Peoples R China
[2] Southeast Univ, Sch Automat, Nanjing 210096, Jiangsu, Peoples R China
[3] Southeast Univ, Minist Educ, Key Lab Measurement & Control Complex Syst Engn, Nanjing 210096, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Video copyright; CNN; Sequence matching; SiamesLSTM; CLASSIFICATION; WATERMARKING;
D O I
10.1016/j.jvcir.2018.05.013
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Following the rapid developments of network multimedia, video copyright protection online has become a hot topic in recent researches. However, video copy detection is still a challenging task in the domain of video analysis and computer vision, due to the large variations in scale and illumination of the copied contents. In this paper, we propose a novel deep learning based approach, in which we jointly use the Convolution Neural Network (CNN) and Recurrent Neural Network (RNN) to solve the specific problem of detecting copied segments in videos. We first utilize a Residual Convolutional Neural Network(ResNet) to extract content features of frame-levels, and then employ a SiameseLSTM architecture for spatial-temporal fusion and sequence matching. Finally, the copied segments are detected by a graph based temporal network. We evaluate the performance of the proposed CNN-RNN based approach on a public large scale video copy dataset called VCDB, and the experiment results demonstrate the effectiveness and high robustness of our method which achieves the significant performance improvements compared to the state of the art.
引用
收藏
页码:21 / 29
页数:9
相关论文
共 50 条
  • [1] Video Copy Detection Using Spatio-Temporal CNN Features
    Zhou, Zhili
    Chen, Jingcheng
    Yang, Ching-Nung
    Sun, Xingming
    IEEE ACCESS, 2019, 7 : 100658 - 100665
  • [2] Multistep hybrid learning: CNN driven by spatial-temporal features for faults detection on metallic surfaces
    Fantinel, Riccardo
    Cenedese, Angelo
    JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (04)
  • [3] Spatial-Temporal Structural and Dynamics Features for Video Fire Detection
    Wang, Hongcheng
    Finn, Alan
    Erdinc, Ozgur
    Vincitore, Antonio
    2013 IEEE WORKSHOP ON APPLICATIONS OF COMPUTER VISION (WACV), 2013, : 513 - 519
  • [4] Scene Cut Detection in Video by using Combination of Spatial-Temporal Video Characteristics
    Jokovic, Jugoslav
    Dordevic, Danilo
    TELSIKS 2009, VOLS 1 AND 2, 2009, : 479 - 482
  • [5] Exploring spatial-temporal features fusion model for Deepfake video detection
    Wu, Jiujiu
    Zhou, Jiyu
    Wang, Danyu
    Wang, Lin
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (06)
  • [6] Spatial-Temporal Salient Unit Detection Based on Features in Video Sequences
    Liu, Suolan
    Yang, Wankou
    Wang, Hongyuan
    Sun, Changyin
    2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 3556 - 3559
  • [7] Deepfake Video Detection Model Based on Consistency of Spatial-Temporal Features
    Zhao L.
    Ge W.
    Mao Y.
    Han M.
    Li W.
    Li X.
    Gongcheng Kexue Yu Jishu/Advanced Engineering Sciences, 2020, 52 (04): : 243 - 250
  • [8] Learning Graph Enhanced Spatial-Temporal Coherence for Video Anomaly Detection
    Cheng, Kai
    Liu, Yang
    Zeng, Xinhua
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 314 - 318
  • [9] Learning Complementary Spatial-Temporal Transformer for Video Salient Object Detection
    Liu, Nian
    Nan, Kepan
    Zhao, Wangbo
    Yao, Xiwen
    Han, Junwei
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (08) : 10663 - 10673
  • [10] Spatial-temporal features for smoke detections on video images
    Ma, Li
    PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON MULTIMEDIA TECHNOLOGY (ICMT-13), 2013, 84 : 1284 - 1291