An End-to-End Two-Branch Network Towards Robust Video Fingerprinting

Cited by: 0
Authors
Xu Y. [1 ]
Zhou Y. [1 ]
Li X. [1 ]
Zhao G. [1 ]
Qin C. [1 ]
Affiliation
[1] University of Shanghai for Science and Technology, School of Optical-Electrical and Computer Engineering, Shanghai
Keywords
Content authentication; depthwise (DW) separable convolution; dilated convolution; robust video fingerprinting; two-branch network;
DOI
10.1109/TAI.2023.3318888
Abstract
With the increasing number of edited videos, many robust video fingerprinting schemes have been proposed to solve the problem of video content authentication. However, most of them either deal with the temporal and spatial features symmetrically or insufficiently consider the temporal information. In this work, an end-to-end two-branch network toward robust video fingerprinting (RVFNet) is proposed, where the two branches focus on the temporal and spatial information, respectively. The temporal branch aims to comprehensively capture complex motion patterns by combining subtle motion changes with the overall motion trend. The spatial branch exploits the pixel-level information obtained by multiple receptive fields while preserving significant structural features. Deep metric learning is employed in the training process, and we adopt hard triplet loss to constrain the generation of fingerprints. Furthermore, we construct a large-scale and complex dataset for the robust video fingerprinting task based on multiple video content-preserving manipulations in actual scenarios. The size of our dataset exceeds most datasets adopted in the current robust video fingerprinting schemes. Based on the proposed dataset, experimental results demonstrate that our scheme achieves outstanding performance improvements compared with the state of the art. © 2020 IEEE.
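The abstract mentions training with deep metric learning under a hard triplet loss but gives no formula. As a rough illustration only, the sketch below shows the common "batch-hard" variant, in which each anchor fingerprint is paired with its farthest same-video sample and its nearest different-video sample. The function names, the Euclidean distance, the margin value, and the batch-hard mining rule are all assumptions for illustration and may differ from what RVFNet actually uses.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length fingerprint vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def batch_hard_triplet_loss(fingerprints, video_ids, margin=0.3):
    """Mean batch-hard triplet loss over one batch (hypothetical helper).

    fingerprints: list of float vectors, one per video clip.
    video_ids:    source-video label per clip; manipulated copies of the
                  same video share a label (assumed labeling scheme).
    margin:       assumed hyperparameter, not taken from the paper.
    """
    losses = []
    for i, (anchor, vid) in enumerate(zip(fingerprints, video_ids)):
        # Distances to other clips of the same source video (positives).
        pos = [euclidean(anchor, f)
               for j, (f, v) in enumerate(zip(fingerprints, video_ids))
               if v == vid and j != i]
        # Distances to clips of different source videos (negatives).
        neg = [euclidean(anchor, f)
               for f, v in zip(fingerprints, video_ids) if v != vid]
        if not pos or not neg:
            continue  # no valid triplet for this anchor
        # Hardest positive is the farthest; hardest negative is the nearest.
        losses.append(max(0.0, max(pos) - min(neg) + margin))
    return sum(losses) / len(losses) if losses else 0.0


# Toy batch: two source videos, two one-dimensional fingerprints each.
toy_fingerprints = [[0.0], [3.0], [1.0], [4.0]]
toy_ids = [0, 0, 1, 1]
toy_loss = batch_hard_triplet_loss(toy_fingerprints, toy_ids, margin=0.5)
```

In this toy batch every anchor has a positive farther away than its nearest negative, so the loss is positive; a well-trained fingerprint extractor would drive it toward zero by pulling manipulated copies of the same video together and pushing different videos apart.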
Pages: 2371-2384
Page count: 13
Related papers
50 in total
  • [41] MPNET: An End-to-End Deep Neural Network for Object Detection in Surveillance Video
    Wang, Hanyu
    Wang, Ping
    Qian, Xueming
    IEEE ACCESS, 2018, 6 : 30296 - 30308
  • [42] Models and Analysis of Video Streaming End-to-End Distortion over LTE Network
    Fu, Huayong
    Yuan, Hui
    Li, Mengyu
    Sun, Zhenzhen
    Li, Fengrong
    PROCEEDINGS OF THE 2016 IEEE 11TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2016, : 516 - 521
  • [43] End-to-End Video Saliency Detection via a Deep Contextual Spatiotemporal Network
    Wei, Lina
    Zhao, Shanshan
    Bourahla, Omar Farouk
    Li, Xi
    Wu, Fei
    Zhuang, Yueting
    Han, Junwei
    Xu, Mingliang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (04) : 1691 - 1702
  • [44] End-to-end video subtitle recognition via a deep Residual Neural Network
    Yan, Hongyu
    Xu, Xin
    PATTERN RECOGNITION LETTERS, 2020, 131 : 368 - 375
  • [45] A Robust and Accurate End-to-End Template Matching Method Based on the Siamese Network
    Ren, Qiang
    Zheng, Yongbin
    Sun, Peng
    Xu, Wanying
    Zhu, Di
    Yang, Dongxu
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [46] Invited paper: Towards robust end-to-end neural network-based transceivers for short reach fiber links
    Karanov, Boris
    Optical Fiber Technology, 2025, 90
  • [47] Towards the Design of an End-to-End Automated System for Image and Video-based Recognition
    Chellappa, Rama
    Chen, Jun-Cheng
    Ranjan, Rajeev
    Sankaranarayanan, Swami
    Kumar, Amit
    Patel, Vishal M.
    Castillo, Carlos D.
    2016 INFORMATION THEORY AND APPLICATIONS WORKSHOP (ITA), 2016,
  • [48] End-to-End Transport for Video QoE Fairness
    Nathan, Vikram
    Sivaraman, Vibhaalakshmi
    Addanki, Ravichandra
    Khani, Mehrdad
    Goyal, Prateesh
    Alizadeh, Mohammad
    SIGCOMM '19 - PROCEEDINGS OF THE ACM SPECIAL INTEREST GROUP ON DATA COMMUNICATION, 2019, : 408 - 423
  • [49] End-to-End Video Instance Segmentation with Transformers
    Wang, Yuqing
    Xu, Zhaoliang
    Wang, Xinlong
    Shen, Chunhua
    Cheng, Baoshan
    Shen, Hao
    Xia, Huaxia
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8737 - 8746
  • [50] End-to-end stereoscopic video streaming system
    Pehlivan, Selen
    Aksay, Anil
    Bilen, Cagdas
    Akar, Gozde Bozdagi
    Civanlar, M. Reha
    2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 2169 - 2172