An End-to-End Two-Branch Network Towards Robust Video Fingerprinting

被引：0

作者：

Xu Y. ^{[1
]}

Zhou Y. ^{[1
]}

Li X. ^{[1
]}

Zhao G. ^{[1
]}

Qin C. ^{[1
]}

机构：

[1] University of Shanghai for Science and Technology, School of Optical-Electrical and Computer Engineering, Shanghai

来源：

IEEE Transactions on Artificial Intelligence | 2024年 / 5卷 / 05期

关键词：

Content authentication; depthwise (DW) separable convolution; dilated convolution; robust video fingerprinting; two-branch network;

D O I：

10.1109/TAI.2023.3318888

中图分类号：

学科分类号：

摘要：

With the increasing number of edited videos, many robust video fingerprinting schemes have been proposed to solve the problem of video content authentication. However, most of them either deal with the temporal and spatial features symmetrically or insufficiently consider the temporal information. In this work, an end-to-end two-branch network toward robust video fingerprinting (RVFNet) is proposed, where the two branches focus on the temporal and spatial information, respectively. The temporal branch aims to comprehensively capture complex motion patterns by combining subtle motion changes with the overall motion trend. The spatial branch exploits the pixel-level information obtained by multiple receptive fields while preserving significant structural features. Deep metric learning is employed in the training process, and we adopt hard triplet loss to constrain the generation of fingerprints. Furthermore, we construct a large-scale and complex dataset for the robust video fingerprinting task based on multiple video content-preserving manipulations in actual scenarios. The size of our dataset exceeds most datasets adopted in the current robust video fingerprinting schemes. Based on the proposed dataset, experimental results demonstrate that our scheme achieves outstanding performance improvements compared with the state of the art. © 2020 IEEE.

引用

页码：2371 / 2384

页数：13

共 50 条

[41] MPNET: An End-to-End Deep Neural Network for Object Detection in Surveillance Video
Wang, Hanyu
Wang, Ping
Qian, Xueming
IEEE ACCESS, 2018, 6 : 30296 - 30308
[42] Models and Analysis of Video Streaming End-to-End Distortion over LTE Network
Fu, Huayong
Yuan, Hui
Li, Mengyu
Sun, Zhenzhen
Li, Fengrong
PROCEEDINGS OF THE 2016 IEEE 11TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2016, : 516 - 521
[43] End-to-End Video Saliency Detection via a Deep Contextual Spatiotemporal Network
Wei, Lina
Zhao, Shanshan
Bourahla, Omar Farouk
Li, Xi
Wu, Fei
Zhuang, Yueting
Han, Junwei
Xu, Mingliang
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (04) : 1691 - 1702
[44] End-to-end video subtitle recognition via a deep Residual Neural Network
Yan, Hongyu
Xu, Xin
PATTERN RECOGNITION LETTERS, 2020, 131 : 368 - 375
[45] A Robust and Accurate End-to-End Template Matching Method Based on the Siamese Network
Ren, Qiang
Zheng, Yongbin
Sun, Peng
Xu, Wanying
Zhu, Di
Yang, Dongxu
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[46] Invited paper: Towards robust end-to-end neural network-based transceivers for short reach fiber links
Karanov, Boris
Optical Fiber Technology, 2025, 90
[47] Towards the Design of an End-to-End Automated System for Image an Video-based Recognition
Chellappa, Rama
Chen, Jun-Cheng
Ranjan, Rajeev
Sankaranarayanan, Swami
Kumar, Amit
Patel, Vishal M.
Castillo, Carlos D.
2016 INFORMATION THEORY AND APPLICATIONS WORKSHOP (ITA), 2016,
[48] End-to-End Transport for Video QoE Fairness
Nathan, Vikram
Sivaraman, Vibhaalakshmi
Addanki, Ravichandra
Khani, Mehrdad
Goyal, Prateesh
Alizadeh, Mohammad
SIGCOMM '19 - PROCEEDINGS OF THE ACM SPECIAL INTEREST GROUP ON DATA COMMUNICATION, 2019, : 408 - 423
[49] End-to-End Video Instance Segmentation with Transformers
Wang, Yuqing
Xu, Zhaoliang
Wang, Xinlong
Shen, Chunhua
Cheng, Baoshan
Shen, Hao
Xia, Huaxia
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8737 - 8746
[50] End-to-end stereoscopic video streaming system
Pehlivan, Selen
Aksay, Anil
Bilen, Cagdas
Akar, Gozde Bozdagi
Civanlar, M. Reha
2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 2169 - 2172

← 1 2 3 4 5 →