Spatio-temporal co-attention fusion network for video splicing localization

被引：0

作者：

Lin, Man ^{[1
,2
]}

Cao, Gang ^{[1
,2
]}

Lou, Zijie ^{[1
,2
]}

Zhang, Chi ^{[1
,2
]}

机构：

[1] Commun Univ China, Sch Comp & Cyber Sci, Beijing, Peoples R China

[2] Commun Univ China, State Key Lab Media Convergence & Commun, Beijing, Peoples R China

来源：

JOURNAL OF ELECTRONIC IMAGING | 2024年 / 33卷 / 03期

基金：

中国国家自然科学基金;

关键词：

video forensics; digital video forgery; video splicing localization; co-attention; MANIPULATION;

D O I：

10.1117/1.JEI.33.3.033027

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

. Digital video splicing has become easy and ubiquitous. Malicious users copy some regions of a video and paste them into another video to create realistic forgeries. It is important to blindly detect such forgery regions in videos. A spatio-temporal co-attention fusion network (SCFNet) is proposed for video splicing localization. Specifically, a three-stream network is used as an encoder to capture manipulation traces across multiple frames. The deep interaction and fusion of spatio-temporal forensic features are achieved by the novel parallel and cross co-attention fusion modules. A lightweight multilayer perceptron decoder is adopted to yield a pixel-level tampering localization map. A new large-scale video splicing dataset is created for training the SCFNet. Extensive tests on benchmark datasets show that the localization and generalization performances of our SCFNet outperform the state-of-the-art. Code and datasets are available at https://github.com/multimediaFor/SCFNet.

引用

页数：9

共 50 条

[21] Video modeling by spatio-temporal resampling and Bayesian fusion
Zheng, Yunfei
Li, Xin
2007 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-7, 2007, : 3201 - 3204
[22] Spatio-Temporal Self-Attention Network for Fire Detection and Segmentation in Video Surveillance
Shahid, Mohammad
Virtusio, John Jethro
Wu, Yu-Hsien
Chen, Yung-Yao
Tanveer, M.
Muhammad, Khan
Hua, Kai-Lung
IEEE ACCESS, 2022, 10 : 1259 - 1275
[23] PASTFNet: a paralleled attention spatio-temporal fusion network for micro-expression recognition
Tian, Haichen
Gong, Weijun
Li, Wei
Qian, Yurong
MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2024, 62 (06) : 1911 - 1924
[24] PASTFNet: a paralleled attention spatio-temporal fusion network for micro-expression recognition
Haichen Tian
Weijun Gong
Wei Li
Yurong Qian
Medical & Biological Engineering & Computing, 2024, 62 : 1911 - 1924
[25] HASTF: a hybrid attention spatio-temporal feature fusion network for EEG emotion recognition
Hu, Fangzhou
Wang, Fei
Bi, Jinying
An, Zida
Chen, Chao
Qu, Gangguo
Han, Shuai
FRONTIERS IN NEUROSCIENCE, 2024, 18
[26] COMatchNet: Co-Attention Matching Network for Video Object Segmentation
Huang, Lufei
Sun, Fengming
Yuan, Xia
PATTERN RECOGNITION, ACPR 2021, PT I, 2022, 13188 : 271 - 284
[27] Co-Attention Fusion Network for Multimodal Skin Cancer Diagnosis
He, Xiaoyu
Wang, Yong
Zhao, Shuang
Chen, Xiang
PATTERN RECOGNITION, 2023, 133
[28] Gated Spatio-Temporal Attention-Guided Video Deblurring
Suin, Maitreya
Rajagopalan, A. N.
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7798 - 7807
[29] Video Captioning via Sentence Augmentation and Spatio-Temporal Attention
Chen, Tseng-Hung
Zeng, Kuo-Hao
Hsu, Wan-Ting
Sun, Min
COMPUTER VISION - ACCV 2016 WORKSHOPS, PT I, 2017, 10116 : 269 - 286
[30] Automatic video summarization driven by a spatio-temporal attention model
Barland, R.
Saadane, A.
HUMAN VISION AND ELECTRONIC IMAGING XIII, 2008, 6806

← 1 2 3 4 5 →