Spatio-temporal co-attention fusion network for video splicing localization

被引:0
|
作者
Lin, Man [1 ,2 ]
Cao, Gang [1 ,2 ]
Lou, Zijie [1 ,2 ]
Zhang, Chi [1 ,2 ]
机构
[1] Commun Univ China, Sch Comp & Cyber Sci, Beijing, Peoples R China
[2] Commun Univ China, State Key Lab Media Convergence & Commun, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
video forensics; digital video forgery; video splicing localization; co-attention; MANIPULATION;
D O I
10.1117/1.JEI.33.3.033027
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
. Digital video splicing has become easy and ubiquitous. Malicious users copy some regions of a video and paste them into another video to create realistic forgeries. It is important to blindly detect such forgery regions in videos. A spatio-temporal co-attention fusion network (SCFNet) is proposed for video splicing localization. Specifically, a three-stream network is used as an encoder to capture manipulation traces across multiple frames. The deep interaction and fusion of spatio-temporal forensic features are achieved by the novel parallel and cross co-attention fusion modules. A lightweight multilayer perceptron decoder is adopted to yield a pixel-level tampering localization map. A new large-scale video splicing dataset is created for training the SCFNet. Extensive tests on benchmark datasets show that the localization and generalization performances of our SCFNet outperform the state-of-the-art. Code and datasets are available at https://github.com/multimediaFor/SCFNet.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Video modeling by spatio-temporal resampling and Bayesian fusion
    Zheng, Yunfei
    Li, Xin
    2007 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-7, 2007, : 3201 - 3204
  • [22] Spatio-Temporal Self-Attention Network for Fire Detection and Segmentation in Video Surveillance
    Shahid, Mohammad
    Virtusio, John Jethro
    Wu, Yu-Hsien
    Chen, Yung-Yao
    Tanveer, M.
    Muhammad, Khan
    Hua, Kai-Lung
    IEEE ACCESS, 2022, 10 : 1259 - 1275
  • [23] PASTFNet: a paralleled attention spatio-temporal fusion network for micro-expression recognition
    Tian, Haichen
    Gong, Weijun
    Li, Wei
    Qian, Yurong
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2024, 62 (06) : 1911 - 1924
  • [24] PASTFNet: a paralleled attention spatio-temporal fusion network for micro-expression recognition
    Haichen Tian
    Weijun Gong
    Wei Li
    Yurong Qian
    Medical & Biological Engineering & Computing, 2024, 62 : 1911 - 1924
  • [25] HASTF: a hybrid attention spatio-temporal feature fusion network for EEG emotion recognition
    Hu, Fangzhou
    Wang, Fei
    Bi, Jinying
    An, Zida
    Chen, Chao
    Qu, Gangguo
    Han, Shuai
    FRONTIERS IN NEUROSCIENCE, 2024, 18
  • [26] COMatchNet: Co-Attention Matching Network for Video Object Segmentation
    Huang, Lufei
    Sun, Fengming
    Yuan, Xia
    PATTERN RECOGNITION, ACPR 2021, PT I, 2022, 13188 : 271 - 284
  • [27] Co-Attention Fusion Network for Multimodal Skin Cancer Diagnosis
    He, Xiaoyu
    Wang, Yong
    Zhao, Shuang
    Chen, Xiang
    PATTERN RECOGNITION, 2023, 133
  • [28] Gated Spatio-Temporal Attention-Guided Video Deblurring
    Suin, Maitreya
    Rajagopalan, A. N.
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7798 - 7807
  • [29] Video Captioning via Sentence Augmentation and Spatio-Temporal Attention
    Chen, Tseng-Hung
    Zeng, Kuo-Hao
    Hsu, Wan-Ting
    Sun, Min
    COMPUTER VISION - ACCV 2016 WORKSHOPS, PT I, 2017, 10116 : 269 - 286
  • [30] Automatic video summarization driven by a spatio-temporal attention model
    Barland, R.
    Saadane, A.
    HUMAN VISION AND ELECTRONIC IMAGING XIII, 2008, 6806