Multi-scale Target-Aware Framework for Constrained Image Splicing Detection and Localization

被引:1
|
作者
Tan, Yuxuan [1 ]
Li, Yuanman [1 ]
Zeng, Limin [1 ]
Ye, Jiaxiong [1 ]
Wang, Wei [2 ]
Li, Xia [1 ]
机构
[1] Shenzhen Univ, Coll Elect & Informat Engn, Guangdong Key Lab Intelligent Informat Proc, Shenzhen, Peoples R China
[2] Shenzhen MSU BIT Univ, Dept Engn, Shenzhen, Peoples R China
关键词
image forensics; constrained image splicing detection and localization; Transformer; attention; FORGERY;
D O I
10.1145/3581783.3613763
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Constrained image splicing detection and localization (CISDL) is a fundamental task of multimedia forensics, which detects splicing operation between two suspected images and localizes the spliced region on both images. Recent works regard it as a deep matching problem and have made significant progress. However, existing frameworks typically perform feature extraction and correlation matching as separate processes, which may hinder the model's ability to learn discriminative features for matching and can be susceptible to interference from ambiguous background pixels. In this work, we propose a multi-scale target-aware framework to couple feature extraction and correlation matching in a unified pipeline. In contrast to previous methods, we design a target-aware attention mechanism that jointly learns features and performs correlation matching between the probe and donor images. Our approach can effectively promote the collaborative learning of related patches, and perform mutual promotion of feature learning and correlation matching. Additionally, in order to handle scale transformations, we introduce a multi-scale projection method, which can be readily integrated into our target-aware framework that enables the attention process to be conducted between tokens containing information of varying scales. Our experiments demonstrate that our model, which uses a unified pipeline, outperforms state-of-the-art methods on several benchmark datasets and is robust against scale transformations.
引用
下载
收藏
页码:8790 / 8798
页数:9
相关论文
共 50 条
  • [31] Unsupervised segmentation of noisy image in a multi-scale framework
    Zhang, YB
    Ma, S
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 905 - 909
  • [32] Multi-scale Deep Learning for Gesture Detection and Localization
    Neverova, Natalia
    Wolf, Christian
    Taylor, Graham W.
    Nebout, Florian
    COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 : 474 - 490
  • [33] Multi-scale volumes for deep object detection and localization
    Ohn-Bar, Eshed
    Trivedi, Mohan Manubhai
    PATTERN RECOGNITION, 2017, 61 : 557 - 572
  • [34] Tiny Target Detection Based on Multi-scale Attention
    Gao, Shurun
    Liu, Chang
    Zhang, Haimiao
    Qiu, Jun
    Optics InfoBase Conference Papers, 2022,
  • [35] A multi-scale feature fusion target detection algorithm
    Dong, Chong
    Li, Jingmei
    Wang, Jiaxiang
    2018 INTERNATIONAL CONFERENCE ON IMAGE AND VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE, 2018, 10836
  • [36] Adversarial Learning for Constrained Image Splicing Detection and Localization Based on Atrous Convolution
    Liu, Yaqi
    Zhu, Xiaobin
    Zhao, Xianfeng
    Cao, Yun
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2019, 14 (10) : 2551 - 2566
  • [37] Depth-Constrained Network for Multi-Scale Object Detection
    Liu, Guohua
    Li, Yijun
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (10)
  • [38] MPCaD: a multi-scale radiomics-driven framework for automated prostate cancer localization and detection
    Farzad Khalvati
    Junjie Zhang
    Audrey G. Chung
    Mohammad Javad Shafiee
    Alexander Wong
    Masoom A. Haider
    BMC Medical Imaging, 18
  • [39] MPCaD: a multi-scale radiomics-driven framework for automated prostate cancer localization and detection
    Khalvati, Farzad
    Zhang, Junjie
    Chung, Audrey G.
    Shafiee, Mohammad Javad
    Wong, Alexander
    Haider, Masoom A.
    BMC MEDICAL IMAGING, 2018, 18
  • [40] Target-Aware Camera Placement for Large-Scale Video Surveillance
    Wu, Hongxin
    Zeng, Qinghou
    Guo, Chen
    Zhao, Tiesong
    Wen Chen, Chang
    IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34 (12) : 13338 - 13348