Image manipulation detection and localization using multi-scale contrastive learning

Cited by: 1
Authors
Bai, Ruyi [1]
Affiliations
[1] Shanxi Univ, Coll Automat & Software, Taiyuan 030006, Shanxi, Peoples R China
Keywords
Contrastive learning; Image manipulation detection; SRM; Self-attention
DOI
10.1016/j.asoc.2024.111914
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Current image tampering detection methods rely on various forgery footprints, such as JPEG artifacts and edge inconsistencies, and use algorithms related to image segmentation. However, these methods have several issues, including over-fitting, focusing on only a few specific forgery footprints, and emphasizing semantically relevant information while ignoring tampering traces. This paper proposes a model for image manipulation detection and localization based on multi-scale contrastive learning (MSCL-Net). The model utilizes the differences in feature distributions between tampered and untampered regions to extract comprehensive tampering traces. It uses a dual-stream encoder that incorporates both raw RGB images and SRM noise features. A Feature Cross-Fusion Module (FCFM) is proposed to fuse these features and improve the representation of tampering information. The decoding process uses an Adaptive Self-Attention Module (ASAM) to filter and aggregate relevant context from coarse feature maps. Additionally, a Supervised Contrastive Learning Module (SCLM) is used to widen the difference between tampered and untampered areas. The fused loss function comprises classification loss, segmentation loss, and multi-scale supervised contrastive loss. This improves the network's understanding of global differences, reduces false positives, weakens semantic information, and enhances the model's ability to locate tampered regions of varying sizes. Extensive experiments are conducted across multiple datasets, demonstrating that the model is robust against attacks and resilient to false-positive predictions at both the image level and the pixel level. Furthermore, its overall performance exceeds that of state-of-the-art alternatives in reliably detecting and localizing tampered images.
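The abstract does not give the formulation of the multi-scale supervised contrastive loss, so the following is only a minimal NumPy sketch of the general idea: per-pixel features with the same label (tampered or untampered) are pulled together and features with different labels are pushed apart, and the loss is averaged over several feature-map scales. The function names, the temperature value, the generic SupCon-style formula, and the strided mask downsampling are all assumptions made for illustration, not the paper's SCLM.

```python
import numpy as np

def supervised_contrastive_loss(features, labels, temperature=0.1):
    """Generic SupCon-style loss (assumed form, not taken from the paper).

    features: (N, D) array of per-pixel feature vectors.
    labels:   (N,) array, 1 = tampered pixel, 0 = untampered pixel.
    """
    features = np.asarray(features, dtype=np.float64)
    labels = np.asarray(labels)
    n = labels.shape[0]

    # L2-normalise so the dot product is a cosine similarity.
    norms = np.linalg.norm(features, axis=1, keepdims=True)
    z = features / np.maximum(norms, 1e-12)
    sim = z @ z.T / temperature

    # Exclude each sample's similarity with itself from the softmax.
    not_self = ~np.eye(n, dtype=bool)
    sim = sim - sim.max(axis=1, keepdims=True)  # numerical stability
    exp_sim = np.exp(sim) * not_self
    log_prob = sim - np.log(exp_sim.sum(axis=1, keepdims=True))

    # Positives: other samples that share the anchor's label.
    positives = (labels[:, None] == labels[None, :]) & not_self
    pos_count = positives.sum(axis=1)
    valid = pos_count > 0  # anchors with at least one positive
    mean_log_prob_pos = (log_prob * positives).sum(axis=1)[valid] / pos_count[valid]
    return float(-mean_log_prob_pos.mean())

def multi_scale_contrastive_loss(feature_maps, mask, temperature=0.1):
    """Average the loss over feature maps of shape (H_s, W_s, D).

    Assumes the mask height and width are integer multiples of each
    feature map's height and width (hypothetical simplification).
    """
    losses = []
    for fmap in feature_maps:
        h, w, d = fmap.shape
        step_h, step_w = mask.shape[0] // h, mask.shape[1] // w
        small_mask = mask[::step_h, ::step_w][:h, :w]  # nearest-neighbour downsample
        losses.append(
            supervised_contrastive_loss(
                fmap.reshape(-1, d), small_mask.reshape(-1), temperature
            )
        )
    return float(np.mean(losses))
```

In a multi-loss setup of the kind the abstract describes, a term like this would be added to the classification and segmentation losses with some weighting; the abstract does not state the weights, so none are assumed here.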
Pages: 14