CTNet: hybrid architecture based on CNN and transformer for image inpainting detection

被引:3
|
作者
Xiao, Fengjun [1 ]
Zhang, Zhuxi [2 ]
Yao, Ye [2 ]
机构
[1] Hangzhou Dianzi Univ, Zhejiang Informatizat Dev Inst, Xiasha Higher Educ Zone, Hangzhou 310018, Zhejiang, Peoples R China
[2] Hangzhou Dianzi Univ, Sch Cyberspace, Xiasha Higher Educ Zone, Hangzhou 310018, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Image inpainting detection; Deep neural network; Hybrid CNN-Transformer encoder; High-pass filter; DIFFUSION; NETWORK;
D O I
10.1007/s00530-023-01184-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Digital image inpainting technology has increasingly gained popularity as a result of the development of image processing and machine vision. However, digital image inpainting can be used not only to repair damaged photographs, but also to remove specific people or distort the semantic content of images. To address the issue of image inpainting forgeries, a hybrid CNN-Transformer Network (CTNet), which is composed of the hybrid CNN-Transformer encoder, the feature enhancement module, and the decoder module, is proposed for image inpainting detection and localization. Different from existing inpainting detection methods that rely on hand-crafted attention mechanisms, the hybrid CNN-Transformer encoder employs CNN as a feature extractor to build feature maps tokenized as the input patches of the Transformer encoder. The hybrid structure exploits the innate global self-attention mechanisms of Transformer and can effectively capture the long-term dependency of the image. Since inpainting traces mainly exist in the high-frequency components of digital images, the feature enhancement module performs feature extraction in the frequency domain. The decoder regularizes the upsampling process of the predicted masks with the assistance of high-frequency features. We investigate the generalization capacity of our CTNet on datasets generated by ten commonly used inpainting methods. The experimental results show that the proposed model can detect a variety of unknown inpainting operations after being trained on the datasets generated by a single inpainting method.
引用
下载
收藏
页码:3819 / 3832
页数:14
相关论文
共 50 条
  • [1] CTNet: hybrid architecture based on CNN and transformer for image inpainting detection
    Fengjun Xiao
    Zhuxi Zhang
    Ye Yao
    Multimedia Systems, 2023, 29 (6) : 3819 - 3832
  • [2] Bidirectional interaction of CNN and Transformer for image inpainting
    Liu, Jialu
    Gong, Maoguo
    Gao, Yuan
    Lu, Yiheng
    Li, Hao
    KNOWLEDGE-BASED SYSTEMS, 2024, 299
  • [3] A transformer–CNN for deep image inpainting forensics
    Xinshan Zhu
    Junyan Lu
    Honghao Ren
    Hongquan Wang
    Biao Sun
    The Visual Computer, 2023, 39 : 4721 - 4735
  • [4] IMIHCT: improved multi-stage image inpainting with hybrid CNN and transformer
    Tao Ning
    Xingfang Wang
    Hongwei Ding
    Pattern Analysis and Applications, 2025, 28 (1)
  • [5] A transformer-CNN for deep image inpainting forensics
    Zhu, Xinshan
    Lu, Junyan
    Ren, Honghao
    Wang, Hongquan
    Sun, Biao
    VISUAL COMPUTER, 2023, 39 (10): : 4721 - 4735
  • [6] TransCNN-HAE: Transformer-CNN Hybrid AutoEncoder for Blind Image Inpainting
    Zhao, Haoru
    Gu, Zhaorui
    Zheng, Bing
    Zheng, Haiyong
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 6813 - 6821
  • [7] A CNN-Transformer Hybrid Model Based on CSWin Transformer for UAV Image Object Detection
    Lu, Wanjie
    Lan, Chaozhen
    Niu, Chaoyang
    Liu, Wei
    Lyu, Liang
    Shi, Qunshan
    Wang, Shiju
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 1211 - 1231
  • [8] CNN-Transformer Hybrid Architecture for Early Fire Detection
    Yang, Chenyue
    Pan, Yixuan
    Cao, Yichao
    Lu, Xiaobo
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 570 - 581
  • [9] Weak Appearance Aware Pipeline Leak Detection based on CNN-Transformer Hybrid Architecture
    Zhang, Bulin
    Yuan, Haiwen
    Ge, Jie
    Cheng, Li
    Li, Xuan
    Xiao, Changshi
    IEEE Transactions on Instrumentation and Measurement, 2024,
  • [10] HyFormer: a hybrid transformer-CNN architecture for retinal OCT image segmentation
    Jiang, Qingxin
    Fan, Ying
    Li, Menghan
    Fang, Sheng
    Zhu, Weifang
    Xiang, Dehui
    Peng, Tao
    Chen, Xinjian
    Xu, Xun
    Shi, Fei
    Biomedical Optics Express, 2024, 15 (11) : 6156 - 6170