AI-Generated Image Detection using a Cross-Attention Enhanced Dual-Stream Network

被引:0
|
作者
Xi, Ziyi [1 ]
Huang, Wenmin [1 ]
Wei, Kangkang [1 ]
Luo, Weiqi [1 ]
Zheng, Peijia [1 ]
机构
[1] Sun Yat Sen Univ, GuangDong Prov Key Lab Informat Secur Technol, Guangzhou, Peoples R China
关键词
D O I
10.1109/APSIPAASC58517.2023.10317126
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid evolution of AI Generated Content (AIGC), forged images produced through this technology are inherently more deceptive and require less human intervention compared to traditional Computer-generated Graphics (CG). However, owing to the disparities between CG and AIGC, conventional CG detection methods tend to be inadequate in identifying AIGC-produced images. To address this issue, our research concentrates on the text-to-image generation process in AIGC. Initially, we first assemble two text-to-image databases utilizing two distinct AI systems, DALL center dot E2 and DreamStudio. Aiming to holistically capture the inherent anomalies produced by AIGC, we develope a robust dual-stream network comprised of a residual stream and a content stream. The former employs the Spatial Rich Model (SRM) to meticulously extract various texture information from images, while the latter seeks to capture additional forged traces in low frequency, thereby extracting complementary information that the residual stream may overlook. To enhance the information exchange between these two streams, we incorporate a cross multi-head attention mechanism. Numerous comparative experiments are performed on both databases, and the results show that our detection method consistently outperforms traditional CG detection techniques across a range of image resolutions. Moreover, our method exhibits superior performance through a series of robustness tests and cross-database experiments. When applied to widely recognized traditional CG benchmarks such as SPL2018 and DsTok, our approach significantly exceeds the capabilities of other existing methods in the field of CG detection.
引用
收藏
页码:1463 / 1470
页数:8
相关论文
共 50 条
  • [1] DSCA: A dual-stream network with cross-attention on whole-slide image pyramids for cancer prognosis
    Liu, Pei
    Fu, Bo
    Ye, Feng
    Yang, Rui
    Ji, Luping
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 227
  • [2] Speech Emotion Recognition Using Dual-Stream Representation and Cross-Attention Fusion
    Yu, Shaode
    Meng, Jiajian
    Fan, Wenqing
    Chen, Ye
    Zhu, Bing
    Yu, Hang
    Xie, Yaoqin
    Sun, Qiuirui
    ELECTRONICS, 2024, 13 (11)
  • [3] Dual-Stream Attention Network for Hyperspectral Image Unmixing
    School of Computer Science and Technology, Ocean University of China, Qingdao
    266100, China
    不详
    266100, China
    arXiv,
  • [4] Dual-Stream Attention Network for Hyperspectral Image Unmixing
    Wang, Yufang
    Wu, Wenmin
    Qi, Lin
    Gao, Feng
    International Geoscience and Remote Sensing Symposium (IGARSS), 2024, : 9438 - 9441
  • [5] Dual-Stream Discriminative Attention Network for Cross-Scene Hyperspectral Image Classification
    Wang, Chenglong
    Guo, Yi
    Fu, Jiaojiao
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [6] Dual-stream Self-attention Network for Image Captioning
    Wan, Boyang
    Jiang, Wenhui
    Fang, Yuming
    Wen, Wenying
    Liu, Hantao
    2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2022,
  • [7] Intersection-union dual-stream cross-attention Lova-SwinUnet for skin cancer hair segmentation and image repair
    Qin, Juanjuan
    Pei, Dong
    Guo, Qian
    Cai, Xingjuan
    Xie, Liping
    Zhang, Wensheng
    Computers in Biology and Medicine, 2024, 180
  • [8] Infrared image fault diagnosis based on dual-stream attention convolution network
    Lu, Dong
    Yang, Jing
    Ming, Lyu
    Zhang, Jie
    ENGINEERING RESEARCH EXPRESS, 2024, 6 (02):
  • [9] A Frequency Attention-Based Dual-Stream Network for Image Inpainting Forensics
    Wang, Hongquan
    Zhu, Xinshan
    Ren, Chao
    Zhang, Lan
    Ma, Shugen
    MATHEMATICS, 2023, 11 (12)
  • [10] Dual-stream network with complementary fusion and hierarchical attention for image tampering localization
    Zhanpeng Mao
    Tongwei Lu
    Signal, Image and Video Processing, 2025, 19 (3)