AI-Generated Image Detection using a Cross-Attention Enhanced Dual-Stream Network

Cited by: 0
Authors
Xi, Ziyi [1]
Huang, Wenmin [1]
Wei, Kangkang [1]
Luo, Weiqi [1]
Zheng, Peijia [1]
Affiliations
[1] Sun Yat Sen Univ, GuangDong Prov Key Lab Informat Secur Technol, Guangzhou, Peoples R China
Keywords
DOI
10.1109/APSIPAASC58517.2023.10317126
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
With the rapid evolution of AI-Generated Content (AIGC), forged images produced with this technology are more deceptive and require less human intervention than traditional Computer-generated Graphics (CG). Owing to the disparities between CG and AIGC, however, conventional CG detection methods tend to be inadequate for identifying AIGC-produced images. To address this issue, our research concentrates on the text-to-image generation process in AIGC. We first assemble two text-to-image databases using two distinct AI systems, DALL·E 2 and DreamStudio. To holistically capture the inherent anomalies produced by AIGC, we develop a robust dual-stream network comprising a residual stream and a content stream. The former employs the Spatial Rich Model (SRM) to extract various texture information from images, while the latter seeks to capture additional forgery traces at low frequencies, thereby extracting complementary information that the residual stream may overlook. To enhance the information exchange between the two streams, we incorporate a cross multi-head attention mechanism. Numerous comparative experiments are performed on both databases, and the results show that our detection method consistently outperforms traditional CG detection techniques across a range of image resolutions. Moreover, our method exhibits superior performance in a series of robustness tests and cross-database experiments. When applied to widely recognized traditional CG benchmarks such as SPL2018 and DsTok, our approach also significantly outperforms existing CG detection methods.
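The two ideas named in the abstract, an SRM-style residual and cross multi-head attention between two streams, can be illustrated with a minimal NumPy sketch. This is not the authors' network. The 5x5 kernel is one commonly used SRM high-pass filter, the raw image stands in for the content stream, random matrices stand in for learned projections, and the names `srm_residual`, `to_tokens`, and `cross_attention` are hypothetical.

```python
import numpy as np

# One commonly used 5x5 SRM high-pass kernel; its coefficients sum to zero,
# so flat image regions produce a zero residual.
SRM_KV = np.array([
    [-1,  2,  -2,  2, -1],
    [ 2, -6,   8, -6,  2],
    [-2,  8, -12,  8, -2],
    [ 2, -6,   8, -6,  2],
    [-1,  2,  -2,  2, -1],
], dtype=np.float64) / 12.0


def srm_residual(img):
    """Filter a 2-D grayscale image with SRM_KV (edge-padded, same size)."""
    h, w = img.shape
    padded = np.pad(img, 2, mode="edge")
    out = np.zeros((h, w), dtype=np.float64)
    for dy in range(5):
        for dx in range(5):
            out += SRM_KV[dy, dx] * padded[dy:dy + h, dx:dx + w]
    return out


def to_tokens(x, patch):
    """Split a 2-D map into non-overlapping patches, one flattened row each."""
    h, w = x.shape
    x = x.reshape(h // patch, patch, w // patch, patch)
    return x.transpose(0, 2, 1, 3).reshape(-1, patch * patch)


def cross_attention(q_tokens, kv_tokens, num_heads, rng):
    """Multi-head attention where queries come from one stream and
    keys/values from the other. Projections are random placeholders
    (assumption: a trained model would learn them)."""
    n_q, d = q_tokens.shape
    n_kv = kv_tokens.shape[0]
    assert d % num_heads == 0
    dh = d // num_heads
    w_q, w_k, w_v = (rng.standard_normal((d, d)) / np.sqrt(d) for _ in range(3))

    def split_heads(t, n):
        return t.reshape(n, num_heads, dh).transpose(1, 0, 2)  # (heads, n, dh)

    q = split_heads(q_tokens @ w_q, n_q)
    k = split_heads(kv_tokens @ w_k, n_kv)
    v = split_heads(kv_tokens @ w_v, n_kv)

    scores = q @ k.transpose(0, 2, 1) / np.sqrt(dh)   # (heads, n_q, n_kv)
    scores -= scores.max(axis=-1, keepdims=True)       # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)     # softmax over keys
    out = weights @ v                                   # (heads, n_q, dh)
    return out.transpose(1, 0, 2).reshape(n_q, d)


rng = np.random.default_rng(0)
image = rng.random((16, 16))                      # stand-in grayscale image
residual_tokens = to_tokens(srm_residual(image), 4)
content_tokens = to_tokens(image, 4)              # raw pixels as "content"
# Each stream queries the other, which is the "cross" part of the exchange.
residual_enriched = cross_attention(residual_tokens, content_tokens, 4, rng)
content_enriched = cross_attention(content_tokens, residual_tokens, 4, rng)
```

In a trained detector both streams would be convolutional feature extractors and the attention outputs would feed a classifier; here the sketch only shows how information from one stream can be injected into the other.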
Pages: 1463-1470
Page count: 8