AI-Generated Image Detection using a Cross-Attention Enhanced Dual-Stream Network

Authors
Xi, Ziyi [1]
Huang, Wenmin [1]
Wei, Kangkang [1]
Luo, Weiqi [1]
Zheng, Peijia [1]
Affiliations
[1] Sun Yat-sen University, Guangdong Provincial Key Laboratory of Information Security Technology, Guangzhou, People's Republic of China
DOI
10.1109/APSIPAASC58517.2023.10317126
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
With the rapid evolution of AI-Generated Content (AIGC), forged images produced through this technology are inherently more deceptive and require less human intervention than traditional Computer-Generated Graphics (CG). However, owing to the disparities between CG and AIGC, conventional CG detection methods tend to be inadequate for identifying AIGC-produced images. To address this issue, our research concentrates on the text-to-image generation process in AIGC. We first assemble two text-to-image databases using two distinct AI systems, DALL·E 2 and DreamStudio. Aiming to holistically capture the inherent anomalies produced by AIGC, we develop a robust dual-stream network composed of a residual stream and a content stream. The former employs the Spatial Rich Model (SRM) to extract various texture information from images, while the latter seeks to capture additional forgery traces at low frequencies, thereby extracting complementary information that the residual stream may overlook. To enhance the information exchange between these two streams, we incorporate a cross multi-head attention mechanism. Numerous comparative experiments are performed on both databases, and the results show that our detection method consistently outperforms traditional CG detection techniques across a range of image resolutions. Moreover, our method exhibits superior performance in a series of robustness tests and cross-database experiments. When applied to widely recognized traditional CG benchmarks such as SPL2018 and DsTok, our approach significantly outperforms other existing CG detection methods.
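As a rough illustration of what an SRM-based residual stream starts from, the sketch below applies a single high-pass filter to a grayscale image in plain Python. The abstract does not say which SRM filters, truncation threshold, or preprocessing the authors use, so the 5x5 "KV" kernel, the clipping value of 3.0, and the function name `srm_residual` are assumptions chosen for illustration only; this is not the paper's implementation.

```python
# Illustrative sketch only: one commonly used SRM high-pass kernel (the 5x5
# "KV" kernel). The paper's actual filter set is not given in the abstract.
KV_KERNEL = [
    [-1,  2,  -2,  2, -1],
    [ 2, -6,   8, -6,  2],
    [-2,  8, -12,  8, -2],
    [ 2, -6,   8, -6,  2],
    [-1,  2,  -2,  2, -1],
]
KV_SCALE = 12.0  # normalisation factor conventionally paired with this kernel


def srm_residual(image, truncate=3.0):
    """Return the high-pass residual of a 2-D grayscale image.

    image:    list of equal-length rows of numbers, at least 5x5.
    truncate: residuals are clipped to [-truncate, truncate]; the value 3.0
              is an assumed default, not taken from the paper.
    Only positions where the kernel fits entirely are computed, so the
    output is (height - 4) x (width - 4).
    """
    height, width = len(image), len(image[0])
    k = len(KV_KERNEL)
    residual = []
    for y in range(height - k + 1):
        row = []
        for x in range(width - k + 1):
            acc = 0.0
            for dy in range(k):
                for dx in range(k):
                    acc += KV_KERNEL[dy][dx] * image[y + dy][x + dx]
            value = acc / KV_SCALE
            # Clip so that strong edges do not dominate the residual.
            row.append(max(-truncate, min(truncate, value)))
        residual.append(row)
    return residual


if __name__ == "__main__":
    flat = [[100] * 6 for _ in range(6)]
    # The kernel coefficients sum to zero, so smooth content is suppressed.
    print(srm_residual(flat))
```

Because the kernel coefficients sum to zero, flat image content is suppressed and only local deviations such as noise and fine texture remain, which is the kind of signal a residual stream would operate on before the two streams exchange information through attention.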
Pages: 1463-1470
Page count: 8