A Multi-Stage Deep Learning Approach Incorporating Text-Image and Image-Image Comparisons for Cheapfake Detection

被引:1
|
作者
Seo, Jangwon [1 ]
Hwang, Hyo-Seok [1 ]
Lee, Jiyoung [2 ]
Lee, Minhyeok [3 ]
Kim, Wonsuk [2 ]
Seok, Junhee [1 ]
机构
[1] Korea Univ, Sch Elect Engn, Seoul, South Korea
[2] Safe AI, Seoul, South Korea
[3] Chung Ang Univ, Sch Elect & Elect Engn, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
Cheapfakes; Misinformation; Out-of-context; BERT; Stable Diffusion; Ground Image Captioning; Semantic Textual Similarity;
D O I
10.1145/3652583.3657601
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The advancement of multimedia and artificial intelligence (AI) technologies has dismantled the barriers of information sharing, yet it has also ushered in a double-edged sword: a surge in the spread of fake information. In this context, there is a growing need for research on the detection of 'cheapfakes,' which are low-cost fake media, known for their ease of creation. This paper proposes a multi-stage deep learning process designed to effectively detect the diverse and rapidly evolving nature of cheapfakes. A singlestep deep learning model faces limitations in distinguishing various types of cheapfakes, necessitating the application of a complex deep learning model approach to detect subtle Out-of-Context (OOC) phenomena. This study employs models based on Bidirectional Encoder Representations from Transformers (BERT) and stable diffusion technologies to approach cheapfake detection. Through the ACM ICMR 2024 challenge, the performance of this model was evaluated on a real dataset, achieving an accuracy of 71.9% in Task 1, an improvement of 7% over previous methods, and an accuracy of 55.7% in Task 2. These results are expected to make a significant contribution to the development of strategies for creating and countering cheapfakes. Additionally, this research aims to contribute to the detection of OOC media misuse through this challenge.
引用
收藏
页码:1312 / 1316
页数:5
相关论文
共 50 条
  • [31] A Multi-Stage Transformer Network for Image Dehazing Based on Contrastive Learning
    Gao F.
    Ji S.
    Guo J.
    Hou J.
    Ouyang C.
    Yang B.
    Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2023, 57 (01): : 195 - 210
  • [32] Watermarking for multi-stage encoded image authentication
    Tsai, Y-S
    Tsai, P.
    Hu, Y-C
    IMAGING SCIENCE JOURNAL, 2013, 61 (02): : 65 - 79
  • [33] Adaptive hypergraph learning with multi-stage optimizations for image and tag recommendation
    Karantaidis, Georgios
    Sarridis, Ioannis
    Kotropoulos, Constantine
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 97
  • [34] Leverage Boosting and Transformer on Text-Image Matching for Cheap Fakes Detection
    Tuan-Vinh La
    Dao, Minh-Son
    Le, Duy-Dong
    Thai, Kim-Phung
    Nguyen, Quoc-Hung
    Phan-Thi, Thuy-Kieu
    ALGORITHMS, 2022, 15 (11)
  • [35] A Tale of a Deep Learning Approach to Image Forgery Detection
    Majumder, Md. Taksir Hasan
    Al Islam, A. B. M. Alim
    PROCEEDINGS OF 2018 5TH INTERNATIONAL CONFERENCE ON NETWORKING, SYSTEMS AND SECURITY (NSYSS), 2018, : 102 - 110
  • [36] A Novel Deep Learning Approach for Deepfake Image Detection
    Raza, Ali
    Munir, Kashif
    Almutairi, Mubarak
    APPLIED SCIENCES-BASEL, 2022, 12 (19):
  • [37] Image Region Forgery Detection: A Deep Learning Approach
    Zhang, Ying
    Goh, Jonathan
    Win, Lei Lei
    Thing, Vrizlynn
    PROCEEDINGS OF THE SINGAPORE CYBER-SECURITY CONFERENCE (SG-CRC) 2016: CYBER-SECURITY BY DESIGN, 2016, 14 : 1 - 11
  • [38] The Importance of the Depth for Text-Image Selection Strategy in Learning-To-Rank
    Buffoni, David
    Tollari, Sabrina
    Gallinari, Patrick
    ADVANCES IN INFORMATION RETRIEVAL, 2011, 6611 : 743 - 746
  • [39] A multi-stage data augmentation approach for imbalanced samples in image recognition
    Wang, Ruo-Bin
    An, Zhi-Wei
    Wang, Wei-Feng
    Yin, Shuo
    Xu, Lin
    Wang, Ruo-Bin, 1600, Taiwan Ubiquitous Information (06): : 94 - 106
  • [40] Text-image multimodal fusion model for enhanced fake news detection
    Lin, Szu-Yin
    Chen, Yen-Chiu
    Chang, Yu-Han
    Lo, Shih-Hsin
    Chao, Kuo-Ming
    SCIENCE PROGRESS, 2024, 107 (04)