A Multi-Stage Deep Learning Approach Incorporating Text-Image and Image-Image Comparisons for Cheapfake Detection

被引:1
|
作者
Seo, Jangwon [1 ]
Hwang, Hyo-Seok [1 ]
Lee, Jiyoung [2 ]
Lee, Minhyeok [3 ]
Kim, Wonsuk [2 ]
Seok, Junhee [1 ]
机构
[1] Korea Univ, Sch Elect Engn, Seoul, South Korea
[2] Safe AI, Seoul, South Korea
[3] Chung Ang Univ, Sch Elect & Elect Engn, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
Cheapfakes; Misinformation; Out-of-context; BERT; Stable Diffusion; Ground Image Captioning; Semantic Textual Similarity;
D O I
10.1145/3652583.3657601
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The advancement of multimedia and artificial intelligence (AI) technologies has dismantled the barriers of information sharing, yet it has also ushered in a double-edged sword: a surge in the spread of fake information. In this context, there is a growing need for research on the detection of 'cheapfakes,' which are low-cost fake media, known for their ease of creation. This paper proposes a multi-stage deep learning process designed to effectively detect the diverse and rapidly evolving nature of cheapfakes. A singlestep deep learning model faces limitations in distinguishing various types of cheapfakes, necessitating the application of a complex deep learning model approach to detect subtle Out-of-Context (OOC) phenomena. This study employs models based on Bidirectional Encoder Representations from Transformers (BERT) and stable diffusion technologies to approach cheapfake detection. Through the ACM ICMR 2024 challenge, the performance of this model was evaluated on a real dataset, achieving an accuracy of 71.9% in Task 1, an improvement of 7% over previous methods, and an accuracy of 55.7% in Task 2. These results are expected to make a significant contribution to the development of strategies for creating and countering cheapfakes. Additionally, this research aims to contribute to the detection of OOC media misuse through this challenge.
引用
收藏
页码:1312 / 1316
页数:5
相关论文
共 50 条
  • [1] Enhancing Cheapfake Detection: An Approach Using Prompt Engineering and Interleaved Text-Image Model
    Vu, Dang
    Nguyen, Minh-Nhat
    Nguyen, Quoc-Trung
    PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 1306 - 1311
  • [2] Deep image compression with multi-stage representation*
    Wang, Zixi
    Ding, Guiguang
    Han, Jungong
    Li, Fan
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 79
  • [3] TimNet: A text-image matching network integrating multi-stage feature extraction with multi-scale metrics
    Zheng, Xiaoqi
    Tao, Yingfan
    Zhang, Ruikai
    Yang, Wenming
    Liao, Qingmin
    NEUROCOMPUTING, 2021, 465 : 540 - 548
  • [4] Learning to Generate Semantic Layouts for Higher Text-Image Correspondence in Text-to-Image Synthesis
    Park, Minho
    Yun, Jooyeol
    Choi, Seunghwan
    Choo, Jaegul
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 7557 - 7566
  • [5] Automated Generation of Chinese Text-Image Summaries Using Deep Learning Techniques
    Xu, Meiling
    Abd Rahman, Hayati
    Li, Feng
    TRAITEMENT DU SIGNAL, 2023, 40 (06) : 2835 - 2843
  • [6] A Learning to Rank framework applied to text-image retrieval
    David Buffoni
    Sabrina Tollari
    Patrick Gallinari
    Multimedia Tools and Applications, 2012, 60 : 161 - 180
  • [7] A Learning to Rank framework applied to text-image retrieval
    Buffoni, David
    Tollari, Sabrina
    Gallinari, Patrick
    MULTIMEDIA TOOLS AND APPLICATIONS, 2012, 60 (01) : 161 - 180
  • [8] Text-Image Theory: A New Approach to Literary Semiotics
    Yuping, Li
    FORUM FOR WORLD LITERATURE STUDIES, 2022, 14 (02): : 357 - 365
  • [9] Text Detection for Dust Image Based on Deep Learning
    Liu, Hao
    Li, Ce
    Jia, Shengze
    Zhang, Dong
    PROCEEDINGS 2018 33RD YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2018, : 754 - 759
  • [10] Text-image matching for multi-model machine translation
    Xiayang Shi
    Zhenqiang Yu
    Xuhui Wang
    Yijun Li
    Yufeng Niu
    The Journal of Supercomputing, 2023, 79 : 17810 - 17823