A Multi-Stage Deep Learning Approach Incorporating Text-Image and Image-Image Comparisons for Cheapfake Detection

Cited by: 1

Authors:

Seo, Jangwon [1]

Hwang, Hyo-Seok [1]

Lee, Jiyoung [2]

Lee, Minhyeok [3]

Kim, Wonsuk [2]

Seok, Junhee [1]

Affiliations:

[1] Korea Univ, Sch Elect Engn, Seoul, South Korea

[2] Safe AI, Seoul, South Korea

[3] Chung Ang Univ, Sch Elect & Elect Engn, Seoul, South Korea

Source:

PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024 | 2024

Funding:

National Research Foundation, Singapore;

Keywords:

Cheapfakes; Misinformation; Out-of-context; BERT; Stable Diffusion; Ground Image Captioning; Semantic Textual Similarity;

DOI:

10.1145/3652583.3657601

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The advancement of multimedia and artificial intelligence (AI) technologies has dismantled the barriers of information sharing, yet it has also ushered in a double-edged sword: a surge in the spread of fake information. In this context, there is a growing need for research on the detection of 'cheapfakes,' which are low-cost fake media, known for their ease of creation. This paper proposes a multi-stage deep learning process designed to effectively detect the diverse and rapidly evolving nature of cheapfakes. A single-step deep learning model faces limitations in distinguishing various types of cheapfakes, necessitating the application of a complex deep learning model approach to detect subtle Out-of-Context (OOC) phenomena. This study employs models based on Bidirectional Encoder Representations from Transformers (BERT) and stable diffusion technologies to approach cheapfake detection. Through the ACM ICMR 2024 challenge, the performance of this model was evaluated on a real dataset, achieving an accuracy of 71.9% in Task 1, an improvement of 7% over previous methods, and an accuracy of 55.7% in Task 2. These results are expected to make a significant contribution to the development of strategies for creating and countering cheapfakes. Additionally, this research aims to contribute to the detection of OOC media misuse through this challenge.

Pages: 1312 - 1316

Page count: 5
