Turning captchas against humanity: Captcha-based attacks in online social media

被引：1

作者：

Conti, Mauro ^{[1
]}

Pajola, Luca ^{[1
]}

Tricomi, Pier Paolo ^{[1
]}

机构：

[1] Univ Padua, Padua, Italy

来源：

ONLINE SOCIAL NETWORKS AND MEDIA | 2023年 / 36卷

关键词：

Online social networks; Automatic content moderator; Adversarial machine learning; Hate speech; Cybersecurity; Instagram; Obfuscation techniques; RECOGNITION;

D O I：

10.1016/j.osnem.2023.100252

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Nowadays, people generate and share massive amounts of content on online platforms (e.g., social networks, blogs). In 2021, the 1.9 billion daily active Facebook users posted around 150 thousand photos every minute. Content moderators constantly monitor these online platforms to prevent the spreading of inappropriate content (e.g., hate speech, nudity images). Based on deep learning (DL) advances, Automatic Content Moderators (ACM) help human moderators handle high data volume. Despite their advantages, attackers can exploit weaknesses of DL components (e.g., preprocessing, model) to affect their performance. Therefore, an attacker can leverage such techniques to spread inappropriate content by evading ACM. In this work, we analyzed 4600 potentially toxic Instagram posts, and we discovered that 44% of them adopt obfuscations that might undermine ACM. As these posts are reminiscent of captchas (i.e., not understandable by automated mechanisms), we coin this threat as Captcha Attack ( CAPA ). Our contributions start by proposing a CAPA taxonomy to better understand how ACM is vulnerable to obfuscation attacks. We then focus on the broad sub-category of CAPA using textual Captcha Challenges, namely CC-CAPA, and we empirically demonstrate that it evades real-world ACM (i.e., Amazon, Google, Microsoft) with 100% accuracy. Our investigation revealed that ACM failures are caused by the OCR text extraction phase. The training of OCRs to withstand such obfuscation is therefore crucial, but huge amounts of data are required. Thus, we investigate methods to identify CC-CAPA samples from large sets of data (originated by three OSN - Pinterest, Twitter, Yahoo-Flickr), and we empirically demonstrate that supervised techniques identify target styles of samples almost perfectly. Unsupervised solutions, on the other hand, represent a solid methodology for inspecting uncommon data to detect new obfuscation techniques.

引用

页数：14

共 50 条

[1] CAPTCHA-based DDoS Defense System of Call Centers against Zombie Smart-Phone
Jung, Seung Wook
INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2012, 6 (03): : 29 - 36
[2] Is Puzzle-Based CAPTCHA Secure Against Attacks Based on CNN?
Terada, Kenta
Okabe, Yasuo
Matsumoto, Yoshinori
2023 INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING, ICOIN, 2023, : 358 - 362
[3] Securing Online Social Networks against Bad bots based on a Necklace CAPTCHA Approach
Torkyl, Mohamed
Meligy, Ali
Ibrahim, Hani
ICENCO 2016 - 2016 12TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO) - BOUNDLESS SMART SOCIETIES, 2016, : 158 - 163
[4] Developing an Empirical Algorithm for Protecting Text-based CAPTCHAs against Segmentation Attacks
Pan, Lei
Zhou, Yan
2013 12TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2013), 2013, : 636 - 643
[5] Decoding the animated text-based captchas to verify their robustness against automated attacks
Arain, Rafaqat Hussain
Shaikh, Riaz Ahmed
Shah, Safdar Ali
Shah, Sajjad Ali
Rafique, Saima
Ansari, Ahmed Masood
MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2023, 42 (04) : 164 - 183
[6] Is image-based CAPTCHA secure against attacks based on machine learning? An experimental study
Alqahtani, Fatmah H.
Alsulaiman, Fawaz A.
COMPUTERS & SECURITY, 2020, 88
[7] Social media attacks against female Canadian journalists
Al-Rawi, Ahmed
FRONTIERS IN COMMUNICATION, 2023, 8
[8] All about uncertainties and traps: Statistical oracle-based attacks on a new CAPTCHA protection against oracle attacks
Javier Hernandez-Castro, Carlos
Li, Shujun
R-Moreno, Maria D.
COMPUTERS & SECURITY, 2020, 92
[9] Generating-Based Attacks to Online Social Networks
Gao, Tianchong
Bian, Yucheng
Li, Feng
Sundar, Agnideven Palanisamy
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024,
[10] Online musicking for humanity: the role of imagined listening and the moral economies of music sharing on social media
Campos Valverde, Raquel
POPULAR MUSIC, 2022, 41 (02) : 194 - 215

← 1 2 3 4 5 →