Turning captchas against humanity: Captcha-based attacks in online social media

被引:1
|
作者
Conti, Mauro [1 ]
Pajola, Luca [1 ]
Tricomi, Pier Paolo [1 ]
机构
[1] Univ Padua, Padua, Italy
来源
关键词
Online social networks; Automatic content moderator; Adversarial machine learning; Hate speech; Cybersecurity; Instagram; Obfuscation techniques; RECOGNITION;
D O I
10.1016/j.osnem.2023.100252
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, people generate and share massive amounts of content on online platforms (e.g., social networks, blogs). In 2021, the 1.9 billion daily active Facebook users posted around 150 thousand photos every minute. Content moderators constantly monitor these online platforms to prevent the spreading of inappropriate content (e.g., hate speech, nudity images). Based on deep learning (DL) advances, Automatic Content Moderators (ACM) help human moderators handle high data volume. Despite their advantages, attackers can exploit weaknesses of DL components (e.g., preprocessing, model) to affect their performance. Therefore, an attacker can leverage such techniques to spread inappropriate content by evading ACM. In this work, we analyzed 4600 potentially toxic Instagram posts, and we discovered that 44% of them adopt obfuscations that might undermine ACM. As these posts are reminiscent of captchas (i.e., not understandable by automated mechanisms), we coin this threat as Captcha Attack ( CAPA ). Our contributions start by proposing a CAPA taxonomy to better understand how ACM is vulnerable to obfuscation attacks. We then focus on the broad sub-category of CAPA using textual Captcha Challenges, namely CC-CAPA, and we empirically demonstrate that it evades real-world ACM (i.e., Amazon, Google, Microsoft) with 100% accuracy. Our investigation revealed that ACM failures are caused by the OCR text extraction phase. The training of OCRs to withstand such obfuscation is therefore crucial, but huge amounts of data are required. Thus, we investigate methods to identify CC-CAPA samples from large sets of data (originated by three OSN - Pinterest, Twitter, Yahoo-Flickr), and we empirically demonstrate that supervised techniques identify target styles of samples almost perfectly. Unsupervised solutions, on the other hand, represent a solid methodology for inspecting uncommon data to detect new obfuscation techniques.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] CAPTCHA-based DDoS Defense System of Call Centers against Zombie Smart-Phone
    Jung, Seung Wook
    INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2012, 6 (03): : 29 - 36
  • [2] Is Puzzle-Based CAPTCHA Secure Against Attacks Based on CNN?
    Terada, Kenta
    Okabe, Yasuo
    Matsumoto, Yoshinori
    2023 INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING, ICOIN, 2023, : 358 - 362
  • [3] Securing Online Social Networks against Bad bots based on a Necklace CAPTCHA Approach
    Torkyl, Mohamed
    Meligy, Ali
    Ibrahim, Hani
    ICENCO 2016 - 2016 12TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO) - BOUNDLESS SMART SOCIETIES, 2016, : 158 - 163
  • [4] Developing an Empirical Algorithm for Protecting Text-based CAPTCHAs against Segmentation Attacks
    Pan, Lei
    Zhou, Yan
    2013 12TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2013), 2013, : 636 - 643
  • [5] Decoding the animated text-based captchas to verify their robustness against automated attacks
    Arain, Rafaqat Hussain
    Shaikh, Riaz Ahmed
    Shah, Safdar Ali
    Shah, Sajjad Ali
    Rafique, Saima
    Ansari, Ahmed Masood
    MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2023, 42 (04) : 164 - 183
  • [6] Is image-based CAPTCHA secure against attacks based on machine learning? An experimental study
    Alqahtani, Fatmah H.
    Alsulaiman, Fawaz A.
    COMPUTERS & SECURITY, 2020, 88
  • [8] All about uncertainties and traps: Statistical oracle-based attacks on a new CAPTCHA protection against oracle attacks
    Javier Hernandez-Castro, Carlos
    Li, Shujun
    R-Moreno, Maria D.
    COMPUTERS & SECURITY, 2020, 92
  • [9] Generating-Based Attacks to Online Social Networks
    Gao, Tianchong
    Bian, Yucheng
    Li, Feng
    Sundar, Agnideven Palanisamy
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024,
  • [10] Online musicking for humanity: the role of imagined listening and the moral economies of music sharing on social media
    Campos Valverde, Raquel
    POPULAR MUSIC, 2022, 41 (02) : 194 - 215