Detecting malicious behavior in social platforms via hybrid knowledge- and data-driven systems

被引:6
|
作者
Paredes, Jose N. [1 ,3 ,4 ]
Simari, Gerardo I. [1 ,3 ,4 ,5 ]
Martinez, Maria Vanina [2 ,6 ,7 ]
Falappa, Marcelo A. [1 ,3 ,4 ]
机构
[1] San Andres 800,Campus Palihue, RA-8000 Bahia Blanca, Buenos Aires, Argentina
[2] Pabellon 1,Ciudad Univ,C1428EGA, Buenos Aires, DF, Argentina
[3] Univ Nacl Sur UNS, Dept Comp Sci & Engn, Bahia Blanca, Buenos Aires, Argentina
[4] Inst Comp Sci & Engn UNS CONICET, Bahia Blanca, Buenos Aires, Argentina
[5] Arizona State Univ, Sch Comp Informat & Decis Syst Engn CIDSE, Tempe, AZ 85281 USA
[6] Univ Buenos Aires UBA, Dept Comp Sci, Buenos Aires, DF, Argentina
[7] Inst Comp Sci Res UBA CONICET, Buenos Aires, DF, Argentina
关键词
Malicious behavior; Fake news; Botnets; Social data; Information/misinformation diffusion; Decision support systems; Human-in-the-loop computing;
D O I
10.1016/j.future.2021.06.033
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Among the wide variety of malicious behavior commonly observed in modern social platforms, one of the most notorious is the diffusion of fake news, given its potential to influence the opinions of millions of people who can be voters, consumers, or simply citizens going about their daily lives. In this paper, we implement and carry out an empirical evaluation of a version of the recently-proposed NETDER architecture for hybrid AI decision-support systems with the capability of leveraging the availability of machine learning modules, logical reasoning about unknown objects, and forecasts based on diffusion processes. NETDER is a general architecture for reasoning about different kinds of malicious behavior such as dissemination of fake news, hate speech, and malware, detection of botnet operations, prevention of cyber attacks including those targeting software products or blockchain transactions, among others. Here, we focus on the case of fake news dissemination on social platforms by three different kinds of users: non-malicious, malicious, and botnet members. In particular, we focus on three tasks: (i) determining who is responsible for posting a fake news article, (ii) detecting malicious users, and (iii) detecting which users belong to a botnet designed to disseminate fake news. Given the difficulty of obtaining adequate data with ground truth, we also develop a testbed that combines real-world fake news datasets with synthetically generated networks of users and fully-detailed traces of their behavior throughout a series of time points. We designed our testbed to be customizable for different problem sizes and settings, and make its code publicly available to be used in similar evaluation efforts. Finally, we report on the results of a thorough experimental evaluation of three variants of our model and six environmental settings over the three tasks. Our results clearly show the effects that the quality of knowledge engineering tasks, the quality of the underlying machine learning classifier used to detect fake news, and the specific environmental conditions have on smart policing efforts in social platforms. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:232 / 246
页数:15
相关论文
共 50 条
  • [1] Configurable Interactive Environment for Hybrid Knowledge- and Data-Driven Geriatric Risk Assessment
    Urosevic, Vladimir
    Paolini, Paolo
    Tatsiopoulos, Christos
    2017 25TH INTERNATIONAL CONFERENCE ON SOFTWARE, TELECOMMUNICATIONS AND COMPUTER NETWORKS (SOFTCOM), 2017, : 466 - 472
  • [2] Low-dimensional dynamical models of structures with uncertain boundaries via a hybrid knowledge- and data-driven approach
    Chen, Chao
    Wang, Yilong
    Fang, Bo
    Chen, Shuai
    Yang, Yang
    Wang, Biao
    Han, Hesheng
    Cao, Dengqing
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2025, 223
  • [3] Knowledge- and Data-driven Services for Energy Systems using Graph Neural Networks
    Fusco, Francesco
    Eck, Bradley
    Gormally, Robert
    Purcell, Mark
    Tirupathi, Seshu
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 1301 - 1308
  • [4] Combining knowledge- and data-driven methods for de-identification of clinical narratives
    Dehghan, Azad
    Kovacevic, Aleksandar
    Karystianis, George
    Keane, John A.
    Nenadic, Goran
    JOURNAL OF BIOMEDICAL INFORMATICS, 2015, 58 : S53 - S59
  • [5] Knowledge- and data-driven intelligent simulation method for tidal river network hydrodynamics
    Yuan, Saiyu
    Chen, Yihong
    Luo, Xiao
    Zhang, Huiming
    Tang, Hongwu
    Shuikexue Jinzhan/Advances in Water Science, 2025, 36 (01): : 28 - 38
  • [6] Detecting the penetration of malicious behavior in big data using hybrid algorithms
    Wang, Yue
    Shi, Yan
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (SUPPL 1) : 919 - 933
  • [7] Knowledge- and data-driven prediction of blast furnace gas generation and consumption in iron and steel sites
    Liu, Shuhan
    Sun, Wenqiang
    APPLIED ENERGY, 2025, 390
  • [8] Systematizing the lexicon of platforms in information systems: a data-driven study
    Bartelheimer, Christian
    zur Heiden, Philipp
    Luettenberg, Hedda
    Beverungen, Daniel
    ELECTRONIC MARKETS, 2022, 32 (01) : 375 - 396
  • [9] Systematizing the lexicon of platforms in information systems: a data-driven study
    Christian Bartelheimer
    Philipp zur Heiden
    Hedda Lüttenberg
    Daniel Beverungen
    Electronic Markets, 2022, 32 : 375 - 396
  • [10] Data-Driven Dialogue Systems for Social Agents
    Bowden, Kevin K.
    Oraby, Shereen
    Misra, Amita
    Wu, Jiaqi
    Lukin, Stephanie
    Walker, Marilyn
    ADVANCED SOCIAL INTERACTION WITH AGENTS, 2019, 510 : 53 - 56