Detecting malicious behavior in social platforms via hybrid knowledge- and data-driven systems

被引:6
|
作者
Paredes, Jose N. [1 ,3 ,4 ]
Simari, Gerardo I. [1 ,3 ,4 ,5 ]
Martinez, Maria Vanina [2 ,6 ,7 ]
Falappa, Marcelo A. [1 ,3 ,4 ]
机构
[1] San Andres 800,Campus Palihue, RA-8000 Bahia Blanca, Buenos Aires, Argentina
[2] Pabellon 1,Ciudad Univ,C1428EGA, Buenos Aires, DF, Argentina
[3] Univ Nacl Sur UNS, Dept Comp Sci & Engn, Bahia Blanca, Buenos Aires, Argentina
[4] Inst Comp Sci & Engn UNS CONICET, Bahia Blanca, Buenos Aires, Argentina
[5] Arizona State Univ, Sch Comp Informat & Decis Syst Engn CIDSE, Tempe, AZ 85281 USA
[6] Univ Buenos Aires UBA, Dept Comp Sci, Buenos Aires, DF, Argentina
[7] Inst Comp Sci Res UBA CONICET, Buenos Aires, DF, Argentina
关键词
Malicious behavior; Fake news; Botnets; Social data; Information/misinformation diffusion; Decision support systems; Human-in-the-loop computing;
D O I
10.1016/j.future.2021.06.033
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Among the wide variety of malicious behavior commonly observed in modern social platforms, one of the most notorious is the diffusion of fake news, given its potential to influence the opinions of millions of people who can be voters, consumers, or simply citizens going about their daily lives. In this paper, we implement and carry out an empirical evaluation of a version of the recently-proposed NETDER architecture for hybrid AI decision-support systems with the capability of leveraging the availability of machine learning modules, logical reasoning about unknown objects, and forecasts based on diffusion processes. NETDER is a general architecture for reasoning about different kinds of malicious behavior such as dissemination of fake news, hate speech, and malware, detection of botnet operations, prevention of cyber attacks including those targeting software products or blockchain transactions, among others. Here, we focus on the case of fake news dissemination on social platforms by three different kinds of users: non-malicious, malicious, and botnet members. In particular, we focus on three tasks: (i) determining who is responsible for posting a fake news article, (ii) detecting malicious users, and (iii) detecting which users belong to a botnet designed to disseminate fake news. Given the difficulty of obtaining adequate data with ground truth, we also develop a testbed that combines real-world fake news datasets with synthetically generated networks of users and fully-detailed traces of their behavior throughout a series of time points. We designed our testbed to be customizable for different problem sizes and settings, and make its code publicly available to be used in similar evaluation efforts. Finally, we report on the results of a thorough experimental evaluation of three variants of our model and six environmental settings over the three tasks. Our results clearly show the effects that the quality of knowledge engineering tasks, the quality of the underlying machine learning classifier used to detect fake news, and the specific environmental conditions have on smart policing efforts in social platforms. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:232 / 246
页数:15
相关论文
共 50 条
  • [21] Intercontinental prediction of soybean phenology via hybrid ensemble of knowledge-based and data-driven models
    McCormick, Ryan F.
    Truong, Sandra K.
    Rotundo, Jose
    Gaspar, Adam P.
    Kyle, Don
    van Eeuwijk, Fred
    Messina, Carlos D.
    IN SILICO PLANTS, 2021, 3 (01):
  • [22] Towards an Architecture for Big Data-Driven Knowledge Management Systems
    Thang Le Dinh
    Thuong-Cang Phan
    Bui, Trung
    AMCIS 2016 PROCEEDINGS, 2016,
  • [23] Enterprise systems, emerging technologies, and the data-driven knowledge organisation
    Yu Chung Wang, William
    Pauleen, David
    Taskin, Nazim
    KNOWLEDGE MANAGEMENT RESEARCH & PRACTICE, 2022, 20 (01) : 1 - 13
  • [24] DATA-DRIVEN METHODS FOR DETECTING TRAFFIC JAMS IN VEHICULAR TRAFFIC SYSTEMS
    Ghadami, Amin
    Doering, Charles R.
    Epureanu, Bogdan I.
    PROCEEDINGS OF THE ASME 2020 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2020, VOL 7B, 2020,
  • [25] Hybrid Computational Steering for Dynamic Data-Driven Application Systems
    Han, Junyi
    Brooke, John
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE 2016 (ICCS 2016), 2016, 80 : 407 - 417
  • [26] Distributed Fault Diagnosis for Heterogeneous Multiagent Systems: A Hybrid Knowledge-Based and Data-Driven Method
    Li, Runze
    Jiang, Bin
    Zong, Yan
    Lu, Ningyun
    Guo, Li
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2024, 32 (09) : 4940 - 4949
  • [27] Data-Driven Behavior-Based Negotiation Model for Cyber-Physical-Social Systems
    Li, Lin
    Lai, K. Robert
    Zhu, Shunzhi
    IEEE ACCESS, 2019, 7 : 83319 - 83331
  • [28] Data-Driven Model-Based Detection of Malicious Insiders via Physical Access Logs
    Cheh, Carmen
    Chen, Binbin
    Temple, William G.
    Sanders, William H.
    QUANTITATIVE EVALUATION OF SYSTEMS (QEST 2017), 2017, 10503 : 275 - 291
  • [29] Data-driven Model-based Detection of Malicious Insiders via Physical Access Logs
    Cheh, Carmen
    Thakore, Uttam
    Fawaz, Ahmed
    Chen, Binbin
    Temple, William G.
    Sanders, William H.
    ACM TRANSACTIONS ON MODELING AND COMPUTER SIMULATION, 2019, 29 (04):
  • [30] Learning to identify Protected Health Information by integrating knowledge- and data-driven algorithms: A case study on psychiatric evaluation notes
    Dehghan, Azad
    Kovacevic, Aleksandar
    Karystianis, George
    Keane, John A.
    Nenadic, Goran
    JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 75 : S28 - S33