Efficient detection of hacker community based on twitter data using complex networks and machine learning algorithm

Cited by: 7
Authors
Al-Tarawneh, Ahmed [1]
Al-Saraireh, Ja'afer [1]
Affiliations
[1] Princess Sumaya Univ Technol, Comp Sci Dept, Amman, Jordan
Keywords
Tweets; hacking; prediction; twitter; social networks
DOI
10.3233/JIFS-210458
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Twitter is one of the most popular platforms for sharing and posting ideas. Hackers and anonymous attackers use such platforms maliciously, and their behavior can be used to predict the risk of future attacks by gathering and classifying hackers' tweets with machine-learning techniques. Previous approaches to detecting infected tweets rely on human effort or text analysis, so they are limited in their ability to capture meaning hidden between the lines of tweets. The main aim of this research paper is to enhance the efficiency of hacker detection on the Twitter platform by combining the complex networks technique with adapted machine learning algorithms. This work presents a methodology that collects a list of users, together with the followers who share their posts and have similar interests, from a hackers' community on Twitter. The list is built from a set of suggested keywords that hackers commonly use in their tweets. A complex network is then generated for all users to find the relations among them in terms of network centrality, closeness, and betweenness. After these values are extracted, a dataset of the most influential users in the hacker community is assembled. Subsequently, tweets belonging to the users in the extracted dataset are gathered and classified into positive and negative classes. The output of this process is fed into a machine learning process that applies different algorithms. This research builds and investigates an accurate dataset containing real users who belong to a hackers' community. Accuracy was measured as the proportion of correctly classified instances, averaged over the K-nearest neighbor, Naive Bayes, Random Tree, and support vector machine techniques, and reached about 90% for cross-validation and 88% for percentage split. Consequently, the proposed network cyber Twitter model is able to detect hackers and determine whether tweets pose a future risk to institutions and individuals, providing early warning of possible attacks.
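The network-analysis step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the graph is built or how the three measures are combined, so the undirected follower graph, the function names, and the ranking rule (betweenness first, then closeness, then degree) are all assumptions made for illustration. The user names in the example are hypothetical.

```python
from collections import defaultdict, deque


def build_graph(edges):
    """Build an undirected adjacency map from (user, follower) pairs (assumed input shape)."""
    graph = defaultdict(set)
    for a, b in edges:
        graph[a].add(b)
        graph[b].add(a)
    return dict(graph)


def degree_centrality(graph):
    """Fraction of the other users each user is directly linked to."""
    n = len(graph)
    return {v: (len(nbrs) / (n - 1) if n > 1 else 0.0) for v, nbrs in graph.items()}


def closeness_centrality(graph):
    """(number of reachable users) / (sum of shortest-path distances to them)."""
    result = {}
    for source in graph:
        dist = {source: 0}
        queue = deque([source])
        while queue:  # breadth-first search gives unweighted shortest paths
            v = queue.popleft()
            for w in graph[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
        total = sum(dist.values())
        result[source] = (len(dist) - 1) / total if total else 0.0
    return result


def betweenness_centrality(graph):
    """Unnormalized betweenness for an undirected graph (Brandes' algorithm)."""
    score = {v: 0.0 for v in graph}
    for s in graph:
        order, preds = [], {v: [] for v in graph}
        sigma = {v: 0 for v in graph}  # number of shortest paths from s
        sigma[s] = 1
        dist = {s: 0}
        queue = deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in graph[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = {v: 0.0 for v in graph}
        for w in reversed(order):  # accumulate dependencies, farthest nodes first
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                score[w] += delta[w]
    # Each unordered pair was counted from both endpoints.
    return {v: b / 2 for v, b in score.items()}


def most_influential(graph, k):
    """Top-k users under an assumed ranking: betweenness, then closeness, then degree."""
    deg = degree_centrality(graph)
    clo = closeness_centrality(graph)
    btw = betweenness_centrality(graph)
    ranked = sorted(graph, key=lambda v: (btw[v], clo[v], deg[v]), reverse=True)
    return ranked[:k]


# Hypothetical toy network: "hub" links three users, and "c" links onward to "d".
toy = build_graph([("hub", "a"), ("hub", "b"), ("hub", "c"), ("c", "d")])
print(most_influential(toy, 2))
```

In this toy network "hub" lies on the shortest paths between most pairs of users, so it ranks first, followed by "c", which is the only route to "d". A real pipeline would build the edge list from collected follower data and would need to decide how to treat disconnected components, which this sketch handles only by scoring each user over the users it can reach.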
Pages: 12321-12337
Page count: 17