Comparison of Machine Learning Algorithms for Spam Detection

被引:2
|
作者
Sadia, Azeema [1 ]
Bashir, Fatima [1 ]
Khan, Reema Qaiser [1 ]
Bashir, Amna [2 ]
Khalid, Ammarah [3 ]
机构
[1] Bahria Univ, Dept Comp Sci, Karachi, Pakistan
[2] Sir Syed Univ Engn Technol, Dept Software Engn, Karachi, Pakistan
[3] Bahria Univ, Dept Software Engn, Karachi, Pakistan
关键词
spam detection; twitter; Naive Bayes; machine learning; data analysis; artificial analysis;
D O I
10.12720/jait.14.2.178-184
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Internet is used as a tool to offer people with endless knowledge. It is a global platform which is used for connectivity, communication, and sharing. At almost no cost, an individual can use the Internet to send email messages, update tweets, and Facebook messages to a vast number of people. These messages can also contain unsolicited advertisement which is identified as a spam. The company Twitter too is massively affected by spamming and it is an alarming issue for them. Twitter considers spam as actions that are unsolicited and repeated. These include tweet repetition, and the URLs that lead users to completely unrelated websites. The authors' have worked with twitter's dataset focusing on tweets about "iPhone". It was collected by using an API which was further pre-processed. In this paper, content-based features have been selected that recognize the spamming tweet by using R. Multiple machine learning algorithms were applied to detect spamming tweets: Naive Bayes, Logistic Regression, KNN, Decision Tree, and Support Vector Machine. It was observed that the best performance was achieved by Naive Bayes Algorithm giving an accuracy of 89%.
引用
收藏
页码:178 / 184
页数:7
相关论文
共 50 条
  • [41] Comparison of the performance of machine learning algorithms in breast cancer screening and detection: A protocol
    Salod, Zakia
    Singh, Yashik
    [J]. JOURNAL OF PUBLIC HEALTH RESEARCH, 2019, 8 (03) : 112 - 118
  • [42] An Empirical Comparison of Fake News Detection using different Machine Learning Algorithms
    Albahr, Abdulaziz
    Albahar, Marwan
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (09) : 146 - 152
  • [43] Comparison of Algorithms for the Detection of Plasmodium Falciparum: A Review of Machine Learning Based Approaches
    Ouedraogo, Josue
    Guinko, Ferdinand T.
    [J]. 19TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE, 2023, 583 : 270 - 279
  • [44] Machine learning algorithm-based spam detection in social networks
    Sumathi, M.
    Raja, S. P.
    [J]. SOCIAL NETWORK ANALYSIS AND MINING, 2023, 13 (01)
  • [45] Detection of Social Network Spam Based on Improved Extreme Learning Machine
    Zhang, Zhijie
    Hou, Rui
    Yang, Jin
    [J]. IEEE ACCESS, 2020, 8 : 112003 - 112014
  • [46] An Efficient Spam Detection Technique for IoT Devices Using Machine Learning
    Makkar, Aaisha
    Garg, Sahil
    Kumar, Neeraj
    Hossain, M. Shamim
    Ghoneim, Ahmed
    Alrashoud, Mubarak
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (02) : 903 - 912
  • [47] The Role of Feature Selection in Machine Learning for Detection of Spam and Phishing Attacks
    Salihovic, Ina
    Serdarevic, Haris
    Kevric, Jasmin
    [J]. ADVANCED TECHNOLOGIES, SYSTEMS, AND APPLICATIONS III, VOL 2, 2019, 60 : 476 - 483
  • [48] Spam detection for Youtube video comments using machine learning approaches
    Xiao, Andrew S.
    Liang, Qilian
    [J]. MACHINE LEARNING WITH APPLICATIONS, 2024, 16
  • [49] Comparative Study of Feature Reduction and Machine Learning Methods for Spam Detection
    Agarwal, Basant
    Mittal, Namita
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING (SOCPROS 2012), 2014, 236 : 761 - 769
  • [50] Machine learning algorithm-based spam detection in social networks
    M. Sumathi
    S. P. Raja
    [J]. Social Network Analysis and Mining, 13