Comparison of Machine Learning Algorithms for Spam Detection

被引:2
|
作者
Sadia, Azeema [1 ]
Bashir, Fatima [1 ]
Khan, Reema Qaiser [1 ]
Bashir, Amna [2 ]
Khalid, Ammarah [3 ]
机构
[1] Bahria Univ, Dept Comp Sci, Karachi, Pakistan
[2] Sir Syed Univ Engn Technol, Dept Software Engn, Karachi, Pakistan
[3] Bahria Univ, Dept Software Engn, Karachi, Pakistan
关键词
spam detection; twitter; Naive Bayes; machine learning; data analysis; artificial analysis;
D O I
10.12720/jait.14.2.178-184
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Internet is used as a tool to offer people with endless knowledge. It is a global platform which is used for connectivity, communication, and sharing. At almost no cost, an individual can use the Internet to send email messages, update tweets, and Facebook messages to a vast number of people. These messages can also contain unsolicited advertisement which is identified as a spam. The company Twitter too is massively affected by spamming and it is an alarming issue for them. Twitter considers spam as actions that are unsolicited and repeated. These include tweet repetition, and the URLs that lead users to completely unrelated websites. The authors' have worked with twitter's dataset focusing on tweets about "iPhone". It was collected by using an API which was further pre-processed. In this paper, content-based features have been selected that recognize the spamming tweet by using R. Multiple machine learning algorithms were applied to detect spamming tweets: Naive Bayes, Logistic Regression, KNN, Decision Tree, and Support Vector Machine. It was observed that the best performance was achieved by Naive Bayes Algorithm giving an accuracy of 89%.
引用
收藏
页码:178 / 184
页数:7
相关论文
共 50 条
  • [1] SMS spam detection and comparison of various machine learning algorithms
    Sethi, Paras
    Bhandari, Vaibhav
    Kohli, Bhavna
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES FOR SMART NATION (IC3TSN), 2017, : 28 - 31
  • [2] Machine and Deep Learning Algorithms for Twitter Spam Detection
    Alsaffar, Dalia
    Alfahhad, Amjad
    Alqhtani, Bashaier
    Alamri, Lama
    Alansari, Shahad
    Alqahtani, Nada
    Alboaneen, Dabiah A.
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2019, 2020, 1058 : 483 - 491
  • [3] Comparison of machine learning techniques for spam detection
    Ghosh, Argha
    Senthilrajan, A.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (19) : 29227 - 29254
  • [4] Comparison of machine learning techniques for spam detection
    Argha Ghosh
    A. Senthilrajan
    [J]. Multimedia Tools and Applications, 2023, 82 : 29227 - 29254
  • [5] Comparative Study of Machine Learning Algorithms for SMS Spam Detection
    Alzahrani, Amani
    Rawat, Danda B.
    [J]. 2019 IEEE SOUTHEASTCON, 2019,
  • [6] Efficient Detection of Spam Over Internet Telephony by Machine Learning Algorithms
    Behan, Ladislav
    Rozhon, Jan
    Safarik, Jakub
    Rezac, Filip
    Voznak, Miroslav
    [J]. IEEE ACCESS, 2022, 10 : 133412 - 133426
  • [7] Comparison of Multiple Machine Learning Approaches and Sentiment Analysis in Detection of Spam
    Alam, A. N. M. Sajedul
    Zaman, Shifat
    Dey, Arnob Kumar
    Bin Kibria, Junaid
    Alam, Zawad
    Mahbub, Mohammed Julfikar Ali
    Mahtab, Md. Motahar
    Rasel, Annajiat Alim
    [J]. ADVANCES IN COMPUTING AND DATA SCIENCES (ICACDS 2022), PT I, 2022, 1613 : 37 - 50
  • [8] Spam Detection Approach for Secure Mobile Message Communication Using Machine Learning Algorithms
    Luo GuangJun
    Nazir, Shah
    Khan, Habib Ullah
    Ul Haq, Amin
    [J]. SECURITY AND COMMUNICATION NETWORKS, 2020, 2020
  • [9] Machine Learning for the Detection of Spam in Twitter Networks
    Wang, Alex Hai
    [J]. E-BUSINESS AND TELECOMMUNICATIONS, 2012, 222 : 319 - 333
  • [10] Comparison of Machine Learning Algorithms for Detection of Network Intrusions
    Li, Zhida
    Batta, Prerna
    Trajkovic, Ljiljana
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 4242 - 4247