A comparison of classification algorithms for hate speech detection

被引:14
|
作者
Putri, T. T. A. [1 ]
Sriadhi, S. [1 ]
Sari, R. D. [1 ]
Rahmadani, R. [1 ]
Hutahaean, H. D. [1 ]
机构
[1] Univ Negeri Medan, PTIK FT, Medan, Indonesia
关键词
D O I
10.1088/1757-899X/830/3/032006
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Freedom of opinion through social media is frequently affect a negative impact that spreads hatred. This study aims to automatically detect Indonesian tweets that contain hate speech on Twitter social media. The data used amounted to 4,002 tweets related to politics, religion, ethnicity and race in Indonesia. The application model uses classification methods with machine learning algorithms such as Naive Bayes, Multi Level Perceptron, AdaBoost Classifier, Decision Tree and Support Vector Machine. The study also compared the performance of the model using SMOTE to overcome imbalanced data. The results show that the Multinomial Naive Bayes algorithm produces the best model with the highest recall value of 93.2% which has an accuracy value of 71.2% for the classification of hate speech. Therefore, the Multinomial Naive Bayes algorithm without SMOTE is recommended as the model to detect hate speech on social media.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Afaan Oromo Hate Speech Detection and Classification on Social Media
    Ababu, Teshome Mulugeta
    Woldeyohannis, Michael Melese
    [J]. LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 6612 - 6619
  • [2] Hate Speech Classification in Bulgarian
    Ralev, Radoslav
    Pfeffer, Juergen
    [J]. PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE COMPUTATIONAL LINGUISTICS IN BULGARIA, CLIB 2022, 2022, : 49 - 58
  • [3] HateVersarial: Adversarial Attack Against Hate Speech Detection Algorithms on Twitter
    Grolman, Edita
    Binyamini, Hodaya
    Shabtai, Asaf
    Elovici, Yuval
    Morikawa, Ikuya
    Shimizu, Toshiya
    [J]. PROCEEDINGS OF THE 30TH ACM CONFERENCE ON USER MODELING, ADAPTATION AND PERSONALIZATION, UMAP 2022, 2022, : 143 - 152
  • [4] Automatic hate speech detection in audio using machine learning algorithms
    Joan L. Imbwaga
    Nagatatna B. Chittaragi
    Shashidhar G. Koolagudi
    [J]. International Journal of Speech Technology, 2024, 27 (2) : 447 - 469
  • [5] A comparative analysis of machine learning algorithms for hate speech detection in social media
    Omran, Esraa
    Al Tararwah, Estabraq
    Al Qundus, Jamal
    [J]. ONLINE JOURNAL OF COMMUNICATION AND MEDIA TECHNOLOGIES, 2023, 13 (04):
  • [6] Advances in Machine Learning Algorithms for Hate Speech Detection in Social Media: A Review
    Mullah, Nanlir Sallau
    Zainon, Wan Mohd Nazmee Wan
    [J]. IEEE ACCESS, 2021, 9 : 88364 - 88376
  • [7] Hate Speech Detection in Clubhouse
    Mansourifar, Hadi
    Alsagheer, Dana
    Fathi, Reza
    Shi, Weidong
    Ni, Lan
    Huang, Yan
    [J]. MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, PT II, 2021, 1525 : 341 - 351
  • [9] Classification of Hate Speech Language Detection on Social Media: Preliminary Study for Improvement
    Muzakir, Ari
    Adi, Kusworo
    Kusumaningrum, Retno
    [J]. EMERGING TRENDS IN INTELLIGENT SYSTEMS & NETWORK SECURITY, 2023, 147 : 146 - 156
  • [10] A comparison of text preprocessing techniques for hate and offensive speech detection in Twitter
    Anna Glazkova
    [J]. Social Network Analysis and Mining, 13