Advances in Machine Learning Algorithms for Hate Speech Detection in Social Media: A Review

被引:27
|
作者
Mullah, Nanlir Sallau [1 ,2 ]
Zainon, Wan Mohd Nazmee Wan [2 ]
机构
[1] Univ Sains Malaysia, Sch Comp Sci, George Town, Malaysia
[2] Fed Coll Educ Pankshin, PMB1027, Pankshin, Plateau State, Nigeria
关键词
Text classification; cyber hate; deep learning; ensemble technique; machine learning; social media networks; CYBERBULLYING DETECTION; TWITTER; CLASSIFICATION; MODEL;
D O I
10.1109/ACCESS.2021.3089515
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The aim of this paper is to review machine learning (ML) algorithms and techniques for hate speech detection in social media (SM). Hate speech problem is normally model as a text classification task. In this study, we examined the basic baseline components of hate speech classification using ML algorithms. There are five basic baseline components - data collection and exploration, feature extraction, dimensionality reduction, classifier selection and training, and model evaluation, were reviewed. There have been improvements in ML algorithms that were employed for hate speech detection over time. New datasets and different performance metrics have been proposed in the literature. To keep the researchers informed regarding these trends in the automatic detection of hate speech, it calls for a comprehensive and an updated state-of-the-art. The contributions of this study are three-fold. First to equip the readers with the necessary information on the critical steps involved in hate speech detection using ML algorithms. Secondly, the weaknesses and strengths of each method is critically evaluated to guide researchers in the algorithm choice dilemma. Lastly, some research gaps and open challenges were identified. The different variants of ML techniques were reviewed which include classical ML, ensemble approach and deep learning methods. Researchers and professionals alike will benefit immensely from this study.
引用
收藏
页码:88364 / 88376
页数:13
相关论文
共 50 条
  • [1] A comparative analysis of machine learning algorithms for hate speech detection in social media
    Omran, Esraa
    Al Tararwah, Estabraq
    Al Qundus, Jamal
    [J]. ONLINE JOURNAL OF COMMUNICATION AND MEDIA TECHNOLOGIES, 2023, 13 (04):
  • [2] Sinhala Hate Speech Detection in Social Media Using Machine Learning and Deep Learning
    Fernando, W. S. S.
    Weerasinghe, Ruvan
    Bandara, E. R. A. D.
    [J]. 2022 22ND INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER), 2022,
  • [3] Transfer learning for hate speech detection in social media
    Yuan, Lanqin
    Wang, Tianyu
    Ferraro, Gabriela
    Suominen, Hanna
    Rizoiu, Marian-Andrei
    [J]. JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE, 2023, 6 (02): : 1081 - 1101
  • [4] Transfer learning for hate speech detection in social media
    Lanqin Yuan
    Tianyu Wang
    Gabriela Ferraro
    Hanna Suominen
    Marian-Andrei Rizoiu
    [J]. Journal of Computational Social Science, 2023, 6 : 1081 - 1101
  • [5] Sinhala Hate Speech Detection in Social Media using Text Mining and Machine learning
    Sandaruwan, H. M. S. T.
    Lorensuhewa, S. A. S.
    Kalyani, M. A. L.
    [J]. 2019 19TH INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER - 2019), 2019,
  • [6] Automatic hate speech detection in audio using machine learning algorithms
    Joan L. Imbwaga
    Nagatatna B. Chittaragi
    Shashidhar G. Koolagudi
    [J]. International Journal of Speech Technology, 2024, 27 (2) : 447 - 469
  • [7] Hate Speech Detection in Social Networks using Machine Learning and Deep Learning Methods
    Toktarova, Aigerim
    Syrlybay, Dariga
    Myrzakhmetova, Bayan
    Anuarbekova, Gulzat
    Rakhimbayeva, Gulbarshin
    Zhylanbaeva, Balkiya
    Suieuova, Nabat
    Kerimbekov, Mukhtar
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (05) : 396 - 406
  • [8] Intelligent detection of hate speech in Arabic social network: A machine learning approach
    Aljarah, Ibrahim
    Habib, Maria
    Hijazi, Neveen
    Faris, Hossam
    Qaddoura, Raneem
    Hammo, Bassam
    Abushariah, Mohammad
    Alfawareh, Mohammad
    [J]. JOURNAL OF INFORMATION SCIENCE, 2021, 47 (04) : 483 - 501
  • [9] Leveraging Transfer Learning for Hate Speech Detection in Portuguese Social Media Posts
    Ramos, Gil
    Batista, Fernando
    Ribeiro, Ricardo
    Fialho, Pedro
    Moro, Sergio
    Fonseca, Antonio
    Guerra, Rita
    Carvalho, Paula
    Marques, Catarina
    Silva, Claudia
    [J]. IEEE ACCESS, 2024, 12 : 101374 - 101389
  • [10] Lifelong Learning of Hate Speech Classification on Social Media
    Qian, Jing
    Wang, Hong
    ElSherief, Mai
    Yan, Xifeng
    [J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 2304 - 2314