Advances in Machine Learning Algorithms for Hate Speech Detection in Social Media: A Review

被引：27

作者：

Mullah, Nanlir Sallau ^{[1
,2
]}

Zainon, Wan Mohd Nazmee Wan ^{[2
]}

机构：

[1] Univ Sains Malaysia, Sch Comp Sci, George Town, Malaysia

[2] Fed Coll Educ Pankshin, PMB1027, Pankshin, Plateau State, Nigeria

来源：

IEEE ACCESS | 2021年 / 9卷

关键词：

Text classification; cyber hate; deep learning; ensemble technique; machine learning; social media networks; CYBERBULLYING DETECTION; TWITTER; CLASSIFICATION; MODEL;

D O I：

10.1109/ACCESS.2021.3089515

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The aim of this paper is to review machine learning (ML) algorithms and techniques for hate speech detection in social media (SM). Hate speech problem is normally model as a text classification task. In this study, we examined the basic baseline components of hate speech classification using ML algorithms. There are five basic baseline components - data collection and exploration, feature extraction, dimensionality reduction, classifier selection and training, and model evaluation, were reviewed. There have been improvements in ML algorithms that were employed for hate speech detection over time. New datasets and different performance metrics have been proposed in the literature. To keep the researchers informed regarding these trends in the automatic detection of hate speech, it calls for a comprehensive and an updated state-of-the-art. The contributions of this study are three-fold. First to equip the readers with the necessary information on the critical steps involved in hate speech detection using ML algorithms. Secondly, the weaknesses and strengths of each method is critically evaluated to guide researchers in the algorithm choice dilemma. Lastly, some research gaps and open challenges were identified. The different variants of ML techniques were reviewed which include classical ML, ensemble approach and deep learning methods. Researchers and professionals alike will benefit immensely from this study.

引用

页码：88364 / 88376

页数：13

共 50 条

[1] A comparative analysis of machine learning algorithms for hate speech detection in social media
Omran, Esraa
Al Tararwah, Estabraq
Al Qundus, Jamal
[J]. ONLINE JOURNAL OF COMMUNICATION AND MEDIA TECHNOLOGIES, 2023, 13 (04):
[2] Sinhala Hate Speech Detection in Social Media Using Machine Learning and Deep Learning
Fernando, W. S. S.
Weerasinghe, Ruvan
Bandara, E. R. A. D.
[J]. 2022 22ND INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER), 2022,
[3] Transfer learning for hate speech detection in social media
Yuan, Lanqin
Wang, Tianyu
Ferraro, Gabriela
Suominen, Hanna
Rizoiu, Marian-Andrei
[J]. JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE, 2023, 6 (02): : 1081 - 1101
[4] Transfer learning for hate speech detection in social media
Lanqin Yuan
Tianyu Wang
Gabriela Ferraro
Hanna Suominen
Marian-Andrei Rizoiu
[J]. Journal of Computational Social Science, 2023, 6 : 1081 - 1101
[5] Sinhala Hate Speech Detection in Social Media using Text Mining and Machine learning
Sandaruwan, H. M. S. T.
Lorensuhewa, S. A. S.
Kalyani, M. A. L.
[J]. 2019 19TH INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER - 2019), 2019,
[6] Automatic hate speech detection in audio using machine learning algorithms
Joan L. Imbwaga
Nagatatna B. Chittaragi
Shashidhar G. Koolagudi
[J]. International Journal of Speech Technology, 2024, 27 (2) : 447 - 469
[7] Hate Speech Detection in Social Networks using Machine Learning and Deep Learning Methods
Toktarova, Aigerim
Syrlybay, Dariga
Myrzakhmetova, Bayan
Anuarbekova, Gulzat
Rakhimbayeva, Gulbarshin
Zhylanbaeva, Balkiya
Suieuova, Nabat
Kerimbekov, Mukhtar
[J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (05) : 396 - 406
[8] Intelligent detection of hate speech in Arabic social network: A machine learning approach
Aljarah, Ibrahim
Habib, Maria
Hijazi, Neveen
Faris, Hossam
Qaddoura, Raneem
Hammo, Bassam
Abushariah, Mohammad
Alfawareh, Mohammad
[J]. JOURNAL OF INFORMATION SCIENCE, 2021, 47 (04) : 483 - 501
[9] Leveraging Transfer Learning for Hate Speech Detection in Portuguese Social Media Posts
Ramos, Gil
Batista, Fernando
Ribeiro, Ricardo
Fialho, Pedro
Moro, Sergio
Fonseca, Antonio
Guerra, Rita
Carvalho, Paula
Marques, Catarina
Silva, Claudia
[J]. IEEE ACCESS, 2024, 12 : 101374 - 101389
[10] Lifelong Learning of Hate Speech Classification on Social Media
Qian, Jing
Wang, Hong
ElSherief, Mai
Yan, Xifeng
[J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 2304 - 2314

← 1 2 3 4 5 →