Hate Speech Detection Model Using Bag of Words and Naive Bayes

Cited by: 1
Authors
Pandey, Yogesh [1]
Sharma, Monika [1]
Siddiqui, Mohammad Kashaf [1]
Yadav, Sudeept Singh [1]
Affiliation
[1] Galgotias Univ, SCSE, Greater Noida, India
Source
ADVANCES IN DATA AND INFORMATION SCIENCES | 2022 / Vol. 318
Keywords
Hate-speech; Bag of Words; Naive-Bayes; Digital-platforms;
DOI
10.1007/978-981-16-5689-7_40
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
In this era of increasing hate and intolerance, especially among people who interact with each other over the Web, there is a dire need for technological innovation that addresses the situation. This hate and clash of opinions often surfaces as hate speech in texts and in pictures. To counter it, we have developed a hate speech detection model that detects and identifies hateful and provocative content in textual data published on social media websites such as Twitter, Facebook, and Instagram. The idea behind the model is to prevent individuals from spreading, as well as witnessing, hate speech on digital platforms. We have developed a text classifier using basic principles of natural language processing. It uses the bag of words model for feature extraction, followed by various text filtering processes, and ultimately feeds the resulting data to a naive Bayes classifier, which is trained to work autonomously and classify textual data according to the sentiment it expresses, i.e. whether it implies a negative or a positive view of a given matter or topic. As a result of this experiment, we were able to classify the data we collected with a cumulative accuracy of 99.7% on the test data set.
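The pipeline described in the abstract (bag of words features, text filtering, naive Bayes classification) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the stop-word list, the tokenizer, the toy training sentences, the label names, and the use of a multinomial model with add-one (Laplace) smoothing are all hypothetical choices made here for the example.

```python
import math
import re
from collections import Counter

# Hypothetical stop-word list; the paper's actual filtering steps are not given here.
STOP_WORDS = {"a", "an", "the", "is", "are", "you", "i", "to", "of", "and", "this"}


def tokenize(text):
    """Lowercase the text, keep alphabetic tokens, and drop stop words."""
    return [t for t in re.findall(r"[a-z']+", text.lower()) if t not in STOP_WORDS]


def bag_of_words(text):
    """Map each remaining token to its count, ignoring word order."""
    return Counter(tokenize(text))


class NaiveBayes:
    """Multinomial naive Bayes with add-one smoothing (an assumed variant)."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)
        self.word_counts = {c: Counter() for c in self.class_counts}
        for text, label in zip(texts, labels):
            self.word_counts[label].update(bag_of_words(text))
        self.vocab = set().union(*self.word_counts.values())
        return self

    def predict(self, text):
        total_docs = sum(self.class_counts.values())
        features = bag_of_words(text)
        scores = {}
        for c in self.class_counts:
            total_words = sum(self.word_counts[c].values())
            # Start from the log prior of the class.
            score = math.log(self.class_counts[c] / total_docs)
            for word, n in features.items():
                if word not in self.vocab:
                    continue  # words never seen in training carry no evidence
                p = (self.word_counts[c][word] + 1) / (total_words + len(self.vocab))
                score += n * math.log(p)
            scores[c] = score
        return max(scores, key=scores.get)


# Toy data invented for this sketch; it is not the paper's data set.
train_texts = [
    "I hate you people",
    "you are disgusting and stupid",
    "have a lovely day",
    "thank you for the kind help",
]
train_labels = ["hateful", "hateful", "neutral", "neutral"]

model = NaiveBayes().fit(train_texts, train_labels)
print(model.predict("such stupid people"))
print(model.predict("what a lovely kind day"))
```

Scores are accumulated in log space so that products of many small word probabilities do not underflow. A four-sentence training set says nothing about real-world accuracy; it only shows how the stages fit together.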
Pages: 457 - 470
Page count: 14
Related papers
50 in total
  • [1] Detection of Hate Speech using BERT and Hate Speech Word Embedding with Deep Model
    Saleh, Hind
    Alhothali, Areej
    Moria, Kawthar
    APPLIED ARTIFICIAL INTELLIGENCE, 2023, 37 (01)
  • [2] Comparing Bag of Words and TF-IDF with different models for hate speech detection from live tweets
    Akuma S.
    Lubem T.
    Adom I.T.
    International Journal of Information Technology, 2022, 14 (7) : 3629 - 3635
  • [3] ON NAIVE BAYES IN SPEECH RECOGNITION
    Toth, Laszlo
    Kocsor, Andras
    Csirik, Janos
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2005, 15 (02) : 287 - 294
  • [4] Beyond Words: A Preliminary Study for Multimodal Hate Speech Detection
    Barcelo, Sofia
    Boulanger, Magali
    Tommasel, Antonela
    Rodriguez, Juan Manuel
    2024 L LATIN AMERICAN COMPUTER CONFERENCE, CLEI 2024, 2024,
  • [5] Automatic speech emotion detection using hybrid of gray wolf optimizer and naive Bayes
    Ramesh, S.
    Gomathi, S.
    Sasikala, S.
    Saravanan, T. R.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 26 (3) : 571 - 578
  • [6] Implementation Of Naive Bayes Classifier Algorithm On Social Media (Twitter) To The Teaching Of Indonesian Hate Speech
    Fatahillah, Naufal Riza
    Suryati, Pulut
    Haryawan, Cosmas
    2017 INTERNATIONAL CONFERENCE ON SUSTAINABLE INFORMATION ENGINEERING AND TECHNOLOGY (SIET), 2017, : 128 - 131
  • [7] Intrusion Detection Model Using Naive Bayes and Deep Learning Technique
    Tabash, Mohammed
    Abd Allah, Mohamed
    Tawfik, Bella
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2020, 17 (02) : 215 - 224
  • [8] Estimating confidence measures for speech recognition verification using a smoothed naive Bayes model
    Sanchis, A
    Juan, A
    Vidal, E
    PATTERN RECOGNITION AND IMAGE ANALYSIS, PROCEEDINGS, 2003, 2652 : 910 - 918
  • [9] NETWORK INTRUSION DETECTION USING NAIVE BAYES
    Panda, Mrutyunjaya
    Patra, Manas Ranjan
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2007, 7 (12) : 258 - 263
  • [10] Web advertisement detection using Naive Bayes
    Deng, Xin
    Hou, Lunqing
    Wang, Fei
    2018 INTERNATIONAL SYMPOSIUM ON POWER ELECTRONICS AND CONTROL ENGINEERING (ISPECE 2018), 2019, 1187