A Deep Learning Framework for the Detection of Malay Hate Speech

Cited by: 4
Authors
Maity, Krishanu [1]
Bhattacharya, Shaubhik [1]
Saha, Sriparna [1]
Seera, Manjeevan [2]
Affiliations
[1] Indian Inst Technol Patna, CSE Dept, Bihta 801106, India
[2] Monash Univ Malaysia, Sch Business, Dept Econometr & Business Stat, Subang Jaya 47500, Selangor Darul, Malaysia
Keywords
Hate speech; Malay; transformer; capsule network; FastText;
DOI
10.1109/ACCESS.2023.3298808
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Although social media can efficiently disseminate information, they also facilitate the spread of online abuse, harassment, and hate speech. In 2019, the United Nations Secretary-General introduced the United Nations Strategy and Plan of Action on Hate Speech in response to the alarming global trend of rising hate speech. It is crucial to prevent hate speech because it can have severe negative effects on both individuals and society. While much research has been conducted on detecting online hate speech in English, little has been conducted in other languages, such as Malay. In this paper, we present the first benchmark dataset, HateM, for detecting hate speech in Malay, comprising over 4,892 annotated tweets. We created a two-channel deep learning model, XLCaps, to effectively handle noisy Malay-language posts. One channel passes the output of the XLNet language model to a capsule network, while the other passes FastText embeddings to a Bi-GRU. Our proposed model surpasses the baseline models, achieving an overall accuracy of 80.69% and an F1 score of 80.41%. This work contributes to the prevention of hate speech in Malay and can serve as a basis for future study in this area. The approach to effectively handling noisy Malay posts can also be applied to other languages. The code and dataset are available at https://github.com/MaityKrishanu/Hate_Malay.
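The abstract reports results as overall accuracy and F1. As a rough illustration of how these two figures are typically computed for a classifier, the following minimal sketch uses plain Python on made-up toy labels. The function names, the toy data, and the choice of macro averaging are assumptions made for this example; the abstract does not state which F1 averaging the authors used, and this is not their evaluation code.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the gold labels."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 (macro averaging is an assumption)."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class p, but it was wrong
            fn[t] += 1  # gold class t was missed
    scores = []
    for label in labels:
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the class never occurs
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical toy labels: 1 = hate speech, 0 = not hate speech.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

acc = accuracy(y_true, y_pred)
f1 = macro_f1(y_true, y_pred)
print(f"accuracy={acc:.4f}, macro F1={f1:.4f}")
```

On a balanced toy set like this one the two numbers coincide; on imbalanced data they can differ noticeably, which is one reason both are usually reported.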
Pages: 79542-79552
Number of pages: 11