Deep-BERT: Transfer Learning for Classifying Multilingual Offensive Texts on Social Media

被引:18
|
作者
Wadud, Md Anwar Hussen [1 ]
Mridha, M. F. [1 ]
Shin, Jungpil [2 ]
Nur, Kamruddin [3 ]
Saha, Aloke Kumar [4 ]
机构
[1] Bangladesh Univ Business & Technol, Dept Comp Sci & Engn, Dhaka, Bangladesh
[2] Univ Aizu, Sch Comp Sci & Engn, Aizu Wakamatsu, Fukushima, Japan
[3] Amer Int Univ Bangladesh, Dept Comp Sci, Dhaka, Bangladesh
[4] Univ Asia Pacific, Dept Comp Sci & Engn, Dhaka, Bangladesh
来源
关键词
Offensive text classification; deep convolutional neural network (DCNN); bidirectional encoder representations from transformers (BERT); natural language processing (NLP);
D O I
10.32604/csse.2023.027841
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Offensive messages on social media, have recently been frequently used to harass and criticize people. In recent studies, many promising algorithms have been developed to identify offensive texts. Most algorithms analyze text in a unidirectional manner, where a bidirectional method can maximize performance results and capture semantic and contextual information in sentences. In addition, there are many separate models for identifying offensive texts based on monolingual and multilingual, but there are a few models that can detect both monolingual and multilingual-based offensive texts. In this study, a detection system has been developed for both monolingual and multilingual offensive texts by combining deep convolutional neural network and bidirectional encoder representations from transformers (Deep-BERT) to identify offensive posts on social media that are used to harass others. This paper explores a variety of ways to deal with multilingualism, including collaborative multilingual and translation-based approaches. Then, the Deep-BERT is tested on the Bengali and English datasets, including the different bidirectional encoder representations from transformers (BERT) pre-trained word-embedding techniques, and found that the proposed DeepBERT's efficacy outperformed all existing offensive text classification algorithms reaching an accuracy of 91.83%. The proposed model is a state-of-the-art model that can classify both monolingual-based and multilingual-based offensive texts.
引用
收藏
页码:1775 / 1791
页数:17
相关论文
共 50 条
  • [1] Filtering offensive language from multilingual social media contents: A deep learning approach
    Saumya, Sunil
    Kumar, Abhinav
    Singh, Jyoti Prakash
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [2] Mental Illness Classification on Social Media Texts Using Deep Learning and Transfer Learning
    Arif, Muhammad
    Ameer, Iqra
    Bolucu, Necva
    Sidorov, Grigori
    Gelbukh, Alexander
    Elangovan, Vinnayak
    [J]. COMPUTACION Y SISTEMAS, 2024, 28 (02): : 451 - 464
  • [3] EFFECTIVE OFFENSIVE LANGUAGE DEDUCTION USING DEEP LEARNING IN SOCIAL MEDIA
    Adaikkan, Kalaivani
    Thenmozhi, Duraio
    [J]. REVUE ROUMAINE DES SCIENCES TECHNIQUES-SERIE ELECTROTECHNIQUE ET ENERGETIQUE, 2024, 69 (02): : 201 - 206
  • [4] Advancing offensive language detection in Arabic social media: a BERT-based ensemble learning approach
    Mazari, Ahmed Cherif
    Benterkia, Asmaa
    Takdenti, Zineb
    [J]. SOCIAL NETWORK ANALYSIS AND MINING, 2024, 14 (01)
  • [5] Deep learning and multilingual sentiment analysis on social media data: An overview
    Aguero-Torales, Marvin M.
    Salas, Jose I. Abreu
    Lopez-Herrera, Antonio G.
    [J]. APPLIED SOFT COMPUTING, 2021, 107
  • [6] A transfer learning approach for detecting offensive and hate speech on social media platforms
    Ishaani Priyadarshini
    Sandipan Sahu
    Raghvendra Kumar
    [J]. Multimedia Tools and Applications, 2023, 82 : 27473 - 27499
  • [7] Detecting Hateful and Offensive Speech in Arabic Social Media Using Transfer Learning
    Boulouard, Zakaria
    Ouaissa, Mariya
    Ouaissa, Mariyam
    Krichen, Moez
    Almutiq, Mutiq
    Gasmi, Karim
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (24):
  • [8] A transfer learning approach for detecting offensive and hate speech on social media platforms
    Priyadarshini, Ishaani
    Sahu, Sandipan
    Kumar, Raghvendra
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (18) : 27473 - 27499
  • [9] A BERT-based Deep Learning Approach for Reputation Analysis in Social Media
    Rahman, Mohammad Wali Ur
    Shao, Sicong
    Satam, Pratik
    Hariri, Salim
    Padilla, Chris
    Taylor, Zoe
    Nevarez, Carlos
    [J]. 2022 IEEE/ACS 19TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2022,
  • [10] SSN@LT-EDI-ACL2022: Transfer Learning using BERT for Detecting Signs of Depression from Social Media Texts
    Adarsh, S.
    Antony, Betina
    [J]. PROCEEDINGS OF THE SECOND WORKSHOP ON LANGUAGE TECHNOLOGY FOR EQUALITY, DIVERSITY AND INCLUSION (LTEDI 2022), 2022, : 326 - 330