Multi-label Hate Speech and Abusive Language Detection in Indonesian Twitter

被引:0
|
作者
Ibrohim, Muhammad Okky [1 ]
Budi, Indra [1 ]
机构
[1] Univ Indonesia, Fac Comp Sci, Kampus UI, Depok 16424, Indonesia
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hate speech and abusive language spreading on social media need to be detected automatically to avoid conflicts between citizens. Moreover, hate speech has a target, category, and level that also need to be detected to help the authority in prioritizing which hate speech must be addressed immediately. This research discusses multi-label text classification for abusive language and hate speech detection including detecting the target, category, and level of hate speech in Indonesian Twitter using machine learning approaches with Support Vector Machine (SVM), Naive Bayes (NB), and Random Forest Decision Tree (RFDT) classifier and Binary Relevance (BR), Label Power-set (LP), and Classifier Chains (CC) as the data transformation method. We used several kinds of feature extractions which are term frequency, orthography, and lexicon features. Our experiment results show that in general the RFDT classifier using LP as the transformation method gives the best accuracy with fast computational time.
引用
收藏
页码:46 / 57
页数:12
相关论文
共 50 条
  • [31] Outliers Detection in Multi-label Datasets
    Bello, Marilyn
    Napoles, Gonzalo
    Morera, Rafael
    Vanhoof, Koen
    Bello, Rafael
    ADVANCES IN SOFT COMPUTING, MICAI 2020, PT I, 2020, 12468 : 65 - 75
  • [32] Source Detection With Multi-Label Classification
    Vijayamohanan, Jayakrishnan
    Gupta, Arjun
    Noakoasteen, Oameed
    Goudos, Sotirios K. K.
    Christodoulou, Christos G.
    IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2023, 4 : 336 - 345
  • [33] Detection and Multi-label Classification of Bats
    Dierckx, Lucile
    Beauvois, Melanie
    Nijssen, Siegfried
    ADVANCES IN INTELLIGENT DATA ANALYSIS XX, IDA 2022, 2022, 13205 : 53 - 65
  • [34] Language comprehension as a multi-label classification problem
    Sering, Konstantin
    Milin, Petar
    Baayen, R. Harald
    STATISTICA NEERLANDICA, 2018, 72 (03) : 339 - 353
  • [35] Enhancing Multi-label Vulnerability Detection of Smart Contract using Language Model
    Duong Vu
    Tuan Nguyen
    Van Tong
    Souihi, Sami
    2023 5TH CONFERENCE ON BLOCKCHAIN RESEARCH & APPLICATIONS FOR INNOVATIVE NETWORKS AND SERVICES, BRAINS, 2023,
  • [36] Implementation Of Naive Bayes Classifier Algorithm On Social Media (Twitter) To The Teaching Of Indonesian Hate Speech
    Fatahillah, Naufal Riza
    Suryati, Pulut
    Haryawan, Cosmas
    2017 INTERNATIONAL CONFERENCE ON SUSTAINABLE INFORMATION ENGINEERING AND TECHNOLOGY (SIET), 2017, : 128 - 131
  • [37] Hate speech detection with ADHAR: a multi-dialectal hate speech corpus in Arabic
    Charfi, Anis
    Besghaier, Mabrouka
    Akasheh, Raghda
    Atalla, Andria
    Zaghouani, Wajdi
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2024, 7
  • [39] HateVersarial: Adversarial Attack Against Hate Speech Detection Algorithms on Twitter
    Grolman, Edita
    Binyamini, Hodaya
    Shabtai, Asaf
    Elovici, Yuval
    Morikawa, Ikuya
    Shimizu, Toshiya
    PROCEEDINGS OF THE 30TH ACM CONFERENCE ON USER MODELING, ADAPTATION AND PERSONALIZATION, UMAP 2022, 2022, : 143 - 152
  • [40] Hate speech detection on multilingual twitter using convolutional neural networks
    Elouali A.
    Elberrichi Z.
    Elouali N.
    Elouali, Aya (n.elouali@esi-sba.dz), 1600, International Information and Engineering Technology Association (34): : 81 - 88