Towards Automatic Detection and Explanation of Hate Speech and Offensive Language

被引:2
|
作者
Dorris, Wyatt [1 ]
Hu, Ruijia [1 ]
Vishwamitra, Nishant [1 ]
Luo, Feng [1 ]
Costello, Matthew [1 ]
机构
[1] Clemson Univ, Clemson, SC 29631 USA
关键词
D O I
10.1145/3375708.3380312
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The use of hate speech and offensive language online has become widely recognized as a critical social problem plaguing today's Internet users. Previous research in the detection of hate speech and offensive language has primarily focused on using machine learning approaches to naively detect hate speech and offensive language, without explaining the reasons for their detection. In this work, we introduce a novel hate speech and offensive language defense system called HateDefender, which consists of a detection model based on deep Long Short-term Memory (LSTM) neural networks and an explanation model based on the gating signals of LSTMs. HateDefender effectively detects hate speech and offensive language (average accuracy of 90.82% and 89.10% on hate speech and offensive language, respectively) and explains their factors by pinpointing the exact words that are responsible for causing them. Our system uses these explanations for the effective intervention of such incidents online.
引用
收藏
页码:23 / 29
页数:7
相关论文
共 50 条
  • [1] On the Impact ofWord Representation in Hate Speech and Offensive Language Detection and Explanation
    Hu, Ruijia
    Dorris, Wyatt
    Vishwamitra, Nishant
    Luo, Feng
    Costello, Matthew
    [J]. PROCEEDINGS OF THE TENTH ACM CONFERENCE ON DATA AND APPLICATION SECURITY AND PRIVACY, CODASPY 2020, 2020, : 171 - 173
  • [2] Offensive Language and Hate Speech Detection for Danish
    Sigurbergsson, Gudbjartur Ingi
    Derczynski, Leon
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 3498 - 3508
  • [3] Hate-Speech and Offensive Language Detection in Roman Urdu
    Rizwan, Hammad
    Shakeel, Muhammad Haroon
    Karim, Asim
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 2512 - 2522
  • [4] Offensive Language and Hate Speech Detection Based on Transfer Learning
    Touahri, Ibtissam
    Mazroui, Azzeddine
    [J]. ADVANCED INTELLIGENT SYSTEMS FOR SUSTAINABLE DEVELOPMENT (AI2SD'2020), VOL 2, 2022, 1418 : 300 - 311
  • [5] Automatic Hate and Offensive speech detection framework from social media : the case of Afaan Oromoo language
    Kanessa, Lata Guta
    Tulu, Solomon Gizaw
    [J]. 2021 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY FOR DEVELOPMENT FOR AFRICA (ICT4DA), 2021, : 42 - 47
  • [6] Detection of Hate and Offensive Speech in Text
    Wani, Abid Hussain
    Molvi, Nahida Shafi
    Ashraf, Sheikh Ishrah
    [J]. INTELLIGENT HUMAN COMPUTER INTERACTION (IHCI 2019), 2020, 11886 : 87 - 93
  • [8] Power of Explanations: Towards automatic debiasing in hate speech detection
    Cai, Yi
    Zimek, Arthur
    Wunder, Gerhard
    Ntoutsi, Eirini
    [J]. 2022 IEEE 9TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2022, : 465 - 474
  • [9] TM-HOL: Topic memory model for detection of hate speech and offensive language
    Chen, Jing
    Ma, Kun
    Ji, Ke
    Chen, Zhenxiang
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (14):
  • [10] Hate speech and offensive language detection in Dravidian languages using deep ensemble framework
    Roy, Pradeep Kumar
    Bhawal, Snehaan
    Subalalitha, Chinnaudayar Navaneethakrishnan
    [J]. COMPUTER SPEECH AND LANGUAGE, 2022, 75