Ex-ThaiHate: A Generative Multi-task Framework for Sentiment and Emotion Aware Hate Speech Detection with Explanation in Thai

被引：0

作者：

Maity, Krishanu ^{[1
]}

Bhattacharya, Shaubhik ^{[1
]}

Phosit, Salisa ^{[2
]}

Kongsamlit, Sawarod ^{[2
]}

Saha, Sriparna ^{[1
]}

Pasupa, Kitsuchart ^{[2
]}

机构：

[1] Indian Inst Technol Patna, Dept Comp Sci & Engn, Patna 801103, Bihar, India

[2] King Mongkuts Inst Technol Ladkrabang, Sch Informat Technol, Bangkok 10520, Thailand

来源：

MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2023, PT VI | 2023年 / 14174卷

关键词：

Hate Speech; Sentiment; Emotion; Explainability; Thai; Multi-task;

D O I：

10.1007/978-3-031-43427-3_9

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Social media platforms have both positive and negative impacts on users in diverse societies. One of the adverse effects of social media platforms is the usage of hate and offensive language, which not only fosters prejudice but also harms the vulnerable. Additionally, a person's sentiment and emotional state heavily influence the intended content of any social media post. Despite extensive research being conducted to detect online hate speech in English, there is a lack of similar studies on low-resource languages such as Thai. The recent enactment of laws like the "right to explanations" in the General Data Protection Regulation has stimulated the development of interpretable models rather than solely focusing on performance. Motivated by this, we created the first benchmark hate speech corpus, called Ex-ThaiHate, in the Thai language. Each post is annotated with four labels, namely hate, sentiment, emotion, and rationales (explainability), which specify the phrases that are responsible for annotating the post as hate. In order to investigate the effect of sentiment and emotional information on detecting hate speech posts, we propose a unified generative framework called GenX, which redefines this multi-task problem as a text-to-text generation task to simultaneously solve four tasks: hate-speech identification, rationale detection, sentiment, and emotion detection. Our extensive experiments demonstrate that GenX significantly outperforms all baselines and state-of-the-art models, thereby highlighting its effectiveness in detecting hate speech and identifying the rationales in low-resource languages. The code and dataset are available at https://github.com/dsmlr/Ex-ThaiHate. Disclaimer: The article contains offensive text and profanity. This is due to the nature of the work and does not reflect any opinion or stance of the authors.

引用

页码：139 / 156

页数：18

共 27 条

[21] Knowledge-Interactive Network with Sentiment Polarity Intensity-Aware Multi-Task Learning for Emotion Recognition in Conversations
Xie, Yunhe
Yang, Kailai
Sun, Chengjie
Liu, Bingquan
Ji, Zhenzhou
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 2879 - 2889
[22] A transformer-based multi-task framework for joint detection of aggression and hate on social media data
Ghosh, Soumitra
Priyankar, Amit
Ekbal, Asif
Bhattacharyya, Pushpak
NATURAL LANGUAGE ENGINEERING, 2023, 29 (06) : 1495 - 1515
[23] A Facial Expression-Aware Multimodal Multi-task Learning Framework for Emotion Recognition in Multi-party Conversations
Zheng, Wenjie
Yu, Jianfei
Xia, Rui
Wang, Shijin
PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 15445 - 15459
[24] Spanish MTLHateCorpus 2023: Multi-task learning for hate speech detection to identify speech type, target, target group and intensity
Pan, Ronghao
Garcia-Diaz, Jose Antonio
Valencia-Garcia, Rafael
COMPUTER STANDARDS & INTERFACES, 2025, 94
[25] A multi-task learning framework for politeness and emotion detection in dialogues for mental health counselling and legal aid
Priya, Priyanshu
Firdaus, Mauajama
Ekbal, Asif
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 224
[26] A multi-task learning and auditory attention detection framework towards EEG-assisted target speech extraction
Wang, Xuefei
Ding, Yuting
Wang, Lei
Chen, Fei
APPLIED ACOUSTICS, 2025, 231
[27] A Multitask Framework for Sentiment, Emotion and Sarcasm aware Cyberbullying Detection from Multi-modal Code-Mixed Memes
Maity, Krishanu
Jha, Prince
Saha, Sriparna
Bhattacharyya, Pushpak
PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 1739 - 1749

← 1 2 3 →