Ex-ThaiHate: A Generative Multi-task Framework for Sentiment and Emotion Aware Hate Speech Detection with Explanation in Thai

被引:0
|
作者
Maity, Krishanu [1 ]
Bhattacharya, Shaubhik [1 ]
Phosit, Salisa [2 ]
Kongsamlit, Sawarod [2 ]
Saha, Sriparna [1 ]
Pasupa, Kitsuchart [2 ]
机构
[1] Indian Inst Technol Patna, Dept Comp Sci & Engn, Patna 801103, Bihar, India
[2] King Mongkuts Inst Technol Ladkrabang, Sch Informat Technol, Bangkok 10520, Thailand
关键词
Hate Speech; Sentiment; Emotion; Explainability; Thai; Multi-task;
D O I
10.1007/978-3-031-43427-3_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social media platforms have both positive and negative impacts on users in diverse societies. One of the adverse effects of social media platforms is the usage of hate and offensive language, which not only fosters prejudice but also harms the vulnerable. Additionally, a person's sentiment and emotional state heavily influence the intended content of any social media post. Despite extensive research being conducted to detect online hate speech in English, there is a lack of similar studies on low-resource languages such as Thai. The recent enactment of laws like the "right to explanations" in the General Data Protection Regulation has stimulated the development of interpretable models rather than solely focusing on performance. Motivated by this, we created the first benchmark hate speech corpus, called Ex-ThaiHate, in the Thai language. Each post is annotated with four labels, namely hate, sentiment, emotion, and rationales (explainability), which specify the phrases that are responsible for annotating the post as hate. In order to investigate the effect of sentiment and emotional information on detecting hate speech posts, we propose a unified generative framework called GenX, which redefines this multi-task problem as a text-to-text generation task to simultaneously solve four tasks: hate-speech identification, rationale detection, sentiment, and emotion detection. Our extensive experiments demonstrate that GenX significantly outperforms all baselines and state-of-the-art models, thereby highlighting its effectiveness in detecting hate speech and identifying the rationales in low-resource languages. The code and dataset are available at https://github.com/dsmlr/Ex-ThaiHate. Disclaimer: The article contains offensive text and profanity. This is due to the nature of the work and does not reflect any opinion or stance of the authors.
引用
收藏
页码:139 / 156
页数:18
相关论文
共 27 条
  • [21] Knowledge-Interactive Network with Sentiment Polarity Intensity-Aware Multi-Task Learning for Emotion Recognition in Conversations
    Xie, Yunhe
    Yang, Kailai
    Sun, Chengjie
    Liu, Bingquan
    Ji, Zhenzhou
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 2879 - 2889
  • [22] A transformer-based multi-task framework for joint detection of aggression and hate on social media data
    Ghosh, Soumitra
    Priyankar, Amit
    Ekbal, Asif
    Bhattacharyya, Pushpak
    NATURAL LANGUAGE ENGINEERING, 2023, 29 (06) : 1495 - 1515
  • [23] A Facial Expression-Aware Multimodal Multi-task Learning Framework for Emotion Recognition in Multi-party Conversations
    Zheng, Wenjie
    Yu, Jianfei
    Xia, Rui
    Wang, Shijin
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 15445 - 15459
  • [24] Spanish MTLHateCorpus 2023: Multi-task learning for hate speech detection to identify speech type, target, target group and intensity
    Pan, Ronghao
    Garcia-Diaz, Jose Antonio
    Valencia-Garcia, Rafael
    COMPUTER STANDARDS & INTERFACES, 2025, 94
  • [25] A multi-task learning framework for politeness and emotion detection in dialogues for mental health counselling and legal aid
    Priya, Priyanshu
    Firdaus, Mauajama
    Ekbal, Asif
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 224
  • [26] A multi-task learning and auditory attention detection framework towards EEG-assisted target speech extraction
    Wang, Xuefei
    Ding, Yuting
    Wang, Lei
    Chen, Fei
    APPLIED ACOUSTICS, 2025, 231
  • [27] A Multitask Framework for Sentiment, Emotion and Sarcasm aware Cyberbullying Detection from Multi-modal Code-Mixed Memes
    Maity, Krishanu
    Jha, Prince
    Saha, Sriparna
    Bhattacharyya, Pushpak
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 1739 - 1749