IndicCONAN: A Multilingual Dataset for Combating Hate Speech in Indian Context

被引：0

作者：

Sahoo, Nihar Ranja ^{[1
]}

Beria, Gyana Prakash ^{[1
]}

Bhattacharyya, Pushpak ^{[1
]}

机构：

[1] Indian Inst Technol, CFILT, Bombay, India

来源：

THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 20 | 2024年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Hate speech (HS) is a growing concern in many parts of the world, including India, where it has led to numerous instances of violence and discrimination. The development of effective counter-narratives (CNs) is a critical step in combating hate speech, but there is a lack of research in this area, especially in non-English languages. In this paper, we introduce a new dataset, IndicCONAN, of counter-narratives against hate speech in Hindi and Indian English. We propose a scalable human-in-the-loop approach for generating counter-narratives by an auto-regressive language model through machine generation - human correction cycle, where the model uses augmented data from previous cycles to generate new training samples. These newly generated samples are then reviewed and edited by annotators, leading to further model refinement. The dataset consists of over (2) over tilde ,500 examples of counter-narratives each in both English and Hindi corresponding to various hate speeches in the Indian context. We also present a framework for generating CNs conditioned on specific CN type with a mean perplexity of 3.85 for English and 3.70 for Hindi, a mean toxicity score of 0.04 for English and 0.06 for Hindi, and a mean diversity of 0.08 for English and 0.14 for Hindi. Our dataset and framework provide valuable resources for researchers and practitioners working to combat hate speech in the Indian context.

引用

页码：22313 / 22321

页数：9

共 50 条

[41] Hate speech review in the context of online social networks
Chetty, Naganna
Alathur, Sreejith
AGGRESSION AND VIOLENT BEHAVIOR, 2018, 40 : 108 - 118
[42] The Content and Context of Hate Speech: Rethinking Regulation and Responses
Nyman-Metcalf, Katrin
INTERNATIONAL & COMPARATIVE LAW QUARTERLY, 2014, 63 (02) : 510 - 513
[43] HateDetector: Multilingual technique for the analysis and detection of online hate speech in social networks
Rahul Anjum
Multimedia Tools and Applications, 2024, 83 : 48021 - 48048
[44] Multilingual speech mode classification model for Indian languages
Tripathi, Kumud
Rao, K. Sreenivasa
2020 TWENTY SIXTH NATIONAL CONFERENCE ON COMMUNICATIONS (NCC 2020), 2020,
[45] From cancellation to dispositive: hate speech in the context of consumption
Hoff, Tania
Holtz, Ana Catarina
Fraga, Lucas L.
REVISTA COMUNICACAO MIDIATICA, 2022, 17 (02): : 44 - 56
[46] The Content and Context of Hate Speech: Rethinking Regulation and Responses
Neier, Aryeh
ICON-INTERNATIONAL JOURNAL OF CONSTITUTIONAL LAW, 2014, 12 (03): : 816 - 820
[47] Multilingual Hate Speech Detection: A Semi-Supervised Generative Adversarial Approach
Mnassri, Khouloud
Farahbakhsh, Reza
Crespi, Noel
ENTROPY, 2024, 26 (04)
[48] Multilingual Twitter Corpus and Baselines for Evaluating Demographic Bias in Hate Speech Recognition
Huang, Xiaolei
Xing, Linzi
Dernoncourt, Franck
Paul, Michael J.
PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 1440 - 1448
[49] Online Multilingual Hate Speech Detection: Experimenting with Hindi and English Social Media
Vashistha, Neeraj
Zubiaga, Arkaitz
INFORMATION, 2021, 12 (01) : 1 - 16
[50] HateDetector: Multilingual technique for the analysis and detection of online hate speech in social networks
Anjum
Katarya, Rahul
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (16) : 48021 - 48048

← 1 2 3 4 5 →