Human-in-the-Loop for Data Collection: a Multi-Target Counter Narrative Dataset to Fight Online Hate Speech

被引：0

作者：

Fanton, Margherita ^{[1
,2
]}

Bonaldi, Helena ^{[1
,2
]}

Tekiroglu, Serra Sinem ^{[2
]}

Guerini, Marco ^{[2
]}

机构：

[1] Univ Trento, Trento, TN, Italy

[2] Fdn Bruno Kessler, Via Sommar 18, Povo, Trento, Italy

来源：

59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1 | 2021年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Undermining the impact of hateful content with informed and non-aggressive responses, called counter narratives, has emerged as a possible solution for having healthier online communities. Thus, some NLP studies have started addressing the task of counter narrative generation. Although such studies have made an effort to build hate speech / counter narrative (HS/CN) datasets for neural generation, they fall short in reaching either highquality and/or high-quantity. In this paper, we propose a novel human-in-the-loop data collection methodology in which a generative language model is refined iteratively by using its own data from the previous loops to generate new training samples that experts review and/or post-edit. Our experiments comprised several loops including dynamic variations. Results show that the methodology is scalable and facilitates diverse, novel, and cost-effective data collection. To our knowledge, the resulting dataset is the only expertbased multi-target HS/CN dataset available to the community.

引用

下载

页码：3226 / 3240

页数：15

共 3 条

[1] CONAN - COunter NArratives through Nichesourcing: a Multilingual Dataset of Responses to Fight Online Hate Speech
Chung, Yi-Ling
Kuzmenko, Elizaveta
Tekiroglu, Serra Sinem
Guerini, Marco
57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 2819 - 2829
[2] Adaptive human-in-the-loop multi-target recognition improved by learning
Wu, Xuesong
Wang, Chang
Niu, Yifeng
Hu, Xiaoping
Fan, Chen
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2018, 15 (03):
[3] Human-in-the-loop online multi-agent approach to increase trustworthiness in ML models through trust scores and data augmentation
Bravo-Rocca, Gusseppe
Liu, Peini
Guitart, Jordi
Dholakia, Ajay
Ellison, David
Hodak, Miroslav
2022 IEEE 46TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2022), 2022, : 32 - 37

← 1 →