Human-in-the-Loop for Data Collection: a Multi-Target Counter Narrative Dataset to Fight Online Hate Speech

被引:0
|
作者
Fanton, Margherita [1 ,2 ]
Bonaldi, Helena [1 ,2 ]
Tekiroglu, Serra Sinem [2 ]
Guerini, Marco [2 ]
机构
[1] Univ Trento, Trento, TN, Italy
[2] Fdn Bruno Kessler, Via Sommar 18, Povo, Trento, Italy
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Undermining the impact of hateful content with informed and non-aggressive responses, called counter narratives, has emerged as a possible solution for having healthier online communities. Thus, some NLP studies have started addressing the task of counter narrative generation. Although such studies have made an effort to build hate speech / counter narrative (HS/CN) datasets for neural generation, they fall short in reaching either highquality and/or high-quantity. In this paper, we propose a novel human-in-the-loop data collection methodology in which a generative language model is refined iteratively by using its own data from the previous loops to generate new training samples that experts review and/or post-edit. Our experiments comprised several loops including dynamic variations. Results show that the methodology is scalable and facilitates diverse, novel, and cost-effective data collection. To our knowledge, the resulting dataset is the only expertbased multi-target HS/CN dataset available to the community.
引用
下载
收藏
页码:3226 / 3240
页数:15
相关论文
共 3 条
  • [1] CONAN - COunter NArratives through Nichesourcing: a Multilingual Dataset of Responses to Fight Online Hate Speech
    Chung, Yi-Ling
    Kuzmenko, Elizaveta
    Tekiroglu, Serra Sinem
    Guerini, Marco
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 2819 - 2829
  • [2] Adaptive human-in-the-loop multi-target recognition improved by learning
    Wu, Xuesong
    Wang, Chang
    Niu, Yifeng
    Hu, Xiaoping
    Fan, Chen
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2018, 15 (03):
  • [3] Human-in-the-loop online multi-agent approach to increase trustworthiness in ML models through trust scores and data augmentation
    Bravo-Rocca, Gusseppe
    Liu, Peini
    Guitart, Jordi
    Dholakia, Ajay
    Ellison, David
    Hodak, Miroslav
    2022 IEEE 46TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2022), 2022, : 32 - 37