Detoxifying Language Models with a Toxic Corpus

Authors
Park, Yoon A. [1, 2]
Rudzicz, Frank [1, 2, 3]
Affiliations
[1] Univ Toronto, Toronto, ON, Canada
[2] Vector Inst Artificial Intelligence, Toronto, ON, Canada
[3] Unity Hlth Toronto, Toronto, ON, Canada
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Existing studies have investigated the tendency of autoregressive language models to generate contexts that exhibit undesired biases and toxicity. Various debiasing approaches have been proposed, which fall primarily into two categories: data-based and decoding-based. In our study, we investigate an ensemble of the two debiasing paradigms, proposing to use a toxic corpus as an additional resource to reduce toxicity. Our results show that a toxic corpus can indeed help to substantially reduce the toxicity of the language generation process, complementing existing debiasing methods.
Pages: 41-46
Page count: 6