50 items in total
- [11] Statistical Analysis of Multilingual Text Corpus and Development of Language Models. LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014: 2436-2440
- [12] Corpus-Steered Query Expansion with Large Language Models. PROCEEDINGS OF THE 18TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2: SHORT PAPERS, 2024: 393-401
- [13] Exploring the Impact of Corpus Diversity on Financial Pretrained Language Models. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023: 2101-2112
- [14] Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language Models. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022
- [15] A Warm Start and a Clean Crawled Corpus - A Recipe for Good Language Models. LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022: 4356-4366
- [17] MIL-Decoding: Detoxifying Language Models at Token-Level via Multiple Instance Learning. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023: 190-202
- [18] Unveiling Toxic Tendencies of Small Language Models in Unconstrained Generation Tasks. 10TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTING AND COMMUNICATION TECHNOLOGIES, CONECCT 2024, 2024
- [19] Efficient Toxic Content Detection by Bootstrapping and Distilling Large Language Models. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 19, 2024: 21779-21787
- [20] Probing Toxic Content in Large Pre-Trained Language Models. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021: 4262-4274