共 50 条
- [33] Multi-Grained Topological Pre-Training of Language Models in Sponsored Search PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 2189 - 2193
- [34] Evaluation of pre-training large language models on leadership-class supercomputers JOURNAL OF SUPERCOMPUTING, 2023, 79 (18): : 20747 - 20768
- [37] Length-Based Curriculum Learning for Efficient Pre-training of Language Models New Generation Computing, 2023, 41 : 109 - 134
- [38] Task-adaptive Pre-training of Language Models withWord Embedding Regularization FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 4546 - 4553
- [39] INGENIOUS: Using Informative Data Subsets for Efficient Pre-Training of Language Models FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 6690 - 6705