共 50 条
- [31] BLESS: Benchmarking Large Language Models on Sentence Simplification 2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 13291 - 13309
- [32] IdSarcasm: Benchmarking and Evaluating Language Models for Indonesian Sarcasm Detection IEEE ACCESS, 2024, 12 : 87323 - 87332
- [33] TRAM: Benchmarking Temporal Reasoning for Large Language Models FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 6389 - 6415
- [34] Evaluating the capabilities of large language models using machine learning tasks at inference-time Elektrotehniski Vestnik/Electrotechnical Review, 2023, 90 (05): : 247 - 253
- [35] Evaluating the capabilities of large language models using machine learning tasks at inference-time ELEKTROTEHNISKI VESTNIK, 2023, 90 (05): : 247 - 253
- [36] Geospatial Monitoring and Structural Mechanics Models: a Case Study of Sports Structures 11TH INTERNATIONAL CONFERENCE ENVIRONMENTAL ENGINEERING (11TH ICEE), 2020,
- [37] Adopting Pre-trained Large Language Models for Regional Language Tasks: A Case Study INTELLIGENT HUMAN COMPUTER INTERACTION, IHCI 2023, PT I, 2024, 14531 : 15 - 25
- [38] Benchmarking Transformers-based models on French Spoken Language Understanding tasks INTERSPEECH 2022, 2022, : 1238 - 1242