共 50 条
- [2] Closing the gap between open source and commercial large language models for medical evidence summarization [J]. NPJ DIGITAL MEDICINE, 2024, 7 (01):
- [7] MedBench: A Large-Scale Chinese Benchmark for Evaluating Medical Large Language Models [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 17709 - 17717
- [8] Evaluating the Summarization Comprehension of Pre-Trained Language Models [J]. Lobachevskii Journal of Mathematics, 2023, 44 : 3028 - 3039