50 entries in total
- [41] Sources of Hallucination by Large Language Models on Inference Tasks. Findings of the Association for Computational Linguistics: EMNLP 2023, 2023: 2758-2774
- [44] Benchmarking Large Language Models in Retrieval-Augmented Generation. Thirty-Eighth AAAI Conference on Artificial Intelligence, Vol. 38 No. 16, 2024: 17754-17762
- [45] SEED-Bench: Benchmarking Multimodal Large Language Models. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024: 13299-13308
- [46] Quantifying Bias in Agentic Large Language Models: A Benchmarking Approach. 2024 5th Information Communication Technologies Conference, ICTC 2024, 2024: 349-353
- [48] Robustness of GPT Large Language Models on Natural Language Processing Tasks. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (05): 1128-1142