50 records in total
- [21] Benchmarking Deep Graph Models for Large Molecular Generation. 2022 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (IEEE BIGCOMP 2022), 2022: 114-120
- [24] SEED-Bench: Benchmarking Multimodal Large Language Models. 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024: 13299-13308
- [25] Quantifying Bias in Agentic Large Language Models: A Benchmarking Approach. 2024 5TH INFORMATION COMMUNICATION TECHNOLOGIES CONFERENCE, ICTC 2024, 2024: 349-353
- [28] Dissecting Dissonance: Benchmarking Large Multimodal Models Against Self-Contradictory Instructions. COMPUTER VISION-ECCV 2024, PT LVII, 2025, 15115: 404-420
- [29] Benchmarking Large Language Models on CFLUE - A Chinese Financial Language Understanding Evaluation Dataset. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024: 5673-5693