共 50 条
- [43] Evaluating the Reliability of Self-explanations in Large Language Models DISCOVERY SCIENCE, DS 2024, PT I, 2025, 15243 : 36 - 51
- [44] Evaluating Explanations for Software Patches Generated by Large Language Models SEARCH-BASED SOFTWARE ENGINEERING, SSBSE 2023, 2024, 14415 : 147 - 152
- [45] Evaluating Cognitive Maps and planning in Large Language Models with CogEval ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [46] Evaluating the Elementary Multilingual Capabilities of Large Language Models with MULTIQ FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 4476 - 4494
- [47] Evaluating the Efficacy of Large Language Models in Identifying Phishing Attempts 2024 16TH INTERNATIONAL CONFERENCE ON HUMAN SYSTEM INTERACTION, HSI 2024, 2024,
- [48] Evaluating Object Hallucination in Large Vision-Language Models 2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 292 - 305
- [50] ProAgent: Building Proactive Cooperative Agents with Large Language Models THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 17591 - 17599