共 50 条
- [41] Tree -of-Reasoning Question Decomposition for Complex Question Answering with Large Language Models [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 19560 - 19568
- [42] PlanBench: An Extensible Benchmark for Evaluating Large Language Models on Planning and Reasoning about Change [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [43] Sasha: Creative Goal-Oriented Reasoning in Smart Homes with Large Language Models [J]. PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2024, 8 (01):
- [44] MindMap: Constructing Evidence Chains for Multi-Step Reasoning in Large Language Models [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 19270 - 19278
- [45] CRASS: A Novel Data Set and Benchmark to Test Counterfactual Reasoning of Large Language Models [J]. LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 2126 - 2140
- [46] Testing the General Deductive Reasoning Capacity of Large Language Models Using OOD Examples [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [47] Quartet: A Holistic Hybrid Parallel Framework for Training Large Language Models [J]. EURO-PAR 2024: PARALLEL PROCESSING, PART II, EURO-PAR 2024, 2024, 14802 : 424 - 438
- [48] Memorization Without Overfitting: Analyzing the Training Dynamics of Large Language Models [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [50] Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202