Effective Context Selection in LLM-Based Leaderboard Generation: An Empirical Study

被引：0

作者：

Kabongo, Salomon ^{[1
]}

D'Souza, Jennifer ^{[2
]}

Auer, Soren ^{[2
]}

机构：

[1] Leibniz Univ Hannover, L3S Res Ctr, Hannover, Germany

[2] TIB Leibniz Informat Ctr Sci & Technol, Hannover, Germany

来源：

NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PT II, NLDB 2024 | 2024年 / 14763卷

关键词：

D O I：

10.1007/978-3-031-70242-6_15

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper explores the impact of context selection on the efficiency of Large Language Models (LLMs) in generating Artificial Intelligence (AI) research leaderboards, a task defined as the extraction of (Task, Dataset, Metric, Score) quadruples from scholarly articles. By framing this challenge as a text generation objective and employing instruction finetuning with the FLAN-T5 collection, we introduce a novel method that surpasses traditional Natural Language Inference (NLI) approaches in adapting to new developments without a predefined taxonomy. Through experimentation with three distinct context types of varying selectivity and length, our study demonstrates the importance of effective context selection in enhancing LLM accuracy and reducing hallucinations, providing a new pathway for the reliable and efficient generation of AI leaderboards. This contribution not only advances the state of the art in leaderboard generation but also sheds light on strategies to mitigate common challenges in LLM-based information extraction.

引用

页码：150 / 160

页数：11

共 50 条

[21] Towards an In-Context LLM-Based Approach for Automating the Definition of Model Views
Miranda, James William Pontes
Bruneliere, Hugo
Tisi, Massimo
Sunye, Gerson
PROCEEDINGS OF THE 17TH ACM SIGPLAN INTERNATIONAL CONFERENCE ON SOFTWARE LANGUAGE ENGINEERING, SLE 2024, 2024, : 29 - 42
[22] LLM-based Processor Verification: A Case Study for Neuromorphic Processor
Xiao, Chao
Deng, Yifei
Yang, Zhijie
Chen, Renzhi
Wang, Hong
Zhao, Jingyue
Dai, Huadong
Wang, Lei
Tang, Yuhua
Xu, Weixia
2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024,
[23] Topic-guided Example Selection for Domain Adaptation in LLM-based Machine Translation
Aycock, Seth
Bawden, Rachel
PROCEEDINGS OF THE 18TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: STUDENT RESEARCH WORKSHOP, 2024, : 175 - 195
[24] LLM-Based Student Plan Generation for Adaptive Scaffolding in Game-Based Learning Environments
Goslen, Alex
Kim, Yeo Jin
Rowe, Jonathan
Lester, James
INTERNATIONAL JOURNAL OF ARTIFICIAL INTELLIGENCE IN EDUCATION, 2024,
[25] TURSpider: A Turkish Text-to-SQL Dataset and LLM-Based Study
Kanburoglu, Ali Bugra
Tek, Faik Boray
IEEE ACCESS, 2024, 12 : 169379 - 169387
[26] ECG: Augmenting Embedded Operating System Fuzzing via LLM-Based Corpus Generation
Zhang, Qiang
Shen, Yuheng
Liu, Jianzhong
Xu, Yiru
Shi, Heyuan
Jiang, Yu
Chang, Wanli
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 43 (11) : 4238 - 4249
[27] Managing Linux servers with LLM-based AI agents: An empirical evaluation with GPT4
Cao, Charles
Wang, Feiyi
Lindley, Lisa
Wang, Zejiang
MACHINE LEARNING WITH APPLICATIONS, 2024, 17
[28] GRACE: Empowering LLM-based software vulnerability detection with graph structure and in-context learning
Lu, Guilong
Ju, Xiaolin
Chen, Xiang
Pei, Wenlong
Cai, Zhilong
JOURNAL OF SYSTEMS AND SOFTWARE, 2024, 212
[29] Enhancing LLM-Based Coding Tools through Native Integration of IDE-Derived Static Context
Li, Yichen
Peng, Yun
Huo, Yintong
Lyu, Michael R.
2024 INTERNATIONAL WORKSHOP ON LARGE LANGUAGE MODELS FOR CODE, LLM4CODE 2024, 2024, : 70 - 74
[30] LLM-based Multi-Level Knowledge Generation for Few-shot Knowledge Graph Completion
Li, Qian
Chen, Zhuo
Ji, Cheng
Jiang, Shiqi
Li, Jianxin
PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 2135 - 2143

← 1 2 3 4 5 →