Effective Context Selection in LLM-Based Leaderboard Generation: An Empirical Study

被引:0
|
作者
Kabongo, Salomon [1 ]
D'Souza, Jennifer [2 ]
Auer, Soren [2 ]
机构
[1] Leibniz Univ Hannover, L3S Res Ctr, Hannover, Germany
[2] TIB Leibniz Informat Ctr Sci & Technol, Hannover, Germany
关键词
D O I
10.1007/978-3-031-70242-6_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper explores the impact of context selection on the efficiency of Large Language Models (LLMs) in generating Artificial Intelligence (AI) research leaderboards, a task defined as the extraction of (Task, Dataset, Metric, Score) quadruples from scholarly articles. By framing this challenge as a text generation objective and employing instruction finetuning with the FLAN-T5 collection, we introduce a novel method that surpasses traditional Natural Language Inference (NLI) approaches in adapting to new developments without a predefined taxonomy. Through experimentation with three distinct context types of varying selectivity and length, our study demonstrates the importance of effective context selection in enhancing LLM accuracy and reducing hallucinations, providing a new pathway for the reliable and efficient generation of AI leaderboards. This contribution not only advances the state of the art in leaderboard generation but also sheds light on strategies to mitigate common challenges in LLM-based information extraction.
引用
收藏
页码:150 / 160
页数:11
相关论文
共 50 条
  • [21] Towards an In-Context LLM-Based Approach for Automating the Definition of Model Views
    Miranda, James William Pontes
    Bruneliere, Hugo
    Tisi, Massimo
    Sunye, Gerson
    PROCEEDINGS OF THE 17TH ACM SIGPLAN INTERNATIONAL CONFERENCE ON SOFTWARE LANGUAGE ENGINEERING, SLE 2024, 2024, : 29 - 42
  • [22] LLM-based Processor Verification: A Case Study for Neuromorphic Processor
    Xiao, Chao
    Deng, Yifei
    Yang, Zhijie
    Chen, Renzhi
    Wang, Hong
    Zhao, Jingyue
    Dai, Huadong
    Wang, Lei
    Tang, Yuhua
    Xu, Weixia
    2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024,
  • [23] Topic-guided Example Selection for Domain Adaptation in LLM-based Machine Translation
    Aycock, Seth
    Bawden, Rachel
    PROCEEDINGS OF THE 18TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: STUDENT RESEARCH WORKSHOP, 2024, : 175 - 195
  • [24] LLM-Based Student Plan Generation for Adaptive Scaffolding in Game-Based Learning Environments
    Goslen, Alex
    Kim, Yeo Jin
    Rowe, Jonathan
    Lester, James
    INTERNATIONAL JOURNAL OF ARTIFICIAL INTELLIGENCE IN EDUCATION, 2024,
  • [25] TURSpider: A Turkish Text-to-SQL Dataset and LLM-Based Study
    Kanburoglu, Ali Bugra
    Tek, Faik Boray
    IEEE ACCESS, 2024, 12 : 169379 - 169387
  • [26] ECG: Augmenting Embedded Operating System Fuzzing via LLM-Based Corpus Generation
    Zhang, Qiang
    Shen, Yuheng
    Liu, Jianzhong
    Xu, Yiru
    Shi, Heyuan
    Jiang, Yu
    Chang, Wanli
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 43 (11) : 4238 - 4249
  • [27] Managing Linux servers with LLM-based AI agents: An empirical evaluation with GPT4
    Cao, Charles
    Wang, Feiyi
    Lindley, Lisa
    Wang, Zejiang
    MACHINE LEARNING WITH APPLICATIONS, 2024, 17
  • [28] GRACE: Empowering LLM-based software vulnerability detection with graph structure and in-context learning
    Lu, Guilong
    Ju, Xiaolin
    Chen, Xiang
    Pei, Wenlong
    Cai, Zhilong
    JOURNAL OF SYSTEMS AND SOFTWARE, 2024, 212
  • [29] Enhancing LLM-Based Coding Tools through Native Integration of IDE-Derived Static Context
    Li, Yichen
    Peng, Yun
    Huo, Yintong
    Lyu, Michael R.
    2024 INTERNATIONAL WORKSHOP ON LARGE LANGUAGE MODELS FOR CODE, LLM4CODE 2024, 2024, : 70 - 74
  • [30] LLM-based Multi-Level Knowledge Generation for Few-shot Knowledge Graph Completion
    Li, Qian
    Chen, Zhuo
    Ji, Cheng
    Jiang, Shiqi
    Li, Jianxin
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 2135 - 2143