A Retrieval-Augmented Framework for Tabular Interpretation with Large Language Model

Cited by: 0
Authors
Yan, Mengyi [1]
Rene, Weilong [2]
Wang, Yaoshu [2]
Li, Jianxin [1]
Affiliations
[1] Beihang Univ, Beijing, Peoples R China
[2] Shenzhen Inst Comp Sci, Shenzhen, Peoples R China
DOI
10.1007/978-981-97-5779-4_23
Chinese Library Classification number
TP31 [Computer software];
Discipline classification codes
081202; 0835;
Abstract
Relational tables on the web hold a vast amount of knowledge, and machine learning models must capture the semantics of these tables to perform well on table interpretation tasks such as entity linking, column type annotation, and relation extraction. However, it is very challenging for ML models to process a large number of tables and/or retrieve inter-table context information from them. Existing works instead usually rely on heavily engineered features, user-defined rules, or pre-training corpora. In this work, we propose a unified Retrieval-Augmented Framework for tabular interpretation with Large language model (RAFL), a novel two-step framework for the table interpretation task. RAFL first adopts a graph-enhanced model to obtain inter-table context information by retrieving schema-similar and topic-relevant tables from a large corpus; it then conducts tabular interpretation learning by combining a lightweight pre-ranking model with a re-ranking-based large language model. We verify the effectiveness of RAFL through extensive evaluations on three tabular interpretation tasks (entity linking, column type annotation, and relation extraction), where RAFL substantially outperforms existing methods on all tasks.
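The retrieve, pre-rank, re-rank flow described in the abstract can be illustrated with a toy sketch for column type annotation. This is not the authors' implementation, and every name in it is hypothetical. Header Jaccard overlap stands in for the graph-enhanced retriever, a cell-overlap scorer stands in for the lightweight pre-ranking model, and a caller-supplied function stands in for the large language model re-ranker.

```python
from typing import Callable

# Assumption: a table is a mapping from column header to its cell values.
Table = dict[str, list[str]]


def jaccard(a: set[str], b: set[str]) -> float:
    """Jaccard similarity of two sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def retrieve_similar_tables(query: Table, corpus: list[Table], k: int) -> list[Table]:
    """Step 1 (stand-in retriever): keep the k tables whose headers overlap most."""
    headers = set(query)
    ranked = sorted(corpus, key=lambda t: jaccard(headers, set(t)), reverse=True)
    return ranked[:k]


def pre_rank(values: list[str], context: list[Table],
             labels: list[str], top_n: int) -> list[str]:
    """Step 2a (stand-in pre-ranker): score each label by cell overlap between
    the target column and same-named columns in the retrieved tables."""
    cells = set(values)

    def score(label: str) -> float:
        return sum(jaccard(cells, set(t.get(label, []))) for t in context)

    return sorted(labels, key=score, reverse=True)[:top_n]


def annotate_column(query: Table, target: str, corpus: list[Table],
                    labels: list[str],
                    rerank: Callable[[list[str], list[str]], str],
                    k: int = 2, top_n: int = 2) -> str:
    """Retrieve context, shortlist labels cheaply, then let `rerank` decide.
    `rerank` is a placeholder for a call to a large language model."""
    context = retrieve_similar_tables(query, corpus, k)
    shortlist = pre_rank(query[target], context, labels, top_n)
    return rerank(query[target], shortlist)


# Toy data, invented for illustration only.
corpus = [
    {"city": ["Paris", "Berlin"], "country": ["France", "Germany"]},
    {"player": ["Messi"], "team": ["Inter Miami"]},
    {"city": ["Rome", "Paris"], "population": ["2.8M", "2.1M"]},
]
query = {
    "name": ["Paris", "Rome"],
    "country": ["France", "Italy"],
    "population": ["2.1M", "2.8M"],
}
labels = ["player", "city", "country"]

# Placeholder re-ranker: simply trusts the top pre-ranked candidate.
result = annotate_column(query, "name", corpus, labels,
                         rerank=lambda values, shortlist: shortlist[0])
print(result)
```

The point of the split is cost: the cheap scorer narrows the label set so that the expensive re-ranker only sees a short candidate list.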
Pages: 341-356
Page count: 16