A Retrieval-Augmented Framework for Tabular Interpretation with Large Language Model

被引:0
|
作者
Yan, Mengyi [1 ]
Rene, Weilong [2 ]
Wang, Yaoshu [2 ]
Li, Jianxin [1 ]
机构
[1] Beihang Univ, Beijing, Peoples R China
[2] Shenzhen Inst Comp Sci, Shenzhen, Peoples R China
关键词
D O I
10.1007/978-981-97-5779-4_23
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Relational tables on the web hold a vast amount of knowledge, and it is critical for machine learning models to capture the semantics of these tables such that the models can achieve good performance on table interpretation tasks, such as entity linking, column type annotation and relation extraction. However, it is very challenging for ML models to process a large amount of tables and/or retrieve inter-table context information from the tables. Instead, existing works usually rely on heavily engineered features, user-defined rules or pre-training corpus. In this work, we propose a unified Retrieval-Augmented Framework for tabular interpretation with Large language model (RAFL), a novel 2-step framework for addressing the table interpretation task. RAFL first adopts a graph-enhanced model to obtain the inter-table context information by retrieving schema-similar and topic-relevant tables from a large range of corpus; RAFL then conducts tabular interpretation learning by combining a light-weighted pre-ranking model with a re-ranking-based large language model. We verify the effectiveness of RAFL through extensive evaluations on 3 tabular interpretation tasks (including entity linking, column type annotation and relation extraction), where RAFL substantially outperforms existing methods on all tasks.
引用
收藏
页码:341 / 356
页数:16
相关论文
共 50 条
  • [21] Mapping Drug Terms via Integration of a Retrieval-Augmented Generation Algorithm with a Large Language Model
    Kimura, Eizen
    Kawakami, Yukinobu
    Inoue, Shingo
    Okajima, Ai
    HEALTHCARE INFORMATICS RESEARCH, 2024, 30 (04) : 355 - 363
  • [22] In-Context Retrieval-Augmented Language Models
    Ram, Ori
    Levine, Yoav
    Dalmedigos, Itay
    Muhlgay, Dor
    Shashua, Amnon
    Leyton-Brown, Kevin
    Shoham, Yoav
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2023, 11 : 1316 - 1331
  • [23] ReACC: A Retrieval-Augmented Code Completion Framework
    Lu, Shuai
    Duan, Nan
    Han, Hojae
    Guo, Daya
    Hwang, Seung-won
    Svyatkovskiy, Alexey
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 6227 - 6240
  • [24] ReACC: A Retrieval-Augmented Code Completion Framework
    Lu, Shuai
    Duan, Nan
    Han, Hojae
    Guo, Daya
    Hwang, Seung-Won
    Svyatkovskiy, Alexey
    Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2022, 1 : 6227 - 6240
  • [25] Integrating Graph Retrieval-Augmented Generation With Large Language Models for Supplier Discovery
    Li, Yunqing
    Ko, Hyunwoong
    Ameri, Farhad
    JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2025, 25 (02)
  • [26] RAVL: A Retrieval-Augmented Visual Language Model Framework for Knowledge-Based Visual Question Answering
    Chai, Naiquan
    Zou, Dongsheng
    Liu, Jiyuan
    Wang, Hao
    Yang, Yuming
    Song, Xinyi
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT III, NLPCC 2024, 2025, 15361 : 394 - 406
  • [27] Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Models
    Kim, Gangwoo
    Kim, Sungdong
    Jeon, Byeongguk
    Park, Joonsuk
    Kang, Jaewoo
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 996 - 1009
  • [28] SafetyRAG: Towards Safe Large Language Model-Based Application through Retrieval-Augmented Generation
    Omri, Sihem
    Abdelkader, Manel
    Hamdi, Mohamed
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2025, 16 (02) : 243 - 250
  • [29] Evaluation of the integration of retrieval-augmented generation in large language model for breast cancer nursing care responses
    Xu, Ruiyu
    Hong, Ying
    Zhang, Feifei
    Xu, Hongmei
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [30] Retrieval-augmented Recommender System: Enhancing Recommender Systems with Large Language Models
    Di Palma, Dario
    PROCEEDINGS OF THE 17TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2023, 2023, : 1369 - 1373