Relationalizing Tables with Large Language Models: The Promise and Challenges

被引：0

作者：

Huang, Zezhou ^{[1
]}

Wu, Eugene ^{[2
]}

机构：

[1] Columbia Univ, New York, NY 10027 USA

[2] Columbia Univ, DSI, New York, NY 10027 USA

来源：

2024 IEEE 40TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, ICDEW | 2024年

基金：

美国国家科学基金会;

关键词：

Large Language Model; Data Transformation; Prompt Engineering; Data Management;

D O I：

10.1109/ICDEW61823.2024.00045

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Tables in the wild are usually not relationalized, making querying them difficult. To relationalize tables, recent works designed seven transformation operators, and deep neural networks were adopted to automatically find the sequence of operators, achieving an accuracy of 57.0%. In comparison, earlier versions of large language models like GPT-3.5 only reached 13.1%. However, these results were obtained using naive prompts. Furthermore, GPT-4 is recently available, which is substantially larger and more performant. This study examines how the selection of models, specifically GPT-3.5 and GPT-4, and various prompting strategies, such as Chain-of-Thought and task decomposition, affect accuracy. The main finding is that GPT-4, combined with Task Decomposition and Chain-of-Thought, attains a remarkable accuracy of 74.6%. Further analysis of errors made by GPT-4 shows the challenges that about half of the errors are not due to the model's shortcomings, but rather to ambiguities in the benchmarks. When these benchmarks are disambiguated, GPT-4's accuracy improves to 86.9%.

引用

页码：305 / 309

页数：5

共 50 条

[1] Open-source large language models in medical education: Balancing promise and challenges
Ray, Partha Pratim
ANATOMICAL SCIENCES EDUCATION, 2024, 17 (06) : 1361 - 1362
[2] The promise of large language models in health care
Arora, Anmol
Arora, Ananya
LANCET, 2023, 401 (10377): : 641 - 642
[3] The promise of AI Large Language Models for Epilepsy care
Landais, Raphaelle
Sultan, Mustafa
Thomas, Rhys H.
EPILEPSY & BEHAVIOR, 2024, 154
[4] From promise to practice: challenges and pitfalls in the evaluation of large language models for data extraction in evidence synthesis
Gartlehner, Gerald
Kahwati, Leila
Nussbaumer-Streit, Barbara
Crotty, Karen
Hilscher, Rainer
Kugley, Shannon
Viswanathan, Meera
Thomas, Ian
Konet, Amanda
Booth, Graham
Chew, Robert
BMJ EVIDENCE-BASED MEDICINE, 2024,
[5] Art or Artifice? Large Language Models and the False Promise of Creativity
Chakrabarty, Tuhin
Laban, Philippe
Agarwal, Divyansh
Muresan, Smaranda
Wu, Chien-Sheng
PROCEEDINGS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYTEMS (CHI 2024), 2024,
[6] Benchmarking Large Language Models: Opportunities and Challenges
Hodak, Miro
Ellison, David
Van Buren, Chris
Jiang, Xiaotong
Dholakia, Ajay
PERFORMANCE EVALUATION AND BENCHMARKING, TPCTC 2023, 2024, 14247 : 77 - 89
[7] Ethical and Theological Challenges of Large Language Models
Strahornik, Vojko
BOGOSLOVNI VESTNIK-THEOLOGICAL QUARTERLY-EPHEMERIDES THEOLOGICAE, 2023, 83 (04): : 839 - 852
[8] MULTILINGUAL JAILBREAK CHALLENGES IN LARGE LANGUAGE MODELS
Deng, Yue
Zhang, Wenxuan
Pan, Sinno Jialin
Bing, Lidong
arXiv, 2023,
[9] Large language models in psychiatry: Opportunities and challenges
Volkmer, Sebastian
Meyer-Lindenberg, Andreas
Schwarz, Emanuel
PSYCHIATRY RESEARCH, 2024, 339
[10] Harnessing the potential of large language models in medical education: promise and pitfalls
Benitez, Trista M.
Xu, Yueyuan
Boudreau, J. Donald
Kow, Alfred Wei Chieh
Bello, Fernando
Phuoc, Le Van
Wang, Xiaofei
Sun, Xiaodong
Leung, Gilberto Ka-Kit
Lan, Yanyan
Wang, Yaxing
Cheng, Davy
Tham, Yih-Chung
Wong, Tien Yin
Chung, Kevin C.
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, 31 (03) : 776 - 783

← 1 2 3 4 5 →