Relationalizing Tables with Large Language Models: The Promise and Challenges

被引:0
|
作者
Huang, Zezhou [1 ]
Wu, Eugene [2 ]
机构
[1] Columbia Univ, New York, NY 10027 USA
[2] Columbia Univ, DSI, New York, NY 10027 USA
基金
美国国家科学基金会;
关键词
Large Language Model; Data Transformation; Prompt Engineering; Data Management;
D O I
10.1109/ICDEW61823.2024.00045
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tables in the wild are usually not relationalized, making querying them difficult. To relationalize tables, recent works designed seven transformation operators, and deep neural networks were adopted to automatically find the sequence of operators, achieving an accuracy of 57.0%. In comparison, earlier versions of large language models like GPT-3.5 only reached 13.1%. However, these results were obtained using naive prompts. Furthermore, GPT-4 is recently available, which is substantially larger and more performant. This study examines how the selection of models, specifically GPT-3.5 and GPT-4, and various prompting strategies, such as Chain-of-Thought and task decomposition, affect accuracy. The main finding is that GPT-4, combined with Task Decomposition and Chain-of-Thought, attains a remarkable accuracy of 74.6%. Further analysis of errors made by GPT-4 shows the challenges that about half of the errors are not due to the model's shortcomings, but rather to ambiguities in the benchmarks. When these benchmarks are disambiguated, GPT-4's accuracy improves to 86.9%.
引用
收藏
页码:305 / 309
页数:5
相关论文
共 50 条
  • [1] Open-source large language models in medical education: Balancing promise and challenges
    Ray, Partha Pratim
    [J]. ANATOMICAL SCIENCES EDUCATION, 2024, 17 (06) : 1361 - 1362
  • [2] The promise of large language models in health care
    Arora, Anmol
    Arora, Ananya
    [J]. LANCET, 2023, 401 (10377): : 641 - 642
  • [3] The promise of AI Large Language Models for Epilepsy care
    Landais, Raphaelle
    Sultan, Mustafa
    Thomas, Rhys H.
    [J]. EPILEPSY & BEHAVIOR, 2024, 154
  • [4] Art or Artifice? Large Language Models and the False Promise of Creativity
    Chakrabarty, Tuhin
    Laban, Philippe
    Agarwal, Divyansh
    Muresan, Smaranda
    Wu, Chien-Sheng
    [J]. PROCEEDINGS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYTEMS (CHI 2024), 2024,
  • [5] Large language models in psychiatry: Opportunities and challenges
    Volkmer, Sebastian
    Meyer-Lindenberg, Andreas
    Schwarz, Emanuel
    [J]. PSYCHIATRY RESEARCH, 2024, 339
  • [6] Ethical and Theological Challenges of Large Language Models
    Strahornik, Vojko
    [J]. BOGOSLOVNI VESTNIK-THEOLOGICAL QUARTERLY-EPHEMERIDES THEOLOGICAE, 2023, 83 (04): : 839 - 852
  • [7] MULTILINGUAL JAILBREAK CHALLENGES IN LARGE LANGUAGE MODELS
    Deng, Yue
    Zhang, Wenxuan
    Pan, Sinno Jialin
    Bing, Lidong
    [J]. arXiv, 2023,
  • [8] Harnessing the potential of large language models in medical education: promise and pitfalls
    Benitez, Trista M.
    Xu, Yueyuan
    Boudreau, J. Donald
    Kow, Alfred Wei Chieh
    Bello, Fernando
    Phuoc, Le Van
    Wang, Xiaofei
    Sun, Xiaodong
    Leung, Gilberto Ka-Kit
    Lan, Yanyan
    Wang, Yaxing
    Cheng, Davy
    Tham, Yih-Chung
    Wong, Tien Yin
    Chung, Kevin C.
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, 31 (03) : 776 - 783
  • [9] Promise and Perils of Large Language Models for Cancer Survivorship and Supportive Care
    Bitterman, Danielle S.
    Downing, Andrea
    Maues, Julia
    Lustberg, Maryam
    [J]. JOURNAL OF CLINICAL ONCOLOGY, 2024, 42 (14)
  • [10] ChatGPT and large language models in academia: opportunities and challenges
    Jesse G. Meyer
    Ryan J. Urbanowicz
    Patrick C. N. Martin
    Karen O’Connor
    Ruowang Li
    Pei-Chen Peng
    Tiffani J. Bright
    Nicholas Tatonetti
    Kyoung Jae Won
    Graciela Gonzalez-Hernandez
    Jason H. Moore
    [J]. BioData Mining, 16