Large Language Models for Tabular Data: Progresses and Future Directions

被引:0
|
作者
Dong, Haoyu [1 ]
Wang, Zhiruo [2 ]
机构
[1] Microsoft AI, Beijing, Peoples R China
[2] Carnegie Mellon Univ, Pittsburgh, PA USA
关键词
Tabular data; Large language models; Representation learning;
D O I
10.1145/3626772.3661384
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tables contain a significant portion of the world's structured information. The ability to efficiently and accurately understand, process, reason about, analyze, and generate tabular data is critical for achieving Artificial General Intelligence (AGI) systems. However, despite their prevalence and importance, tables present unique challenges due to their structured nature and the diverse semantics embedded within them. Textual content, numerical values, visual formats, and even formulas in tables carry rich semantic information that is often underutilized due to the complexity of accurately interpreting and integrating. Fortunately, the advent of Large Language Models (LLMs) has opened new frontiers in natural language processing (NLP) and machine learning (ML), showing remarkable success in understanding and generating text, code, etc. Applying these advanced models to the domain of tabular data holds the promise of significant breakthroughs in how we process and leverage structured information. Therefore, this tutorial aims to provide a comprehensive study of the advances, challenges, and opportunities in leveraging cutting-edge LLMs for tabular data. By introducing methods of prompting or training cutting-edge LLMs for table interpreting, processing, reasoning, analytics, and generation, we aim to equip researchers and practitioners with the knowledge and tools needed to unlock the full potential of LLMs for tabular data in their domains.
引用
收藏
页码:2997 / 3000
页数:4
相关论文
共 50 条
  • [1] Large Language Models for Recommendation: Progresses and Future Directions
    Bao, Keqin
    Zhang, Jizhi
    Zhang, Yang
    Wang, Wenjie
    Feng, Fuli
    He, Xiangnan
    ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL IN THE ASIA PACIFIC REGION, SIGIR-AP 2023, 2023, : 306 - 309
  • [2] TabLLM: Few-shot Classification of Tabular Data with Large Language Models
    Hegselmann, Stefan
    Buendia, Alejandro
    Lang, Hunter
    Agrawal, Monica
    Jiang, Xiaoyi
    Sontag, David
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206
  • [3] Efficient anomaly detection in tabular cybersecurity data using large language models
    Zhao, Xiaoyong
    Leng, Xingxin
    Wang, Lei
    Wang, Ningning
    Liu, Yanqiong
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [4] Large Language Models in Medical Education: Opportunities, Challenges, and Future Directions
    Abd-alrazaq, Alaa
    AlSaad, Rawan
    Alhuwail, Dari
    Ahmed, Arfan
    Healy, Padraig Mark
    Latifi, Syed
    Aziz, Sarah
    Damseh, Rafat
    Alrazak, Sadam Alabed
    Sheikh, Javaid
    JMIR MEDICAL EDUCATION, 2023, 9
  • [5] Large Language Models for Mathematical Reasoning: Progresses and Challenges
    Ahn, Janice
    Verma, Rishu
    Lou, Renze
    Zhang, Rui
    Yin, Wenpeng
    PROCEEDINGS OF THE 18TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: STUDENT RESEARCH WORKSHOP, 2024, : 225 - 237
  • [6] Directions Towards Efficient and Automated Data Wrangling with Large Language Models
    Zhang, Zeyu
    Groth, Paul
    Calixto, Iacer
    Schelter, Sebastian
    2024 IEEE 40TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, ICDEW, 2024, : 301 - 304
  • [7] Large language models in biomedicine and health: current research landscape and future directions
    Lu, Zhiyong
    Peng, Yifan
    Cohen, Trevor
    Ghassemi, Marzyeh
    Weng, Chunhua
    Tian, Shubo
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, 31 (09) : 1801 - 1811
  • [8] Model Checking Using Large Language Models-Evaluation and Future Directions
    Batsakis, Sotiris
    Tachmazidis, Ilias
    Mantle, Matthew
    Papadakis, Nikolaos
    Antoniou, Grigoris
    ELECTRONICS, 2025, 14 (02):
  • [9] The implementation solution for automatic visualization of tabular data in relational databases based on large language models
    Yang, Hao
    Yang, Zhaoyong
    Zhao, Ruyang
    Li, Xiaoran
    Rao, Gaoqi
    2024 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, IALP 2024, 2024, : 175 - 180
  • [10] Research on Fine-Tuning Optimization Strategies for Large Language Models in Tabular Data Processing
    Zhao, Xiaoyong
    Leng, Xingxin
    Wang, Lei
    Wang, Ningning
    BIOMIMETICS, 2024, 9 (11)