Large Language Models for Tabular Data: Progresses and Future Directions

被引:0
|
作者
Dong, Haoyu [1 ]
Wang, Zhiruo [2 ]
机构
[1] Microsoft AI, Beijing, Peoples R China
[2] Carnegie Mellon Univ, Pittsburgh, PA USA
关键词
Tabular data; Large language models; Representation learning;
D O I
10.1145/3626772.3661384
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tables contain a significant portion of the world's structured information. The ability to efficiently and accurately understand, process, reason about, analyze, and generate tabular data is critical for achieving Artificial General Intelligence (AGI) systems. However, despite their prevalence and importance, tables present unique challenges due to their structured nature and the diverse semantics embedded within them. Textual content, numerical values, visual formats, and even formulas in tables carry rich semantic information that is often underutilized due to the complexity of accurately interpreting and integrating. Fortunately, the advent of Large Language Models (LLMs) has opened new frontiers in natural language processing (NLP) and machine learning (ML), showing remarkable success in understanding and generating text, code, etc. Applying these advanced models to the domain of tabular data holds the promise of significant breakthroughs in how we process and leverage structured information. Therefore, this tutorial aims to provide a comprehensive study of the advances, challenges, and opportunities in leveraging cutting-edge LLMs for tabular data. By introducing methods of prompting or training cutting-edge LLMs for table interpreting, processing, reasoning, analytics, and generation, we aim to equip researchers and practitioners with the knowledge and tools needed to unlock the full potential of LLMs for tabular data in their domains.
引用
收藏
页码:2997 / 3000
页数:4
相关论文
共 50 条
  • [21] Privacy Preservation of Large Language Models in the Metaverse Era: Research Frontiers, Categorical Comparisons, and Future Directions
    Huang, Dabin
    Ge, Mengyu
    Xiang, Kunlan
    Zhang, Xiaolei
    Yang, Haomiao
    INTERNATIONAL JOURNAL OF NETWORK MANAGEMENT, 2025, 35 (01)
  • [22] Federated Large Language Model: Solutions, Challenges and Future Directions
    Hu, Jiahui
    Wang, Dan
    Wang, Zhibo
    Pang, Xiaoyi
    Xu, Huiyu
    Ren, Ju
    Ren, Kui
    IEEE WIRELESS COMMUNICATIONS, 2024,
  • [23] P-TA: Using Proximal Policy Optimization to Enhance Tabular Data Augmentation via Large Language Models
    Yang, Shuo
    Yuan, Chenchen
    Rong, Yao
    Steinbauer, Felix
    Kasneci, Gjergji
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 248 - 264
  • [24] Exploring the Numerical Reasoning Capabilities of Language Models: A Comprehensive Analysis on Tabular Data
    Akhtar, Mubashara
    Shankarampeta, Abhilash
    Gupta, Vivek
    Patil, Arpit
    Cocarascul, Oana
    Simper, Elena
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 15391 - 15405
  • [25] Equipping Language Models with Tool Use Capability for Tabular Data Analysis in Finance
    Theuma, Adrian
    Shareghi, Ehsan
    PROCEEDINGS OF THE 18TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2: SHORT PAPERS, 2024, : 90 - 103
  • [26] Large Language Models for EDA: Future or Mirage?
    He, Zhuolun
    Yu, Bei
    PROCEEDINGS OF THE 2024 INTERNATIONAL SYMPOSIUM ON PHYSICAL DESIGN, ISPD 2024, 2024, : 65 - 66
  • [27] The future landscape of large language models in medicine
    Clusmann, Jan
    Kolbinger, Fiona R.
    Muti, Hannah Sophie
    Carrero, Zunamys I.
    Eckardt, Jan-Niklas
    Laleh, Narmin Ghaffari
    Loeffler, Chiara Maria Lavinia
    Schwarzkopf, Sophie-Caroline
    Unger, Michaela
    Veldhuizen, Gregory P.
    Wagner, Sophia J.
    Kather, Jakob Nikolas
    COMMUNICATIONS MEDICINE, 2023, 3 (01):
  • [28] Looking to Future Applications of Large Language Models
    Liu, Xichong
    Rubin, Samuel J. S.
    Rogalla, Stephan
    AMERICAN JOURNAL OF GASTROENTEROLOGY, 2023, 118 (12): : 2306 - 2306
  • [29] Large Language Models and the Future of Organization Theory
    Cornelissen, Joep
    Hollerer, Markus A.
    Boxenbaum, Eva
    Faraj, Samer
    Gehman, Joel
    ORGANIZATION THEORY, 2024, 5 (01):
  • [30] The future landscape of large language models in medicine
    Jan Clusmann
    Fiona R. Kolbinger
    Hannah Sophie Muti
    Zunamys I. Carrero
    Jan-Niklas Eckardt
    Narmin Ghaffari Laleh
    Chiara Maria Lavinia Löffler
    Sophie-Caroline Schwarzkopf
    Michaela Unger
    Gregory P. Veldhuizen
    Sophia J. Wagner
    Jakob Nikolas Kather
    Communications Medicine, 3