AutoTQA: Towards Autonomous Tabular Question Answering through Multi-Agent Large Language Models

被引:0
|
作者
Zhu, Jun-Peng [1 ,2 ]
Cai, Peng [1 ]
Xu, Kai [2 ]
Li, Li [2 ]
Sun, Yishen [2 ]
Zhou, Shuai [2 ]
Su, Haihuang [2 ]
Tang, Liu [2 ]
Liu, Qi [2 ]
机构
[1] East China Normal Univ, Shanghai, Peoples R China
[2] PingCAP, Beijing, Peoples R China
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2024年 / 17卷 / 12期
基金
中国国家自然科学基金;
关键词
D O I
10.14778/3685800.3685816
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the growing significance of data analysis, several studies aim to provide precise answers to users' natural language questions from tables, a task referred to as tabular question answering (TQA). The state-of-the-art TQA approaches are limited to handling only single-table questions. However, real-world TQA problems are inherently complex and frequently involve multiple tables, which poses challenges in directly extending single-table TQA designs to handle multiple tables, primarily due to the limited extensibility of the majority of single-table TQA methods. This paper proposes AutoTQA, a novel Autonomous Tabular Question Answering framework that employs multi-agent large language models (LLMs) across multiple tables from various systems (e.g., TiDB, BigQuery). AutoTQA comprises five agents: the User, responsible for receiving the user's natural language inquiry; the Planner, tasked with creating an execution plan for the user's inquiry; the Engineer, responsible for executing the plan step-by-step; the Executor, provides various execution environments (e.g., text-to-SQL) to fulfill specific tasks assigned by the Engineer; and the Critic, responsible for judging whether to complete the user's natural language inquiry and identifying gaps between the current results and initial tasks. To facilitate the interaction between different agents, we have also devised agent scheduling algorithms. Furthermore, we have developed LinguFlow, an open-source, low-code visual programming tool, to quickly build and debug LLM-based applications, and to accelerate the creation of various external tools and execution environments. We also implemented a series of data connectors, which allows AutoTQA to access various tables from multiple systems. Extensive experiments show that AutoTQA delivers outstanding performance on four representative datasets.
引用
收藏
页码:3920 / 3933
页数:14
相关论文
共 50 条
  • [21] A Formal Multi-Agent Language for Cooperative Autonomous Driving Scenarios
    Witsch, Andreas
    Opfer, Stephan
    Geihs, Kurt
    2014 INTERNATIONAL CONFERENCE ON CONNECTED VEHICLES AND EXPO (ICCVE), 2014, : 546 - 551
  • [22] EvaAI: A Multi-agent Framework Leveraging Large Language Models for Enhanced Automated Grading
    Lagakis, Paraskevas
    Demetriadis, Stavros
    GENERATIVE INTELLIGENCE AND INTELLIGENT TUTORING SYSTEMS, PT I, ITS 2024, 2024, 14798 : 378 - 385
  • [23] Evaluating the Adaptability of Large Language Models for Knowledge-aware Question and Answering
    Thakkar, Jay
    Kolekar, Suresh
    Gite, Shilpa
    Pradhan, Biswajeet
    Alamri, Abdullah
    INTERNATIONAL JOURNAL ON SMART SENSING AND INTELLIGENT SYSTEMS, 2024, 17 (01):
  • [24] Evaluating Open-Domain Question Answering in the Era of Large Language Models
    Kamalloo, Ehsan
    Dziri, Nouha
    Clarke, Charles L. A.
    Rafiei, Davood
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 5591 - 5606
  • [25] A medical question answering system using large language models and knowledge graphs
    Guo, Quan
    Cao, Shuai
    Yi, Zhang
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (11) : 8548 - 8564
  • [26] Toward expert-level medical question answering with large language models
    Singhal, Karan
    Tu, Tao
    Gottweis, Juraj
    Sayres, Rory
    Wulczyn, Ellery
    Amin, Mohamed
    Hou, Le
    Clark, Kevin
    Pfohl, Stephen R.
    Cole-Lewis, Heather
    Neal, Darlene
    Rashid, Qazi Mamunur
    Schaekermann, Mike
    Wang, Amy
    Dash, Dev
    Chen, Jonathan H.
    Shah, Nigam H.
    Lachgar, Sami
    Mansfield, Philip Andrew
    Prakash, Sushant
    Green, Bradley
    Dominowska, Ewa
    Aguera y Arcas, Blaise
    Tomasev, Nenad
    Liu, Yun
    Wong, Renee
    Semturs, Christopher
    Mahdavi, S. Sara
    Barral, Joelle K.
    Webster, Dale R.
    Corrado, Greg S.
    Matias, Yossi
    Azizi, Shekoofeh
    Karthikesalingam, Alan
    Natarajan, Vivek
    NATURE MEDICINE, 2025, : 943 - 950
  • [27] Open-Domain Question Answering over Tables with Large Language Models
    Liang, Xinyi
    Hu, Rui
    Liu, Yu
    Zhu, Konglin
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XII, ICIC 2024, 2024, 14873 : 347 - 358
  • [28] UniGen: A Unified Generative Framework for Retrieval and Question Answering with Large Language Models
    Li, Xiaoxi
    Zhou, Yujia
    Dou, Zhicheng
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 8, 2024, : 8688 - 8696
  • [29] Large Language Models for Scientific Question Answering: An Extensive Analysis of the SciQA Benchmark
    Lehmann, Jens
    Meloni, Antonello
    Motta, Enrico
    Osborne, Francesco
    Recupero, Diego Reforgiato
    Salatino, Angelo Antonio
    Vandati, Sahar
    SEMANTIC WEB, PT I, ESWC 2024, 2024, 14664 : 199 - 217
  • [30] Assessing and Optimizing Large Language Models on Spondyloarthritis Multi-Choice Question Answering: Protocol for Enhancement and Assessment
    Wang, Anan
    Wu, Yunong
    Ji, Xiaojian
    Wang, Xiangyang
    Hu, Jiawen
    Zhang, Fazhan
    Zhang, Zhanchao
    Pu, Dong
    Tang, Lulu
    Ma, Shikui
    Liu, Qiang
    Dong, Jing
    He, Kunlun
    Li, Kunpeng
    Teng, Da
    Li, Tao
    JMIR RESEARCH PROTOCOLS, 2024, 13