Machine learning for question answering from tabular data

Cited by: 7
Authors
Khalid, Mahboob Alam [1]
Jijkoun, Valentin [1]
de Rijke, Maarten [1]
Affiliation
[1] Univ Amsterdam, ISLA, Kruislaan 403, NL-1098 SJ Amsterdam, Netherlands
DOI
10.1109/DEXA.2007.119
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Question Answering (QA) systems automatically answer natural language questions in a human-like manner. One of the practical approaches to open-domain QA consists of extracting facts from free text offline and using a lookup mechanism when answering users' questions online. This approach is related to natural language interfaces to databases (NLIDBs), which were studied extensively from the 1970s to the 1990s. NLIDB systems employed a range of techniques, from simple pattern-matching rules to formal logical calculi such as the lambda calculus, but most were restricted to specific domains. In this paper we describe a machine learning approach to querying tabular data for QA that is not restricted to specific domains. Our approach consists of two steps: for an incoming question, we first use a classifier to identify appropriate tables and columns in a structured database, and then employ free-text retrieval to look up answers. The system uses part-of-speech tagging, named-entity normalization and a statistical classifier trained on data from the TREC QA task. On the TREC QA data, our system significantly outperforms an existing rule-based table lookup method.
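The two-step approach described in the abstract (classify the question to a table and column, then retrieve the answer by free-text matching) can be illustrated with a minimal sketch. This is not the authors' system: the abstract does not specify the classifier or its features, so a bag-of-words Naive Bayes classifier stands in for the statistical classifier, simple token overlap stands in for free-text retrieval, and part-of-speech tagging and named-entity normalization are omitted. The tables, training questions, and all names (`TABLES`, `TRAIN`, `NaiveBayes`, `lookup`, `answer`) are hypothetical.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


# Hypothetical toy database: table name -> list of rows (column -> value).
TABLES = {
    "capitals": [
        {"country": "France", "capital": "Paris"},
        {"country": "Japan", "capital": "Tokyo"},
    ],
    "inventions": [
        {"invention": "telephone", "inventor": "Alexander Graham Bell"},
        {"invention": "dynamite", "inventor": "Alfred Nobel"},
    ],
}

# Hypothetical training questions, each labeled with (table, answer column).
TRAIN = [
    ("What is the capital of Germany?", ("capitals", "capital")),
    ("Which city is the capital of Italy?", ("capitals", "capital")),
    ("Who invented the light bulb?", ("inventions", "inventor")),
    ("Who was the inventor of the radio?", ("inventions", "inventor")),
]


class NaiveBayes:
    """Multinomial Naive Bayes over question tokens (assumed stand-in)."""

    def __init__(self, examples):
        self.word_counts = defaultdict(Counter)
        self.label_counts = Counter()
        self.vocab = set()
        for text, label in examples:
            tokens = tokenize(text)
            self.label_counts[label] += 1
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)

    def predict(self, text):
        tokens = tokenize(text)
        total = sum(self.label_counts.values())

        def score(label):
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            # Log prior plus add-one smoothed log likelihood of each token.
            s = math.log(self.label_counts[label] / total)
            for t in tokens:
                s += math.log((counts[t] + 1) / denom)
            return s

        return max(self.label_counts, key=score)


def lookup(question, table, column):
    """Step 2: pick the row whose other columns share most tokens with the question."""
    q = set(tokenize(question))

    def overlap(row):
        other = " ".join(v for k, v in row.items() if k != column)
        return len(q & set(tokenize(other)))

    best = max(TABLES[table], key=overlap)
    return best[column]


def answer(question, clf):
    """Step 1: classify to (table, column); step 2: look up the answer."""
    table, column = clf.predict(question)
    return lookup(question, table, column)
```

With `clf = NaiveBayes(TRAIN)`, calling `answer("What is the capital of Japan?", clf)` routes the question to the `capital` column of the `capitals` table and then selects the row that mentions Japan. A real system would need far more training data and a proper retrieval engine; the sketch only shows how the two steps compose.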
Pages: 392 / +
Page count: 2