Bridging the Semantic Gap with SQL Query Logs in Natural Language Interfaces to Databases

Cited by: 32
Authors
Baik, Christopher [1]
Jagadish, H. V. [1]
Li, Yunyao [2]
Affiliations
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] IBM Res Almaden, San Jose, CA USA
Keywords
SYSTEM;
DOI
10.1109/ICDE.2019.00041
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
A critical challenge in constructing a natural language interface to a database (NLIDB) is bridging the semantic gap between a natural language query (NLQ) and the underlying data. Two specific ways in which this challenge manifests are keyword mapping and join path inference. Keyword mapping is the task of mapping individual keywords in the original NLQ to database elements (such as relations, attributes, or values). It is challenging because of the ambiguity in mapping the user's mental model and diction to the schema definition and contents of the underlying database. Join path inference is the process of selecting the relations and join conditions in the FROM clause of the final SQL query. It is difficult because NLIDB users lack knowledge of the database schema or of SQL, and therefore cannot explicitly specify the intermediate tables and joins needed to construct the final SQL query. In this paper, we propose leveraging information from the SQL query log of a database to enhance the performance of existing NLIDBs with respect to these challenges. We present a system, TEMPLAR, that can be used to augment existing NLIDBs. Our extensive experimental evaluation demonstrates the effectiveness of our approach, which yields up to a 138% improvement in top-1 accuracy in existing NLIDBs by leveraging SQL query log information.
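The two tasks named in the abstract can be illustrated with a small Python sketch. This is not TEMPLAR's algorithm, which the abstract does not specify. It is a toy illustration under stated assumptions: the query log, the schema graph, the similarity scores, and the scoring rule (similarity multiplied by one plus the log frequency) are all hypothetical, and the join path step is a plain breadth-first search over foreign-key links.

```python
from collections import Counter, deque

# Hypothetical query log: each entry is the set of schema attributes
# that one past SQL query referenced. Invented for illustration only.
QUERY_LOG = [
    {"publication.title", "author.name"},
    {"publication.title", "journal.name"},
    {"publication.title", "author.name"},
]

# Hypothetical schema graph: relation -> relations joinable via a foreign key.
SCHEMA_EDGES = {
    "author": ["writes"],
    "writes": ["author", "publication"],
    "publication": ["writes", "journal"],
    "journal": ["publication"],
}


def fragment_counts(log):
    """Count how often each schema attribute appears across the log."""
    counts = Counter()
    for query_fragments in log:
        counts.update(query_fragments)
    return counts


def rank_candidates(similarity, counts):
    """Keyword mapping: order candidate attributes for one keyword.

    `similarity` maps candidate -> text similarity to the keyword.
    The score (an assumed rule) boosts candidates seen often in the log.
    """
    return sorted(
        similarity,
        key=lambda cand: similarity[cand] * (1 + counts[cand]),
        reverse=True,
    )


def shortest_join_path(graph, start, goal):
    """Join path inference: breadth-first search for the shortest chain
    of relations linking `start` to `goal`; returns None if unreachable."""
    queue = deque([[start]])
    seen = {start}
    while queue:
        path = queue.popleft()
        if path[-1] == goal:
            return path
        for neighbor in graph[path[-1]]:
            if neighbor not in seen:
                seen.add(neighbor)
                queue.append(path + [neighbor])
    return None


counts = fragment_counts(QUERY_LOG)
# The keyword "name" is equally similar to both attributes; the log breaks the tie.
ranking = rank_candidates({"journal.name": 0.5, "author.name": 0.5}, counts)
join_path = shortest_join_path(SCHEMA_EDGES, "author", "journal")
```

Here the ambiguous keyword resolves to `author.name` because that attribute appears more often in the toy log, and the intermediate relations `writes` and `publication` are recovered without the user naming them.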
Pages: 374-385
Page count: 12