NADAQ: Natural Language Database Querying Based on Deep Learning

Cited by: 17
Authors
Xu, Boyan [1 ]
Cai, Ruichu [1 ]
Zhang, Zhenjie [2 ]
Yang, Xiaoyan [2 ]
Hao, Zhifeng [1 ,3 ]
Li, Zijian [1 ]
Liang, Zhihao [1 ]
Affiliations
[1] Guangdong Univ Technol, Fac Comp, Guangzhou 510006, Guangdong, Peoples R China
[2] Yitu Technol Pte Ltd, Singapore R&D, Singapore 117372, Singapore
[3] Foshan Univ, Sch Math & Big Data, Foshan 528000, Peoples R China
Keywords
Databases; natural language processing; recurrent neural networks
DOI
10.1109/ACCESS.2019.2904720
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The high complexity of the SQL language and of database schemas makes database querying a challenging task for human programmers. In this paper, we present our new natural language database querying (NADAQ) system as an alternative solution, built on new translation models that fuse deep learning with traditional database parsing techniques. On top of the popular encoder-decoder model for machine translation, NADAQ injects additional dimensions of schema-aware bits, associated with the input words, into the encoder phase, and adds new hidden memory neurons, controlled by a finite state machine for grammatical state tracking, to the decoder phase. We further develop new techniques that enable the augmented neural network to reject queries irrelevant to the contents of the target database and to recommend candidate queries that are translated back into natural language. On real-world database systems with human-labeled workloads, NADAQ returns query results at 90% accuracy.
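The two ideas named in the abstract, schema-aware bits on the encoder input and grammar tracking by a finite state machine in the decoder, can be illustrated with a toy sketch. This is not the NADAQ implementation: the schema, the four-state grammar, and every function name below are hypothetical. The sketch also substitutes a simple hard mask over candidate output tokens for the FSM-controlled hidden memory neurons that the abstract describes.

```python
from typing import Dict, List, Set

# Hypothetical toy schema; these names are illustrative, not from the paper.
SCHEMA_TABLES: Set[str] = {"employee", "department"}
SCHEMA_COLUMNS: Set[str] = {"name", "salary", "age"}


def schema_bits(token: str) -> List[int]:
    """Return [is_table, is_column] flags for one input word."""
    t = token.lower()
    return [int(t in SCHEMA_TABLES), int(t in SCHEMA_COLUMNS)]


def augment(tokens: List[str], embeddings: List[List[float]]) -> List[List[float]]:
    """Append the schema bits to each word's embedding (encoder-side idea)."""
    return [emb + [float(b) for b in schema_bits(tok)]
            for tok, emb in zip(tokens, embeddings)]


# Assumed toy grammar: SELECT <col> FROM <table>.
FSM: Dict[str, Dict[str, str]] = {
    "START": {"SELECT": "AFTER_SELECT"},
    "AFTER_SELECT": {"<col>": "AFTER_COL"},
    "AFTER_COL": {"FROM": "AFTER_FROM"},
    "AFTER_FROM": {"<table>": "END"},
    "END": {},
}


def token_class(tok: str) -> str:
    """Map a concrete token to the symbol the FSM transitions on."""
    if tok in SCHEMA_COLUMNS:
        return "<col>"
    if tok in SCHEMA_TABLES:
        return "<table>"
    return tok


def constrained_decode(scores_per_step: List[Dict[str, float]]) -> List[str]:
    """Greedy decoding that keeps only tokens legal in the current FSM state."""
    state, output = "START", []
    for scores in scores_per_step:
        allowed = {t: s for t, s in scores.items() if token_class(t) in FSM[state]}
        if not allowed:  # no grammatical continuation is available
            break
        best = max(allowed, key=allowed.get)
        output.append(best)
        state = FSM[state][token_class(best)]
    return output
```

With made-up decoder scores in which an ungrammatical token ranks highest at every step, the mask still yields a well-formed query: `constrained_decode([{"FROM": 0.9, "SELECT": 0.1}, {"employee": 0.8, "salary": 0.2}, {"FROM": 0.5, "SELECT": 0.5}, {"salary": 0.9, "employee": 0.1}])` returns `["SELECT", "salary", "FROM", "employee"]`.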
Pages: 35012-35017
Number of pages: 6