English Access to Structured Data

被引:3
|
作者
Richardson, Kyle D. [1 ]
Bobrow, Daniel G. [1 ]
Condoravdi, Cleo [1 ]
Waldinger, Richard [2 ]
Das, Amar [3 ]
机构
[1] Palo Alto Res Ctr, Palo Alto, CA 94304 USA
[2] SRI Int, Menlo Pk, CA 94025 USA
[3] Stanford Univ, Stanford, CA 94305 USA
关键词
Natural language processing; Natural language interfaces to databases; Deductive question answering; Theorem proving; HIV drug resistance database;
D O I
10.1109/ICSC.2011.67
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present work on using a domain model to guide text interpretation, in the context of a project that aims to interpret English questions as a sequence of queries to be answered from structured databases. We adapt a broad-coverage and ambiguity-enabled natural language processing (NLP) system to produce domain-specific logical forms, using knowledge of the domain to zero in on the appropriate interpretation. The vocabulary of the logical forms is drawn from a domain theory that constitutes a higher-level abstraction of the contents of a set of related databases. The meanings of the terms are encoded in an axiomatic domain theory. To retrieve information from the databases, the logical forms must be instantiated by values constructed from fields in the database. The axiomatic domain theory is interpreted by the first-order theorem prover SNARK to identify the groundings, and then retrieve the values through procedural attachments semantically linked to the database. SNARK attempts to prove the logical form as a theorem by reasoning over the theory that is linked to the database and returns the exemplars of the proof(s) back to the user as answers to the query. The focus of this paper is more on the language task; however, we discuss the interaction that must occur between linguistic analysis and reasoning for an end-to-end natural language interface to databases. We illustrate the process using examples drawn from an HIV treatment domain, where the underlying databases are records of temporally bound treatments of individual patients.
引用
收藏
页码:13 / 20
页数:8
相关论文
共 50 条
  • [1] STRUCTURED BIOLOGICAL DATA IN THE MOLECULAR ACCESS SYSTEM
    BARCZA, S
    KELLY, LA
    WAHRMAN, SS
    KIRSCHENBAUM, RE
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1985, 25 (01): : 55 - 59
  • [2] Methods to Access Structured and Semi-Structured Data in Bioinformatics Databases: A Perspective
    Moftah, Raja A.
    Maatuk, Abdelsalam M.
    White, Richard
    [J]. 2016 INTERNATIONAL CONFERENCE ON ENGINEERING & MIS (ICEMIS), 2016,
  • [3] A STORAGE AND ACCESS MANAGER FOR ILL-STRUCTURED DATA
    KOTTEMAN, JE
    GORDON, MD
    STOTT, JW
    [J]. COMMUNICATIONS OF THE ACM, 1991, 34 (08) : 94 - 103
  • [4] Efficient structured data access in parallel file systems
    Ching, A
    Choudhary, A
    Liao, WK
    Ross, R
    Gropp, W
    [J]. IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, PROCEEDINGS, 2003, : 326 - 335
  • [5] Structured Data Access Annotations for Massively Parallel Computations
    Aldinucci, Marco
    Campa, Sonia
    Kilpatrick, Peter
    Torquati, Massimo
    [J]. EURO-PAR 2012: PARALLEL PROCESSING WORKSHOPS, 2013, 7640 : 381 - 390
  • [6] User-Friendly Access to Structured Environmental Data
    Abecker, Andreas
    Bicer, Veli
    Kazakos, Wassilios
    Nagypal, Gabor
    Nedkov, Radoslav
    Valikov, Aleksei
    [J]. ENVIRONMENTAL SOFTWARE SYSTEMS: FRAMEWORKS OF EENVIRONMENT, 2011, 359 : 357 - +
  • [7] Efficient retrieval of power structured data with global data access view
    Li, Jiwei
    Li, Bo
    Liu, Shi
    Lv, Hongwei
    Chen, Fei
    Liu, Qing
    [J]. ACM International Conference Proceeding Series, 2023, : 361 - 365
  • [8] Identity Resolution in Ontology Based Data Access to Structured Data Sources
    Toman, David
    Weddell, Grant
    [J]. PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2019, 11670 : 473 - 485
  • [9] A Scalable Data Access Layer to Manage Structured Heterogeneous Biomedical Data
    Delussu, Giovanni
    Lianas, Luca
    Frexia, Francesca
    Zanetti, Gianluigi
    [J]. PLOS ONE, 2016, 11 (12):
  • [10] Controlled English Ontology-Based Data Access
    Thorne, Camilo
    Calvanese, Diego
    [J]. CONTROLLED NATURAL LANGUAGE, 2010, 5972 : 135 - 154