SQL queries over unstructured text Databases

被引:0
|
作者
Jain, Alpa [1 ]
Doan, AnHai [2 ]
Gravano, Luis [1 ]
机构
[1] Columbia Univ, New York, NY 10027 USA
[2] Univ Wisconsin, Madison, WI 53706 USA
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Text documents often embed data that is structured in nature. By processing a text database with information extraction systems, we can define a variety of structured "relations," over which we can then issue SQL queries. Processing SQL queries in this text-based scenario presents multiple challenges. One key challenge is efficiency: information extraction is a time-consuming process, so query processing strategies should pick efficient extraction systems whenever possible, and also minimize the number of documents that the), process. Another key challenge is result quality: extraction systems might output erroneous information or miss information that they should capture; also, efficiency-related query processing decisions (e.g., to avoid processing large numbers of useless documents) may compromise result completeness. To address these challenges, we characterize SQL query processing strategies in terms of their efficiency and result quality, and discuss the (user-specific) tradeoff between these two properties.
引用
收藏
页码:1230 / +
页数:2
相关论文
共 50 条
  • [1] Optimizing SQL queries over text databases
    Jain, Alpa
    Doan, AnHai
    Gravano, Luis
    [J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 636 - +
  • [2] SQL queries over encrypted databases: a survey
    Sun, Bo
    Zhao, Sen
    Tian, Guohua
    [J]. CONNECTION SCIENCE, 2024, 36 (01)
  • [3] vSQL: Verifying Arbitrary SQL Queries over Dynamic Outsourced Databases
    Zhang, Yupeng
    Genkin, Daniel
    Katz, Jonathan
    Papadopoulos, Dimitrios
    Papamanthou, Charalampos
    [J]. 2017 IEEE SYMPOSIUM ON SECURITY AND PRIVACY (SP), 2017, : 863 - 880
  • [4] Correctness of SQL Queries on Databases with Nulls
    Guagliardo, Paolo
    Libkin, Leonid
    [J]. SIGMOD RECORD, 2017, 46 (03) : 5 - 16
  • [5] Populating Test Databases for Testing SQL Queries
    Suarez-Cabal, M. J.
    de la Riva, C.
    Tuya, J.
    [J]. IEEE LATIN AMERICA TRANSACTIONS, 2010, 8 (02) : 164 - 171
  • [6] A Novel Secure Scheme for Supporting Complex SQL Queries over Encrypted Databases in Cloud Computing
    Liu, Guoxiu
    Yang, Geng
    Wang, Huaqun
    Xiang, Yang
    Dai, Hua
    [J]. SECURITY AND COMMUNICATION NETWORKS, 2018,
  • [7] Making SQL Queries Correct on Incomplete Databases: A Feasibility Study
    Guagliardo, Paolo
    Libkin, Leonid
    [J]. PODS'16: PROCEEDINGS OF THE 35TH ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2016, : 211 - 223
  • [8] Precis: from unstructured keywords as queries to structured databases as answers
    Simitsis, Alkis
    Koutrika, Georgia
    Ioannidis, Yannis
    [J]. VLDB JOURNAL, 2008, 17 (01): : 117 - 149
  • [9] Scalable classification over SQL databases
    Chaudhuri, S
    Fayyad, U
    Bernhardt, J
    [J]. 15TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 1999, : 470 - 479
  • [10] Fusion queries over Internet databases
    Yerneni, R
    Papakonstantinou, Y
    Abiteboul, S
    Garcia-Molina, E
    [J]. ADVANCES IN DATABASE TECHNOLOGY - EDBT'98, 1998, 1377 : 57 - 71