SQL queries over unstructured text Databases

被引:0
|
作者
Jain, Alpa [1 ]
Doan, AnHai [2 ]
Gravano, Luis [1 ]
机构
[1] Columbia Univ, New York, NY 10027 USA
[2] Univ Wisconsin, Madison, WI 53706 USA
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Text documents often embed data that is structured in nature. By processing a text database with information extraction systems, we can define a variety of structured "relations," over which we can then issue SQL queries. Processing SQL queries in this text-based scenario presents multiple challenges. One key challenge is efficiency: information extraction is a time-consuming process, so query processing strategies should pick efficient extraction systems whenever possible, and also minimize the number of documents that the), process. Another key challenge is result quality: extraction systems might output erroneous information or miss information that they should capture; also, efficiency-related query processing decisions (e.g., to avoid processing large numbers of useless documents) may compromise result completeness. To address these challenges, we characterize SQL query processing strategies in terms of their efficiency and result quality, and discuss the (user-specific) tradeoff between these two properties.
引用
收藏
页码:1230 / +
页数:2
相关论文
共 50 条
  • [41] AnaSearch: Extract, Retrieve and Visualize Structured Results from Unstructured Text for Analytical Queries
    Li, Tongliang
    Fang, Lei
    Lou, Jian-Guang
    Li, Zhoujun
    Zhang, Dongmei
    [J]. WSDM '21: PROCEEDINGS OF THE 14TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2021, : 906 - 909
  • [42] Databases, tuples, and SQL
    Kabachinski, Jeff
    [J]. Biomedical Instrumentation and Technology, 2008, 42 (05): : 385 - 387
  • [43] MULTIUSER DATABASES - THE SQL
    FINKELSTEIN, R
    [J]. BYTE, 1990, 15 (05): : 136 - &
  • [44] MAKE BULLETPROOF SQL QUERIES
    LINTHICUM, DS
    [J]. BYTE, 1995, 20 (02): : 111 - 113
  • [45] XML queries via SQL
    Chen, CX
    Malhotra, A
    [J]. WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2000, 1846 : 53 - 60
  • [46] VENN DIAGRAMS AND SQL QUERIES
    HALPIN, TA
    [J]. AUSTRALIAN COMPUTER JOURNAL, 1989, 21 (01): : 27 - 32
  • [47] Sensitivity Analysis of SQL Queries
    Laud, Peeter
    Pettai, Martin
    Randmets, Jaak
    [J]. PLAS'18: PROCEEDINGS OF THE 13TH WORKSHOP ON PROGRAMMING LANGUAGES AND ANALYSIS FOR SECURITY, 2018, : 2 - 12
  • [48] SQL queries with CASE expressions
    Gryz, Jarek
    Wang, Qiong
    Qian, Xiaoyan
    Zuzarte, Calisto
    [J]. FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2008, 4994 : 351 - +
  • [49] FORMAL SEMANTICS OF SQL QUERIES
    NEGRI, M
    PELAGATTI, G
    SBATTELLA, L
    [J]. ACM TRANSACTIONS ON DATABASE SYSTEMS, 1991, 16 (03): : 513 - 534
  • [50] Proving the safety of SQL queries
    Brass, S
    Goldberg, C
    [J]. QSIC 2005: FIFTH INTERNATIONAL CONFERENCE ON QUALITY SOFTWARE, PROCEEDINGS, 2005, : 197 - 204