Online Schemaless Querying of Heterogeneous Open Knowledge Bases

被引:0
|
作者
Bhutani, Nikita [1 ]
Jagadish, H. V. [1 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
来源
PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19) | 2019年
关键词
open knowledge bases; heterogeneity; schemaless querying; SEARCH; TOOL;
D O I
10.1145/3357384.3357874
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Applications that depend on a deep understanding of natural language text have led to a renaissance of large knowledge bases (KBs). Some of these are curated manually and conform to an ontology. Many others, called open KBs, are derived automatically from unstructured text without any pre-specified ontology. These open KBs offer broad coverage of information but are far more heterogeneous than curated KBs, which themselves are more heterogeneous than traditional databases with a fixed schema. Due to the heterogeneity of information representation, querying KBs is a challenging task. Traditionally, query expansion is performed to cover all possible transformations and semantically equivalent structures. Such query expansion can be impractical for heterogeneous open KBs, particularly when complex queries lead to a combinatorial explosion of expansion possibilities. Furthermore, learning a query expansion model requires training examples, which is difficult to scale to diverse representations of facts in the KB. In this paper, we introduce an online schemaless querying method that does not require the query to exactly match the facts. Instead of exactly matching a query, it finds matches for individual query components and then identifies an answer by reasoning over the collective evidence. We devise an alignment-based algorithm for extracting answers based on textual and semantic similarity of query components and evidence fields. Thus, any representational mismatches between the query and evidence are handled online at query-time. Experiments show our approach is effective in handling multi-constraint queries.
引用
收藏
页码:699 / 708
页数:10
相关论文
共 50 条
  • [41] The mediator authorization-security model for heterogeneous semantic knowledge bases
    Alamri, Abdullah
    Bertok, Peter
    Thom, James A.
    Fahad, Adil
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2016, 55 : 227 - 237
  • [42] Querying databases with knowledge domains
    Ng, W
    2000 INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM - PROCEEDINGS, 2000, : 65 - 72
  • [43] A framework for querying heterogeneous images repositories
    Albanesi, MG
    Falchero, E
    Guerrini, F
    Ferretti, M
    INTERNET IMAGING V, 2004, 5304 : 154 - 159
  • [44] Autonomous querying for knowledge networks
    Greer, Kieran
    Baumgarten, Matthias
    Nugent, Chris
    Mulvenna, Maurice
    Curran, Kevin
    AUTONOMIC AND TRUSTED COMPUTING, PROCEEDINGS, 2008, 5060 : 249 - +
  • [45] Querying databases with knowledge domains
    Ng, Wilfred
    2000, IEEE, Piscataway, NJ, United States
  • [46] Analyzing Online Schema Extraction Approaches for Linked Data Knowledge Bases
    Zeimetz, Tobias
    Schenkel, Ralf
    PROCEEDINGS OF THE INTERNATIONAL WORKSHOP ON SEMANTIC BIG DATA (SBD 2019), 2019,
  • [47] Massive open online courses as a knowledge base for teachers
    Donitsa-Schmidt, Smadar
    Topaz, Beverley
    JOURNAL OF EDUCATION FOR TEACHING, 2018, 44 (05) : 608 - 620
  • [48] MULCE: Multi-level Canonicalization with Embeddings of Open Knowledge Bases
    Wu, Tien-Hsuan
    Kao, Ben
    Wu, Zhiyong
    Feng, Xiyang
    Song, Qianli
    Chen, Cheng
    WEB INFORMATION SYSTEMS ENGINEERING, WISE 2020, PT I, 2020, 12342 : 315 - 327
  • [49] Canonicalization of Open Knowledge Bases with Side Information from the Source Text
    Lin, Xueling
    Chen, Lei
    2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 950 - 961
  • [50] CESI: Canonicalizing Open Knowledge Bases using Embeddings and Side Information
    Vashishth, Shikhar
    Jain, Prince
    Talukdar, Partha
    WEB CONFERENCE 2018: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW2018), 2018, : 1317 - 1327