Clustering-based fragmentation and data replication for flexible query answering in distributed databases

被引:16
|
作者
Wiese L. [1 ]
机构
[1] Institute of Computer Science, Georg-August-Universität Göttingen, Goldschmidtstraße 7, Göttingen
关键词
Bin packing with conflicts; Clustering; Data replication; Distributed database; Flexible query answering; Fragmentation; Load balancing;
D O I
10.1186/s13677-014-0018-0
中图分类号
学科分类号
摘要
One feature of cloud storage systems is data fragmentation (or sharding) so that data can be distributed over multiple servers and subqueries can be run in parallel on the fragments. On the other hand, flexible query answering can enable a database system to find related information for a user whose original query cannot be answered exactly. Query generalization is a way to implement flexible query answering on the syntax level. In this paper we study a clustering-based fragmentation for the generalization operator Anti-Instantiation with which related information can be found in distributed data. We use a standard clustering algorithm to derive a semantic fragmentation of data in the database. The database system uses the derived fragments to support an intelligent flexible query answering mechanism that avoids overgeneralization but supports data replication in a distributed database system. We show that the data replication problem can be expressed as a special Bin Packing Problem and can hence be solved by an off-the shelf solver for integer linear programs. We present a prototype system that makes use of a medical taxonomy to determine similarities between medical expressions. © 2014, Wiese; licensee Springer.
引用
下载
收藏
页数:16
相关论文
共 50 条
  • [1] Clustering-based Query Result Authenticaion for Encrypted Databases in Cloud
    Jang, Miyoung
    Yoon, Min
    Youn, Deulnyeok
    Chang, Jae-Woo
    2014 IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2014 IEEE 6TH INTL SYMP ON CYBERSPACE SAFETY AND SECURITY, 2014 IEEE 11TH INTL CONF ON EMBEDDED SOFTWARE AND SYST (HPCC,CSS,ICESS), 2014, : 1076 - 1082
  • [2] Dynamic clustering-based query answering in Peer-to-Peer systems
    Qian, WN
    Zhou, SG
    Ren, Y
    Zhou, AY
    Ooi, BC
    Tang, KL
    ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2003, 2762 : 306 - 313
  • [3] A Novel Query-Driven Clustering-Based Technique for Vertical Fragmentation and Allocation in Distributed Database Systems
    Sewisy, Adel A.
    Amer, Ali Abdullah
    Abdalla, Hassan I.
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2017, 13 (02) : 27 - 54
  • [4] A Flexible Query Answering Approach for Autonomous Web Databases
    Meng, Xiangfu
    Zhang, Xiaoyan
    Li, Xiaoxi
    MANUFACTURING SYSTEMS AND INDUSTRY APPLICATIONS, 2011, 267 : 549 - 554
  • [5] Flexible query answering in data cubes
    Naouali, S
    Missaoui, R
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2005, 3589 : 221 - 232
  • [6] An empirical evaluation of a distributed clustering-based index for metric space databases
    Gil-Costa, Veronica
    Marin, Mauricio
    Reyes, Nora
    SISAP 2008: FIRST INTERNATIONAL WORKSHOP ON SIMILARITY SEARCH AND APPLICATIONS, PROCEEDINGS, 2008, : 95 - 102
  • [7] An empirical evaluation of a distributed clustering-based index for metric space databases
    Gil-Costa, Veronica
    Marin, Mauricio
    Reyes, Nora
    2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, VOLS 1 AND 2, 2008, : 546 - 553
  • [8] On the data complexity of consistent query answering over graph databases
    Barcelo, Pablo
    Fontaine, Gaelle
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2017, 88 : 164 - 194
  • [9] Clustering-Based Index and Data Broadcasting for Mobile Nearest Neighbor Query Processing
    Waluyo, Agustinus Borgy
    Taniar, David
    Rahayu, Wenny
    Srinivasan, Bala
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2013, 9 (04) : 1964 - 1974
  • [10] Distributed RDF Query Answering with Dynamic Data Exchange
    Potter, Anthony
    Motik, Boris
    Nenov, Yavor
    Horrocks, Ian
    SEMANTIC WEB - ISWC 2016, PT I, 2016, 9981 : 480 - 497