Querying Big Data by Accessing Small Data

被引:15
|
作者
Fan, Wenfei [1 ,3 ]
Geerts, Floris [2 ]
Cao, Yang [1 ,3 ]
Deng, Ting [3 ]
Lu, Ping [3 ]
机构
[1] Univ Edinburgh, Edinburgh, Midlothian, Scotland
[2] Univ Antwerp, Antwerp, Belgium
[3] Beihang Univ, Beijing, Peoples R China
基金
英国工程与自然科学研究理事会;
关键词
Big data; query answering; complexity; EXPRESSIONS;
D O I
10.1145/2745754.2745771
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper investigates the feasibility of querying big data by accessing a bounded amount of the data. We study boundedly evaluable queries under a form of access constraints, when their evaluation cost is determined by the queries and constraints only. While it is undecidable to determine whether FO queries are boundedly evaluable, we show that for several classes of FO queries, the bounded evaluability problem is decidable. We also provide characterization and effective syntax for their boundedly evaluable queries. When a query Q is not boundedly evaluable, we study two approaches to approximately answering Q under access constraints. (1) We search for upper and lower envelopes of Q that are boundedly evaluable and warrant a constant accuracy bound. (2) We instantiate a minimum set of variables (parameters) in Q such that the specialized query is boundedly evaluable. We study problems for deciding the existence of envelopes and bounded specialized queries, and establish their complexity for various classes of FO queries.
引用
收藏
页码:173 / 184
页数:12
相关论文
共 50 条
  • [1] Querying Deep Web Data Bases without Accessing to Data
    Boughammoura, Radhouane
    Omri, Mohamed Nazih
    [J]. 2017 13TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2017, : 597 - 603
  • [2] On Scale Independence for Querying Big Data
    Fan, Wenfei
    Geerts, Floris
    Libkin, Leonid
    [J]. PODS'14: PROCEEDINGS OF THE 33RD ACM SIGMOD-SIGACT-SIGART SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2014, : 51 - 62
  • [3] PathGraph: Querying and Exploring Big Data Graphs
    Colazzo, Dario
    Mecca, Vincenzo
    Nole, Maurizio
    Sartiani, Carlo
    [J]. 30TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT (SSDBM 2018), 2018,
  • [4] Describing and Comparing Big Data Querying Tools
    Rodrigues, Mario
    Santos, Maribel Yasmina
    Bernardino, Jorge
    [J]. RECENT ADVANCES IN INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 1, 2017, 569 : 115 - 124
  • [5] Querying Big Data from a Database Perspective
    Zhao, Wenfeng
    Liu, Guohua
    Chen, Zhao
    Nyabuga, Douglas
    Yang, Huichun
    Zhang, Heng
    Ni, Mengfei
    [J]. 2017 4TH INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2017, : 1433 - 1437
  • [6] Querying Big Data: Bridging Theory and Practice
    樊文飞
    怀进鹏
    [J]. Journal of Computer Science & Technology, 2014, 29 (05) : 849 - 869
  • [7] Semantic Querying Big and Distributed RDF Data
    Kaoutar, Lamrani
    Abderrahim, Ghadi
    Kudagba, Florent Kunale
    [J]. PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON SMART CITY APPLICATIONS (SCA'18), 2018,
  • [8] Querying Big Data: Bridging Theory and Practice
    Fan, Wenfei
    Huai, Jin-Peng
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2014, 29 (05) : 849 - 869
  • [9] Querying Big Data: Bridging Theory and Practice
    Wenfei Fan
    Jin-Peng Huai
    [J]. Journal of Computer Science and Technology, 2014, 29 : 849 - 869
  • [10] Dynamic Data Transformation for Low Latency Querying in Big Data Systems
    Ordonez-Ante, Leandro
    Vanhove, Thomas
    Van Seghbroeck, Gregory
    Wauters, Tim
    Volckaert, Bruno
    De Turck, Filip
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 2480 - 2489