Scalable Multi-Query Optimization for Exploratory Queries over Federated Scientific Databases

被引:0
|
作者
Kementsietsidis, Anastasios [1 ]
Neven, Frank [2 ,3 ]
Van de Craen, Dieter [2 ,3 ]
Vansummeren, Stijn [2 ,3 ]
机构
[1] IBM TJ Watson Res Ctr New York, New York, NY USA
[2] Hasselt Univ, Hasselt, Belgium
[3] Transnat Univ Limburg, Limburg, Belgium
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2008年 / 1卷 / 01期
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The diversity and large volumes of data processed in the Natural Sciences today has led to a proliferation of highly-specialized and autonomous scientific databases with inherent and often intricate relationships. As a user-friendly method for querying this complex, ever-expanding network of sources for correlations, we propose exploratory queries. Exploratory queries are loosely-structured, hence requiring only minimal user knowledge of the source network. Evaluating an exploratory query usually involves the evaluation of many distributed queries. As the number of such distributed queries can quickly become large, we attack the optimization problem for exploratory queries by proposing several multi-query optimization algorithms that compute a global evaluation plan while minimizing the total communication cost, a key bottleneck in distributed settings. The proposed algorithms are necessarily heuristics, as computing an optimal global evaluation plan is shown to be np-hard. Finally, we present an implementation of our algorithms, along with experiments that illustrate their potential not only for the optimization of exploratory queries, but also for the multiquery optimization of large batches of standard queries.
引用
下载
收藏
页码:16 / 27
页数:12
相关论文
共 50 条
  • [1] Multi-Query Optimization in Federated Databases using Evolutionary Algorithm
    Mansha, Sameen
    Kamiran, Faisal
    2015 IEEE 14TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2015, : 723 - 726
  • [2] Scalable Multi-Query Optimization for SPARQL
    Le, Wangchao
    Kementsietsidis, Anastasios
    Duan, Songyun
    Li, Feifei
    2012 IEEE 28TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2012, : 666 - 677
  • [3] Multi-query Optimization in Federated RDF Systems
    Peng, Peng
    Zou, Lei
    Ozsu, M. Tamer
    Zhao, Dongyan
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2018, PT I, 2018, 10827 : 745 - 765
  • [4] Pipelining in multi-query optimization
    Dalvi, NN
    Sanghai, SK
    Roy, P
    Sudarshan, S
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2003, 66 (04) : 728 - 762
  • [5] SPARQL Multi-Query Optimization
    Chen, Jiaqi
    Zhang, Fan
    Zou, Lei
    2018 17TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (IEEE TRUSTCOM) / 12TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA SCIENCE AND ENGINEERING (IEEE BIGDATASE), 2018, : 1419 - 1425
  • [6] Progressive query optimization for federated queries
    Ewen, Stephan
    Kache, Holger
    Markl, Volker
    Raman, Vijayshankar
    ADVANCES IN DATABASE TECHNOLOGY - EDBT 2006, 2006, 3896 : 847 - 864
  • [7] Pipeline-based multi-query optimization for similarity queries in grid environment
    Hu H.
    Zhuang Y.
    Hu H.-Y.
    Chiu D.
    Ruan Jian Xue Bao/Journal of Software, 2010, 21 (01): : 55 - 67
  • [8] Optimizing Multi-Query Evaluation in Federated RDF Systems
    Peng, Peng
    Ge, Qi
    Zou, Lei
    Ozsu, M. Tamer
    Xu, Zhiwei
    Zhao, Dongyan
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (04) : 1692 - 1707
  • [9] Multi-Query Optimization on RSS Feeds
    Getahun, Fekade
    Chbeir, Richard
    JOURNAL ON DATA SEMANTICS, 2018, 7 (01) : 47 - 64
  • [10] Multi-Query Optimization in MapReduce Framework
    Wang, Guoping
    Chan, Chee-Yong
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 7 (03): : 145 - 156