Scalable Multi-Query Optimization for Exploratory Queries over Federated Scientific Databases

被引:0
|
作者
Kementsietsidis, Anastasios [1 ]
Neven, Frank [2 ,3 ]
Van de Craen, Dieter [2 ,3 ]
Vansummeren, Stijn [2 ,3 ]
机构
[1] IBM TJ Watson Res Ctr New York, New York, NY USA
[2] Hasselt Univ, Hasselt, Belgium
[3] Transnat Univ Limburg, Limburg, Belgium
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2008年 / 1卷 / 01期
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The diversity and large volumes of data processed in the Natural Sciences today has led to a proliferation of highly-specialized and autonomous scientific databases with inherent and often intricate relationships. As a user-friendly method for querying this complex, ever-expanding network of sources for correlations, we propose exploratory queries. Exploratory queries are loosely-structured, hence requiring only minimal user knowledge of the source network. Evaluating an exploratory query usually involves the evaluation of many distributed queries. As the number of such distributed queries can quickly become large, we attack the optimization problem for exploratory queries by proposing several multi-query optimization algorithms that compute a global evaluation plan while minimizing the total communication cost, a key bottleneck in distributed settings. The proposed algorithms are necessarily heuristics, as computing an optimal global evaluation plan is shown to be np-hard. Finally, we present an implementation of our algorithms, along with experiments that illustrate their potential not only for the optimization of exploratory queries, but also for the multiquery optimization of large batches of standard queries.
引用
下载
收藏
页码:16 / 27
页数:12
相关论文
共 50 条
  • [41] Multi-clip query optimization in video databases
    Mostefaoui, A
    Brunie, L
    Kosch, H
    Böszörményi, L
    2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 363 - 366
  • [42] Scalable Query Processing and Query Engines over Cloud Databases: Models, Paradigms, Techniques, Future Challenges
    Cuzzocrea, Alfredo
    33RD INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT (SSDBM 2021), 2020, : 272 - 275
  • [43] Efficient execution of multi-query data analysis batches using compiler optimization strategies
    Andrade, H
    Aryangat, S
    Kurc, T
    Saltz, J
    Sussman, A
    LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, 2004, 2958 : 509 - 523
  • [44] Exploiting Shared Sub-Expression and Materialized View Reuse for Multi-Query Optimization
    Gurumurthy, Bala
    Bidarkar, Vasudev Raghavendra
    Broneske, David
    Pionteck, Thilo
    Saake, Gunter
    INFORMATION SYSTEMS FRONTIERS, 2024,
  • [45] A scalable framework for continuous query evaluations over multidimensional, scientific datasets
    Tolooee, Cameron
    Malensek, Matthew
    Pallickara, Sangmi Lee
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2016, 28 (08): : 2546 - 2563
  • [46] Query Optimization over a Heterogeneously Distributed Scientific Database
    Xiang, Helen X.
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [47] Scalable Multi-party Private Set Union from Multi-query Secret-Shared Private Membership Test
    Liu, Xiang
    Gao, Ying
    ADVANCES IN CRYPTOLOGY, ASIACRYPT 2023, PT I, 2023, 14438 : 237 - 271
  • [48] An efficient co-operative framework for multi-query processing over compressed XML data
    He, Juzhen
    Ng, Wilfred
    Wang, Xiaoling
    Zhou, Aoying
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2006, 3882 : 218 - 232
  • [49] A Query Optimizer for Range Queries over Multi-Attribute Trajectories
    Xu, Jianqiu
    Lu, Hua
    Bao, Zhifeng
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2023, 14 (01)
  • [50] I-SQE: A Query Engine for Answering Range Queries over Incomplete Spatial Databases
    Cuzzocrea, Alfredo
    Nucita, Andrea
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT II, PROCEEDINGS, 2009, 5712 : 91 - +