On the complexity of distributed query optimization

Cited by: 32
Authors
Wang, CP [1]
Chen, MS [1]
Affiliations
[1] National Taiwan University, Department of Electrical Engineering, Taipei, Taiwan
Keywords
distributed query optimization; semijoin processing; complexity; NP-hard problems; distributed databases;
DOI
10.1109/69.536256
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
While a significant amount of research effort has been reported on developing algorithms, based on joins and semijoins, for distributed query processing, relatively little progress has been made in exploring the complexity of the problems studied. Consequently, many researchers have worked on proving NP-hardness of, or devising polynomial-time algorithms for, certain distributed query optimization problems. However, owing to its inherent difficulty, the complexity of most distributed query optimization problems remains unknown. In this paper we characterize distributed query optimization problems in general and provide a framework for exploring their complexity. As will be shown, most distributed query optimization problems can be transformed into an optimization problem comprising a set of binary decisions, termed the Sum Product Optimization (SPO) problem. We first prove that SPO is NP-hard, based on the NP-completeness of a well-known problem, Knapsack (KNAP). Then, using this result as a basis, we prove that five classes of distributed query optimization problems, which cover the majority of those previously studied in the literature, are NP-hard by polynomially reducing SPO to each of them. The details of each problem transformation are derived. We not only prove the conjecture that many prior studies relied upon, but also provide a framework for future related studies.
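The abstract does not define SPO itself, so no attempt is made to reproduce it here. As a rough illustration of the kind of binary-decision problem the hardness proof starts from, the sketch below checks the Knapsack decision problem in its common textbook form by exhaustive search. The function name `knapsack_decision`, its parameters, and the choice of this particular formulation are assumptions for illustration; the paper's exact statement of KNAP may differ.

```python
from itertools import product


def knapsack_decision(sizes, values, capacity, target):
    """Brute-force check of the textbook Knapsack decision problem.

    Assumed formulation (not taken from the paper): is there a 0/1 choice
    per item such that the total size is at most `capacity` and the total
    value is at least `target`?  Returns the first satisfying choice
    vector, or None if no such choice exists.
    """
    # Enumerate all 2^n vectors of binary decisions (exponential time).
    for choice in product((0, 1), repeat=len(sizes)):
        total_size = sum(s for s, c in zip(sizes, choice) if c)
        total_value = sum(v for v, c in zip(values, choice) if c)
        if total_size <= capacity and total_value >= target:
            return choice
    return None


if __name__ == "__main__":
    # Three hypothetical items; the numbers are made up for illustration.
    print(knapsack_decision([3, 4, 5], [4, 5, 6], capacity=7, target=9))
    print(knapsack_decision([3, 4, 5], [4, 5, 6], capacity=7, target=10))
```

The exhaustive loop makes the "set of binary decisions" structure explicit: each item is either taken or not, and the search space doubles with every added item.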
Pages: 650-662
Number of pages: 13