AN EFFICIENT PROCESSING OF A CHAIN JOIN WITH THE MINIMUM COMMUNICATION COST IN DISTRIBUTED DATABASE-SYSTEMS

被引:39
|
作者
LIN, XM [1 ]
ORLOWSKA, ME [1 ]
机构
[1] UNIV QUEENSLAND,DEPT COMP SCI,BRISBANE,QLD 4072,AUSTRALIA
关键词
DISTRIBUTED DATABASES; QUERY PROCESSING OPTIMIZATION; COMMUNICATION COST;
D O I
10.1007/BF01263657
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper investigates the optimization problem when executing a join in a distributed database environment. The minimization of the communication cost for sending data through links has been adopted as an optimization criterion. We explore in this paper the approach of judiciously using join operations as reducers in distributed query processing. In general, this problem is computationally intractable. A restriction of the execution of a join in a pre-defined combinatorial order leads to a possible solution in polynomial time. An algorithm for a chain query computation has been proposed in [21]. The time complexity of the algorithm is O(m(2)n(2) + m(3)n), where n is the number of sites in the network, and m is the number of relations (fragments) involved in the join. In this paper, we firstly present a proof of the intuitively well understood fact-that the ''eigenorder'' of a ''chain'' join will be the best pre-defined combinatorial order to implement the algorithm in [21]. Secondly, we show a sufficient and necessary condition for a chain query with the eigenordering to be a ''simple'' query. For the process of the class of simple queries, we show a significant reduction of the time complexity from O(m(2)n(2) + m(3)n) to O(mn + m(2)). it is encouraging that, in practice, the most frequent queries belong to the category of simple queries.
引用
收藏
页码:69 / 83
页数:15
相关论文
共 50 条