Pipeline-based multi-query optimization for similarity queries in grid environment

被引:0
|
作者
Hu H. [1 ,2 ]
Zhuang Y. [2 ]
Hu H.-Y. [1 ,2 ]
Chiu D. [3 ]
机构
[1] College of Computer Science, Hangzhou Dianzi University
[2] College of Computer and Information Engineering, Zhejiang Gongshang University
[3] Department of Computer Science and Engineering, Chinese University of Hong Kong, Hong Kong
来源
Ruan Jian Xue Bao/Journal of Software | 2010年 / 21卷 / 01期
关键词
Data partition; Grid; High-dimensional indexing; Multi-query optimization;
D O I
10.3724/SP.J.1001.2010.03665
中图分类号
学科分类号
摘要
This paper proposes a multi-query optimization algorithm for pipeline-based distributed similarity query processing (pGMSQ) in grid environment. First, when a number of query requests are simultaneously submitted by users, a cost-based dynamic query clustering (DQC) is invoked to quickly and effectively identify the correlation among the query spheres (requests). Then, index-support vector set reduction is performed at data node level in parallel. Finally, refinement of the candidate vectors is conducted to get the answer set at the execution node level. By adopting pipeline-based technique, this algorithm is experimentally proved to be efficient and effective in minimizing the response time by decreasing network transfer cost and increasing the throughput. © by Institute of Software, the Chinese Academy of Sciences. All rights reserved.
引用
下载
收藏
页码:55 / 67
页数:12
相关论文
共 22 条
  • [1] Sellis T.K., Multi-Query optimization, ACM Trans. on Database Systems, 13, 1, pp. 23-52, (1988)
  • [2] Bohm C., Berchtold S., Keim D., Searching in high-dimensional spaces: Index structures for improving the performance of multimedia databases, ACM Computing Surveys, 33, 3, pp. 322-373, (2001)
  • [3] Guttman A., R-Tree: A dynamic index structure for spatial searching, Proc. of the ACM SIGMOD Int'l Conf. on Management of Data (SIGMOD'84), pp. 47-54, (1984)
  • [4] Beckmann N., Kriegel H.P., Schneider R., Seeger B., The R*-tree: An efficient and robust access method for points and rectangles, Proc. of the ACM SIGMOD Int'l Conf. on Management of Data (SIGMOD 2000), pp. 322-331, (1990)
  • [5] Ratnasamy S., Francis P., Handley M., Karp R., Shenker S., A scalable content-addressable network, Proc. of the ACM SIGCOMM Int'l Conf. on Data Communication (SIGCOMM), pp. 161-172, (2001)
  • [6] Aspnes J., Shah G., Skip graphs, Proc. of the ACM-SIAM Symp. on Discrete Algorithms, pp. 384-393, (2003)
  • [7] Smith J., Gounaris A., Watson P., Paton N.W., Fernandes A.A.A., Sakellariou R., Distributed query processing on the grid, Proc. of the 3rd Int'l Workshop on Grid Computing, pp. 279-290, (2002)
  • [8] Jagadish H.V., Ooi B.C., Tan K.L., Yu C., Zhang R., iDistance: An adaptive B<sup>+</sup>-tree based indexing method for nearest neighbor search, ACM Trans. on Data Base Systems, 2, 30, pp. 364-397, (2005)
  • [9] Berchtold S., Bohm C., Braunmuller B., Keim D.A., Kriegel H.P., Fast parallel similarity search in multimedia databases, Proc. of the ACM SIGMOD Int'l Conf. on Management of Data (SIGMOD'97), pp. 1-12, (1997)
  • [10] Papadopoulos A.N., Manolopoulos Y., Similarity query processing using disk arrays, Proc. of the ACM SIGMOD Int'l Conf. on Management of Data (SIGMOD'98), pp. 225-236, (1998)