Using distributed query result caching to evaluate queries for parallel data mining algorithms

被引:0
|
作者
Taylor, MG [1 ]
Stoffel, K [1 ]
Hendler, JA [1 ]
Saltz, J [1 ]
机构
[1] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
关键词
parallel; query caching; discriminant rules;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
An increase in the speed of data mining algorithms can be achieved by improving the efficiency of the underlying technologies. Query engines are key components ill many knowledge discovery systems and the appropriate use of query engines can impact the performance of data mining algorithms. By laking advantage of hypothesis generation patterns, queries, generated from the hypotheses, call be evaluated more efficiently. Caching query results and using the cached results to evaluate new queries with similar constraints reduces the complexity of query evaluation and improves the performance of data mining algorithms. In a multi-processor environment, distributing the query result caches can improve the performance of parallel query evaluations. This Idea has been used in the ParDRI system and has resulted in significant improvements in the execution times of ParDRI.
引用
收藏
页码:1127 / 1132
页数:6
相关论文
共 50 条
  • [1] Query result caching for multiple event-driven continuous queries
    Watanabe, Yousuke
    Kitagawa, Hiroyuki
    INFORMATION SYSTEMS, 2010, 35 (01) : 94 - 110
  • [2] Incremental mining of frequent query patterns from XML queries for caching
    Li, Guoliang
    Feng, Jianhua
    Wang, Jianyong
    Zhang, Yong
    Zhou, Lizhu
    ICDM 2006: SIXTH INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2006, : 350 - +
  • [3] Study on algorithms of parallel and distributed data mining calculating process
    Fang, YW
    Zhao, XB
    Zhang, GP
    Wang, Y
    Sun, Y
    Zhang, YF
    Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 2084 - 2089
  • [4] Caching system for XML queries using frequent query patterns
    Bei, Yijun
    Chen, Gang
    Hu, Tianlei
    Dong, Jinxiang
    PROCEEDINGS OF THE 2007 11TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, VOLS 1 AND 2, 2007, : 47 - +
  • [5] Creation of Data Mining Algorithms as Functional Expression for Parallel and Distributed Execution
    Kholod, Ivan
    Petukhov, Ilya
    PARALLEL COMPUTING TECHNOLOGIES (PACT 2015), 2015, 9251 : 62 - 67
  • [6] Dynamic Distributed and Parallel Machine Learning algorithms for big data mining processing
    Djafri, Laouni
    DATA TECHNOLOGIES AND APPLICATIONS, 2022, 56 (04) : 558 - 601
  • [7] Parallel and Sequential Algorithms for Data Mining Using Inductive Logic
    Skillicorn, David B.
    Wang, Yu
    Knowledge and Information Systems, 2001, Springer Science and Business Media Deutschland GmbH (03) : 405 - 421
  • [8] Acorn: Aggressive Result Caching in Distributed Data Processing Frameworks
    Ramjit, Lana
    Interlandi, Matteo
    Wu, Eugene
    Netravali, Ravi
    PROCEEDINGS OF THE 2019 TENTH ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC '19), 2019, : 206 - 219
  • [9] Parallel induction algorithms for data mining
    Darlington, J
    Guo, YK
    Sutiwaraphun, J
    To, HW
    ADVANCES IN INTELLIGENT DATA ANALYSIS: REASONING ABOUT DATA, 1997, 1280 : 437 - 445
  • [10] A distributed framework for parallel data mining using HPJava']Java
    Rana, OF
    Fisk, D
    BT TECHNOLOGY JOURNAL, 1999, 17 (03) : 146 - 154