Using distributed query result caching to evaluate queries for parallel data mining algorithms

被引:0
|
作者
Taylor, MG [1 ]
Stoffel, K [1 ]
Hendler, JA [1 ]
Saltz, J [1 ]
机构
[1] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
关键词
parallel; query caching; discriminant rules;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
An increase in the speed of data mining algorithms can be achieved by improving the efficiency of the underlying technologies. Query engines are key components ill many knowledge discovery systems and the appropriate use of query engines can impact the performance of data mining algorithms. By laking advantage of hypothesis generation patterns, queries, generated from the hypotheses, call be evaluated more efficiently. Caching query results and using the cached results to evaluate new queries with similar constraints reduces the complexity of query evaluation and improves the performance of data mining algorithms. In a multi-processor environment, distributing the query result caches can improve the performance of parallel query evaluations. This Idea has been used in the ParDRI system and has resulted in significant improvements in the execution times of ParDRI.
引用
收藏
页码:1127 / 1132
页数:6
相关论文
共 50 条
  • [41] Some Observations of Sequential, Parallel and Distributed Association Rule Mining Algorithms
    Garg, Rakhi
    Mishra, P. K.
    2009 INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING, PROCEEDINGS, 2009, : 336 - 342
  • [42] Using controlled query generation to evaluate blind relevance feedback algorithms
    Jordan, Chris
    Watters, Carolyn
    Gao, Qigang
    OPENING INFORMATION HORIZONS, 2006, : 286 - +
  • [43] Data Mining Using Parallel Multi-Objective Evolutionary Algorithms on Graphics Hardware
    Wong, Man-Leung
    Cui, Geng
    2010 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2010,
  • [44] EFFICIENT ALGORITHMS FOR RESOURCE-ALLOCATION IN DISTRIBUTED AND PARALLEL QUERY-PROCESSING ENVIRONMENTS
    LIU, P
    KIYOKI, Y
    MASUDA, T
    9TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, 1989, : 316 - 323
  • [45] Knowledge integration in a Parallel and distributed environment with association rule mining using XML data
    Paul, Sujni
    Saravanan, V.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2008, 8 (05): : 334 - 339
  • [46] Active semantic caching to optimize multidimensional data analysis in parallel and distributed environments
    Andrade, Henrique
    Kurc, Tahsin
    Sussman, Alan
    Saltz, Joel
    PARALLEL COMPUTING, 2007, 33 (7-8) : 497 - 520
  • [47] Data mining in deductive databases using query flocks
    Toroslu, IH
    Yetisgen-Yildiz, M
    EXPERT SYSTEMS WITH APPLICATIONS, 2005, 28 (03) : 395 - 407
  • [48] Parallel Computing Algorithms for Big Data Frequent Pattern Mining
    Shaik, Subhani
    Subhani, Shaik
    Devarakonda, Nagaraju
    Nagamani, Ch.
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA ENGINEERING, 2018, 9 : 113 - 123
  • [49] Parallel algorithms for mining association rules in time series data
    Sarker, BK
    Mori, T
    Hirata, T
    Uehara, K
    PARALLEL AND DISTRIBUTED PROCESSING AND APPLICATIONS, PROCEEDINGS, 2003, 2745 : 273 - 284
  • [50] Distributed Randomized Algorithms for Low-Support Data Mining
    Ferro, Alfredo
    Giugno, Rosalba
    Mongiovi, Misael
    Pulvirenti, Alfredo
    2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-5, 2009, : 2503 - 2509