Using distributed query result caching to evaluate queries for parallel data mining algorithms

Cited by: 0
Authors
Taylor, MG [1]
Stoffel, K [1]
Hendler, JA [1]
Saltz, J [1]
Affiliation
[1] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
Keywords
parallel; query caching; discriminant rules;
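DOI
Not available
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract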
An increase in the speed of data mining algorithms can be achieved by improving the efficiency of the underlying technologies. Query engines are key components in many knowledge discovery systems, and the appropriate use of query engines can impact the performance of data mining algorithms. By taking advantage of hypothesis generation patterns, queries generated from the hypotheses can be evaluated more efficiently. Caching query results and using the cached results to evaluate new queries with similar constraints reduces the complexity of query evaluation and improves the performance of data mining algorithms. In a multi-processor environment, distributing the query result caches can improve the performance of parallel query evaluations. This idea has been used in the ParDRI system and has resulted in significant improvements in the execution times of ParDRI.
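The caching idea in the abstract can be illustrated with a minimal sketch. It is not the ParDRI implementation, whose details are not given in this record. It assumes that queries are conjunctions of equality constraints and that a new query extending a cached query's constraints can be answered by filtering the cached rows rather than rescanning the full data set. The names `QueryResultCache`, `evaluate`, and `count_across_partitions` are hypothetical, and the one-cache-per-data-partition layout is only one possible reading of "distributing the query result caches".

```python
from typing import Dict, FrozenSet, List, Tuple

Row = Dict[str, object]
Constraint = Tuple[str, object]   # (attribute, required value); assumed query form
Query = FrozenSet[Constraint]     # a conjunction of equality constraints


class QueryResultCache:
    """Hypothetical cache of matching row indices for conjunctive queries."""

    def __init__(self, rows: List[Row]) -> None:
        self.rows = rows
        self.cache: Dict[Query, FrozenSet[int]] = {}
        self.rows_scanned = 0  # number of row checks, to make the saving visible

    def _best_cached_subset(self, query: Query) -> Query:
        """Return the largest cached query whose constraints all appear in `query`."""
        best: Query = frozenset()
        for cached in self.cache:
            if cached <= query and len(cached) > len(best):
                best = cached
        return best

    def evaluate(self, query: Query) -> FrozenSet[int]:
        """Return indices of rows satisfying every constraint in `query`."""
        if query in self.cache:
            return self.cache[query]
        base = self._best_cached_subset(query)
        # Start from the cached result if one applies, else from all rows.
        candidates = self.cache[base] if base else range(len(self.rows))
        remaining = query - base  # only these constraints still need checking
        matched = set()
        for i in candidates:
            self.rows_scanned += 1
            if all(self.rows[i].get(attr) == val for attr, val in remaining):
                matched.add(i)
        result = frozenset(matched)
        self.cache[query] = result
        return result


def count_across_partitions(caches: List[QueryResultCache], query: Query) -> int:
    """Assumed distributed layout: one cache per data partition, counts summed."""
    return sum(len(cache.evaluate(query)) for cache in caches)


if __name__ == "__main__":
    rows = [
        {"color": "red", "size": "S"},
        {"color": "red", "size": "L"},
        {"color": "blue", "size": "S"},
        {"color": "red", "size": "S"},
    ]
    cache = QueryResultCache(rows)
    general = frozenset({("color", "red")})
    specific = frozenset({("color", "red"), ("size", "S")})
    print(sorted(cache.evaluate(general)))   # scans all 4 rows
    print(sorted(cache.evaluate(specific)))  # scans only the 3 cached rows
    print(cache.rows_scanned)
```

In this sketch the second, more specific query checks only the rows already known to satisfy the first, which is the kind of saving the abstract attributes to reusing cached results for queries with similar constraints.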
Pages: 1127-1132
Page count: 6