Runtime support for parallelizing data mining algorithms

被引:0
|
作者
Jin, RM [1 ]
Agrawal, G [1 ]
机构
[1] Ohio State Univ, Dept Comp & Informat Sci, Columbus, OH 43210 USA
关键词
parallel computing; locking; cache effects;
D O I
10.1117/12.460230
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With recent technological advances, shared memory parallel machines have become more scalable, and offer large main memories and high bus bandwidths. They are emerging as good platforms for data warehousing and data mining. In this paper, we focus on shared memory parallelization of data mining algorithms. We have developed a series of techniques for parallelization of data mining algorithms, including full replication, full locking, fixed locking, optimized full locking, and cache-sensitive locking. Unlike previous work on shared memory parallelization of specific data mining algorithms, all of our techniques apply to a large number of common data mining algorithms. In addition, we propose a reduction-object based interface for specifying a data mining algorithm. We show how our runtime system can apply any of the technique we have developed starting from a common specification of the algorithm.
引用
收藏
页码:212 / 223
页数:12
相关论文
共 50 条
  • [1] Compiler and runtime support for shared memory parallelization of data mining algorithms
    Li, XG
    Jin, RM
    Agrawal, G
    [J]. LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, 2005, 2481 : 265 - 279
  • [2] A Functional Approach to Parallelizing Data Mining Algorithms in Java']Java
    Kholod, Ivan
    Shorov, Andrey
    Gorlatch, Sergei
    [J]. PARALLEL COMPUTING TECHNOLOGIES (PACT 2017), 2017, 10421 : 459 - 472
  • [3] Parallelizing the execution of native data mining algorithms for computational biology
    Coro, Gianpaolo
    Candela, Leonardo
    Pagano, Pasquale
    Italiano, Angela
    Liccardo, Loredana
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2015, 27 (17): : 4630 - 4644
  • [4] Data Mining Algorithms Parallelizing in Functional Programming Language for Execution in Cluster
    Kholod, Ivan
    Malov, Aleksey
    Rodionov, Sergey
    [J]. INTERNET OF THINGS, SMART SPACES, AND NEXT GENERATION NETWORKS AND SYSTEMS, 2015, 9247 : 140 - 151
  • [5] An approach to support education of data mining algorithms
    Muchova, M.
    Paralic, J.
    Jancus, M.
    [J]. 2017 IEEE 15TH INTERNATIONAL SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS (SAMI), 2017, : 93 - 98
  • [6] Parallelizing evolutionary algorithms for clustering data
    Kwedlo, Wojciech
    [J]. PARALLEL PROCESSING AND APPLIED MATHEMATICS, 2006, 3911 : 430 - 438
  • [7] Speculation for Parallelizing Runtime Checks
    Susskraut, Martin
    Weigert, Stefan
    Schiffel, Ute
    Knauth, Thomas
    Nowack, Martin
    de Brum, Diogo Becker
    Fetzer, Christof
    [J]. STABILIZATION, SAFETY, AND SECURITY OF DISTRIBUTED SYSTEMS, PROCEEDINGS, 2009, 5873 : 698 - 710
  • [8] Distributed Randomized Algorithms for Low-Support Data Mining
    Ferro, Alfredo
    Giugno, Rosalba
    Mongiovi, Misael
    Pulvirenti, Alfredo
    [J]. 2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-5, 2009, : 2503 - 2509
  • [9] Spatial Data Mining: Database Primitives, Algorithms and Efficient DBMS Support
    Martin Ester
    Alexander Frommelt
    Hans-Peter Kriegel
    Jöorg Sander
    [J]. Data Mining and Knowledge Discovery, 2000, 4 : 193 - 216
  • [10] Spatial data mining: Database primitives, algorithms and efficient DBMS support
    Ester, M
    Frommelt, A
    Kriegel, HP
    Sander, J
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2000, 4 (2-3) : 193 - 216