Benchmarking Classification Algorithms on High-Performance Computing Clusters

被引:1
|
作者
Bischl, Bernd [1 ]
Schiffner, Julia [1 ]
Weihs, Claus [1 ]
机构
[1] TU Dortmund, Dept Stat, Chair Computat Stat, Dortmund, Germany
关键词
D O I
10.1007/978-3-319-01595-8_3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Comparing and benchmarking classification algorithms is an important topic in applied data analysis. Extensive and thorough studies of such a kind will produce a considerable computational burden and are therefore best delegated to high-performance computing clusters. We build upon our recently developed R packages BatchJobs (Map, Reduce and Filter operations from functional programming for clusters) and BatchExperiments (Parallelization and management of statistical experiments). Using these two packages, such experiments can now effectively and reproducibly be performed with minimal effort for the researcher. We present benchmarking results for standard classification algorithms and study the influence of pre-processing steps on their performance.
引用
收藏
页码:23 / 31
页数:9
相关论文
共 50 条
  • [31] Profiling the BLAST bioinformatics application for load balancing on high-performance computing clusters
    Trinity Cheng
    Pei-Ju Chin
    Kenny Cha
    Nicholas Petrick
    Mike Mikailov
    BMC Bioinformatics, 23
  • [32] Profiling the BLAST bioinformatics application for load balancing on high-performance computing clusters
    Cheng, Trinity
    Chin, Pei-Ju
    Cha, Kenny
    Petrick, Nicholas
    Mikailov, Mike
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [33] Theoretical analysis and algorithm design of high-performance packet classification algorithms
    Qi, Ya-Xuan
    Li, Jun
    Jisuanji Xuebao/Chinese Journal of Computers, 2013, 36 (02): : 408 - 421
  • [34] Building high-performance clusters
    Jeffords, C
    Pham, D
    DR DOBBS JOURNAL, 2005, 30 (04): : 70 - +
  • [35] Performance Modeling, Benchmarking and Simulation of High Performance Computing Systems
    Wright, Steven A.
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 92 : 900 - 902
  • [36] Kinetic Algorithms for Modeling Conductive Fluids Flow on High-Performance Computing Systems
    B. N. Chetverushkin
    A. V. Saveliev
    V. I. Saveliev
    Doklady Mathematics, 2019, 100 : 577 - 581
  • [37] Kinetic Algorithms for Modeling Conductive Fluids Flow on High-Performance Computing Systems
    Chetverushkin, B. N.
    Saveliev, A. V.
    Saveliev, V. I.
    DOKLADY MATHEMATICS, 2019, 100 (03) : 577 - 581
  • [38] A survey of genome sequence assembly techniques and algorithms using high-performance computing
    Munib Ahmed
    Ishfaq Ahmad
    Mohammad Saad Ahmad
    The Journal of Supercomputing, 2015, 71 : 293 - 339
  • [39] High-Performance Computing Using Application of Q-determinant of Numerical Algorithms
    Aleeva, Valentina N.
    Aleev, Rifkhat Zh.
    2018 GLOBAL SMART INDUSTRY CONFERENCE (GLOSIC), 2018,
  • [40] Studying the Structure of Parallel Algorithms as a Key Element of High-Performance Computing Education
    Voevodin, Vladimir
    Antonov, Alexander
    Popova, Nina
    EURO-PAR 2018: PARALLEL PROCESSING WORKSHOPS, 2019, 11339 : 199 - 210