Data Mining with Sparse Grids

被引:0
|
作者
J. Garcke
M. Griebel
M. Thess
机构
[1] Institut für Angewandte Mathematik Rheinische Friedrich-Wilhelms-Universität Bonn Wegelerstr. 6 D-53115 Bonn Germany e-mail: garckej@iam.uni-bonn.de,
[2] Institut für Angewandte Mathematik Rheinische Friedrich-Wilhelms-Universität Bonn Wegelerstr. 6 D-53115 Bonn Germany e-mail: griebel@iam.uni-bonn.de,undefined
[3] Prudential Systems Software GmbH c/o Technologiezentrum Chemnitz Annaberger Str. 240 D-09125 Chemnitz Germany e-mail: thess@prudsys.com,undefined
来源
Computing | 2001年 / 67卷
关键词
AMS Subject Classifications: 62H30, 65D10, 68T10.; Key Words: Data mining, classification, approximation, sparse grids, combination technique.;
D O I
暂无
中图分类号
学科分类号
摘要
(hn−1nd−1) instead of O(hn−d) grid points and unknowns are involved. Here d denotes the dimension of the feature space and hn = 2−n gives the mesh size. To be precise, we suggest to use the sparse grid combination technique [42] where the classification problem is discretized and solved on a certain sequence of conventional grids with uniform mesh sizes in each coordinate direction. The sparse grid solution is then obtained from the solutions on these different grids by linear combination. In contrast to other sparse grid techniques, the combination method is simpler to use and can be parallelized in a natural and straightforward way. We describe the sparse grid combination technique for the classification problem in terms of the regularization network approach. We then give implementational details and discuss the complexity of the algorithm. It turns out that the method scales only linearly with the number of instances, i.e. the amount of data to be classified. Finally we report on the quality of the classifier built by our new method. Here we consider standard test problems from the UCI repository and problems with huge synthetical data sets in up to 9 dimensions. It turns out that our new method achieves correctness rates which are competitive to that of the best existing methods.
引用
收藏
页码:225 / 253
页数:28
相关论文
共 50 条
  • [1] Data mining with sparse grids
    Garcke, J
    Griebel, M
    Thess, M
    [J]. COMPUTING, 2001, 67 (03) : 225 - 253
  • [2] Multi- and Many-Core Data Mining with Adaptive Sparse Grids
    Heinecke, Alexander
    Pflueger, Dirk
    [J]. PROCEEDINGS OF THE 2011 8TH ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS (CF 2011), 2011,
  • [3] Data Mining on Grids
    Chakraverty, Shampa
    Gupta, Ankuj
    Goyal, Akhil
    Singal, Ashish
    [J]. CONTEMPORARY COMPUTING, 2011, 168 : 347 - 358
  • [4] Emerging Architectures Enable to Boost Massively Parallel Data Mining Using Adaptive Sparse Grids
    Heinecke, Alexander
    Pflueger, Dirk
    [J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2013, 41 (03) : 357 - 399
  • [5] Emerging Architectures Enable to Boost Massively Parallel Data Mining Using Adaptive Sparse Grids
    Alexander Heinecke
    Dirk Pflüger
    [J]. International Journal of Parallel Programming, 2013, 41 : 357 - 399
  • [6] Sparse Trust Data Mining
    Nie, Pengli
    Xu, Guangquan
    Jiao, Litao
    Liu, Shaoying
    Liu, Jian
    Meng, Weizhi
    Wu, Hongyue
    Feng, Meiqi
    Wang, Weizhe
    Jing, Zhengjun
    Zheng, Xi
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 4559 - 4573
  • [7] Mining internet data sets for computational grids
    Borzemski, L
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 3, PROCEEDINGS, 2005, 3683 : 268 - 274
  • [8] Middleware for data mining applications on clusters and grids
    Glimcher, Leonid
    Jin, Ruoming
    Agrawal, Gagan
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2008, 68 (01) : 37 - 53
  • [9] Heterogeneous Distributed Big Data Clustering on Sparse Grids
    Pfander, David
    Daiss, Gregor
    Pflueger, Dirk
    [J]. ALGORITHMS, 2019, 12 (03)
  • [10] Parallelisation of sparse grids for large scale data analysis
    Garcke, Jochen
    Hegland, Markus
    Nielsen, Ole
    [J]. ANZIAM JOURNAL, 2006, 48 : 11 - 22