High Performance OLAP and Data Mining on Parallel Computers

Cited by: 0
Authors
Sanjay Goil
Alok Choudhary
Affiliation
[1] Northwestern University, Department of Electrical and Computer Engineering and Center for Parallel and Distributed Computing
Keywords
Data Cube; Parallel Computing; High Performance; Data Mining; Attribute Focusing
DOI
Not available
Abstract
On-Line Analytical Processing (OLAP) techniques are increasingly being used in decision support systems to provide analysis of data. Queries posed on such systems are quite complex and require different views of the data. Analytical models need to capture the multidimensionality of the underlying data, a task for which multidimensional databases are well suited. Multidimensional OLAP systems store data in multidimensional arrays on which analytical operations are performed. Knowledge discovery and data mining require complex operations on the underlying data, which can be very expensive in terms of computation time. High performance parallel systems can reduce this analysis time.
Pages: 391-417
Page count: 26