A parallel decision tree builder for mining very large visualization datasets

被引:0
|
作者
Bowyer, KW [1 ]
Hall, LO [1 ]
Moore, T [1 ]
Chawla, N [1 ]
机构
[1] Univ S Florida, Tampa, FL 33620 USA
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Simulation problems in the DOE ASCI program generate visualization datasets more than a terabyte in size. The practical difficulties in visualizing such datasets motivate the desire for automatic recognition of salient events. We have developed a parallel decision tree classifier for use in this context. Comparisons to ScalParC, a previous attempt to build a fast parallelization of a decision tree classifier, are provided. Our parallel classifier executes on the "ASCI Red" supercomputer. Experiments demonstrate that datasets too large to be processed on a single processor can be efficiently handled in parallel, and suggest that there need not be any decrease in accuracy relative to a monolithic classifier constructed on a single processor.
引用
收藏
页码:1888 / 1893
页数:6
相关论文
共 50 条
  • [1] Interactive parallel visualization of large particle datasets
    Liang, K
    Monger, P
    Couchman, H
    [J]. PARALLEL COMPUTING, 2005, 31 (02) : 243 - 260
  • [2] An efficient decision tree construction for large datasets
    Van, Uyen Nguyen Thi
    Chung, Tae Choong
    [J]. 2007 INNOVATIONS IN INFORMATION TECHNOLOGIES, VOLS 1 AND 2, 2007, : 502 - 506
  • [3] Decision Tree based Classifiers for Large Datasets
    Franco-Arcega, Anilu
    Ariel Carrasco-Ochoa, Jesus
    Sanchez-Diaz, Guillermo
    Francisco Martinez-Trinidad, Jose
    [J]. COMPUTACION Y SISTEMAS, 2013, 17 (01): : 95 - 102
  • [4] Data mining for selective visualization of large spatial datasets
    Shekhar, S
    Lu, CT
    Zhang, PS
    Liu, RL
    [J]. 14TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2002, : 41 - 48
  • [5] The effective use of a summary table and decision tree methodology to analyze very large healthcare datasets
    Sibbritt D.
    Gibberd R.
    [J]. Health Care Management Science, 2004, 7 (3) : 163 - 171
  • [6] Interactive visualization in mining large decision trees
    Nguyen, TD
    Ho, TB
    Shimodaira, H
    [J]. KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS: CURRENT ISSUES AND NEW APPLICATIONS, 2000, 1805 : 345 - 348
  • [7] A Parallel Algorithm to Induce Decision Trees for Large Datasets
    Franco-Arcega, A.
    Suarez-Cansino, J.
    Flores-Flores, L. G.
    [J]. 2013 XXIV INTERNATIONAL SYMPOSIUM ON INFORMATION, COMMUNICATION AND AUTOMATION TECHNOLOGIES (ICAT), 2013,
  • [8] Decision tree builder and visualizer
    Kwasnicka, H
    Doczekalski, M
    [J]. INTELLIGENT INFORMATION SYSTEMS 2002, PROCEEDINGS, 2002, 17 : 33 - 42
  • [9] Rendering large (volume) datasets: A new parallel visualization system
    Schneider, S
    May, T
    Schmidt, M
    [J]. WSCG'2003, VOL 11, NO 3, CONFERENCE PROCEEDINGS, 2003, : 418 - 424
  • [10] Parallel visualization of Visible Chinese Human with extremely large datasets
    Liu Qian
    Gong Hui
    Luo Qingming
    [J]. 2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 5172 - 5175