Visualization techniques for mining large databases: A comparison

被引:157
|
作者
Keim, DA
Kriegel, HP
机构
[1] Insitute for Computer Science, University of Munich, D-80538 München
关键词
data mining; explorative data analysis; visualizing large databases; visualizing multidimensional; multivariate data;
D O I
10.1109/69.553159
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visual data mining techniques have proven to be of high value in exploratory data analysis, and they also have a high potential for mining large databases. In this article, we describe and evaluate a new visualization-based approach to mining large databases. The basic idea of our visual data mining techniques is to represent as many data items as possible on the screen at the same time by mapping each data value to a pixel of the screen and arranging the pixels adequately. The major goal of this article is to evaluate our visual data mining techniques and to compare them to other well-known visualization techniques for multidimensional data. the parallel coordinate and stick figure visualization techniques. For the evaluation of visual data mining techniques, in the first place the perception of properties of the data counts, and only in the second place the CPU time and the number of secondary storage accesses are important. In addition to testing the visualization techniques using real data, we developed a testing environment for database visualizations similar to the benchmark approach used for comparing the performance of database systems. The testing environment allows the generation of test data sets with predefined data characteristics which are important for comparing the perceptual abilities of visual data mining techniques.
引用
收藏
页码:923 / 938
页数:16
相关论文
共 50 条
  • [31] Melodic matching techniques for large music databases
    Uitdenbogerd, A
    Zobel, J
    ACM MULTIMEDIA 99, PROCEEDINGS, 1999, : 57 - 66
  • [32] Large model visualization: Techniques and applications
    Bartz, D
    WSCG'2003, VOL 11, NO 1, CONFERENCE PROCEEDINGS, 2003, : 5 - 7
  • [33] Mining and visualizing large anticancer drug discovery databases
    Shi, LM
    Fan, Y
    Lee, JK
    Waltham, M
    Andrews, DT
    Scherf, U
    Paull, KD
    Weinstein, JN
    JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2000, 40 (02): : 367 - 379
  • [34] Efficient Adaptive Retrieval and Mining in Large Multimedia Databases
    Assent, Ira
    IT-INFORMATION TECHNOLOGY, 2010, 52 (01): : 45 - 47
  • [35] Mining Rare Sequential Patterns in Large Transaction Databases
    Ouyang, Weimin
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ELECTRONIC TECHNOLOGY, 2016, 48 : 159 - 162
  • [36] Data mining of large astronomical databases with neural tools
    Longo, G
    Donalek, C
    Raiconi, G
    Staiano, A
    Tagliaferri, R
    Sessa, S
    Pasian, F
    Smareglia, R
    Volpicelli, AN
    ASTRONOMICAL DATA ANALYSIS II, 2002, 4847 : 265 - 276
  • [37] A general mining method for incremental updation in large databases
    Lee, WJ
    Lee, SJ
    2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 1423 - 1428
  • [38] Incremental mining large itemsets with constraints in dynamic databases
    Li, Naiqian
    Shen, Junyi
    Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2003, 37 (04): : 359 - 363
  • [39] Parallel and Distributed Frequent Pattern Mining in Large Databases
    Tanbeer, Syed Khairuzzaman
    Ahmed, Chowdhury Farhan
    Jeong, Byeong-Soo
    HPCC: 2009 11TH IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2009, : 407 - 414
  • [40] Methods of data mining and knowledge generalization in large databases
    Vagin, VN
    Fedotov, AA
    Fomin, MV
    JOURNAL OF COMPUTER AND SYSTEMS SCIENCES INTERNATIONAL, 1999, 38 (05) : 714 - 727