Visualization and knowledge discovery for high dimensional data

被引:5
|
作者
Inselberg, A [1 ]
机构
[1] Tel Aviv Univ, Sch Math Sci, IL-69978 Tel Aviv, Israel
关键词
D O I
10.1109/UIDIS.2001.929921
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The goal here is to present a multidimensional visualization methodology and its applications to Visual and Automatic Knowledge Discovery in a coherent paper. Visualization provides insight through images and can be considered as a collection of application specific mappings: Problem Domain --> Visual Range. For the visualization of multivariate problems a multidimensional system of Parallel coordinates (abbr. ||-coords) is constructed which induces a one-to-one mapping between subsets of N-space and subsets of 2-space. The result is a rigorous methodology for doing and seeing N-dimensional geometry. We start with an overview of the mathematical foundations where it is seen that from the display of high-dimensional datasets the search for multivariate relations among the variables is transformed into a 2-D pattern recognition problem. This is the basis for the application to Visual Knowledge Discovery which is illustrated in the second part with real dataset of VLSI production. Then a recent geometric classifier is presented and applied to 3 real datasets. The results compared to those of 23 other classifiers have the least error. The algorithm, has quadratic computational complexity in the size and number of parameters, provides comprehensible and explicit rules, does dimensionality selection - where the minimal set of original variables required to state the rule is found, and orders these variables so as to optimize the clarity of separation between the designated set and its complement. Finally a simple visual economic model of a real country is constructed and analyzed in order to illustrate the special strength of ||-coords in modeling multivariate relations by means of hypersurfaces.
引用
收藏
页码:5 / 24
页数:20
相关论文
共 50 条
  • [1] High-Dimensional Data Visualization Based on User Knowledge
    Liu, Qiaolian
    Zhao, Jianfei
    Guo, Naiwang
    Xiao, Ding
    Shi, Chuan
    [J]. DATA MINING AND BIG DATA, DMBD 2016, 2016, 9714 : 321 - 329
  • [2] Coupling visualization and data analysis for knowledge discovery from multi-dimensional scientific data
    Ruebel, Oliver
    Ahern, Sean
    Bethel, E. Wes
    Biggin, Mark D.
    Childs, Hank
    Cormier-Michel, Estelle
    DePace, Angela
    Eisen, Michael B.
    Fowlkes, Charless C.
    Geddes, Cameron G. R.
    Hagen, Hans
    Hamann, Bernd
    Huang, Min-Yu
    Keraenen, Soile V. E.
    Knowles, David W.
    Hendriks, Cris L. Luengo
    Malik, Jitendra
    Meredith, Jeremy
    Messmer, Peter
    Prabhat
    Ushizima, Daniela
    Weber, Gunther H.
    Wu, Kesheng
    [J]. ICCS 2010 - INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, PROCEEDINGS, 2010, 1 (01): : 1751 - 1758
  • [3] Information visualization in data mining and knowledge discovery.
    Badurek, CA
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2003, 54 (09): : 905 - 906
  • [4] Integration and visualization of biological data of the brain for the knowledge discovery
    Mineta, Katsuhiko
    Ikeo, Kazuho
    Tanaka, Yuzuru
    Gojobori, Takashi
    [J]. GENES & GENETIC SYSTEMS, 2006, 81 (06) : 460 - 460
  • [5] Coupling Clustering and Visualization for Knowledge Discovery from Data
    Cabanes, Guenael
    Bennani, Younes
    [J]. 2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 2127 - 2134
  • [6] VISUALIZATION FOR KNOWLEDGE DISCOVERY
    GRINSTEIN, G
    SIEG, JC
    SMITH, S
    WILLIAMS, MG
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 1992, 7 (07) : 637 - 648
  • [7] Lossless Visual Knowledge Discovery in High Dimensional Data with Elliptic Paired Coordinates
    McDonald, Rose
    Kovalerchuk, Boris
    [J]. 2020 24TH INTERNATIONAL CONFERENCE INFORMATION VISUALISATION (IV 2020), 2020, : 286 - 291
  • [8] Lossless Interpretable Glyphs for Visual Knowledge Discovery in High-Dimensional Data
    Cutlip, Nicholas
    Kovalerchuk, Boris
    [J]. 2023 27TH INTERNATIONAL CONFERENCE INFORMATION VISUALISATION, IV, 2023, : 292 - 299
  • [9] Improving symbolic data visualization for pattern recognition and knowledge discovery
    Umbleja, Kadri
    Ichino, Manabu
    Yaguchi, Hiroyuki
    [J]. VISUAL INFORMATICS, 2020, 4 (01) : 23 - 31
  • [10] Visualization and Visual Knowledge Discovery from Big Uncertain Data
    Leung, Carson K.
    Madill, Evan W. R.
    Pazdor, Adam
    [J]. 2022 26TH INTERNATIONAL CONFERENCE INFORMATION VISUALISATION (IV), 2022, : 330 - 335