Guided Visual Exploration of Relations in Data Sets

Cited by: 0
Authors
Puolamäki, Kai [1]
Oikarinen, Emilia [2]
Henelius, Andreas [2, 3]
Affiliations
[1] Univ Helsinki, Inst Atmospher & Earth Syst Res, Dept Comp Sci, POB 68, FI-00014 Helsinki, Finland
[2] Univ Helsinki, Dept Comp Sci, POB 68, FI-00014 Helsinki, Finland
[3] OP Financial Grp, Gebhardinaukio 1, FI-00510 Helsinki, Finland
Funding
Academy of Finland;
Keywords
exploratory data analysis; visual exploration; dimensionality reduction; constrained randomisation; iterative data mining;
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology];
Subject classification code
0812;
Abstract
Efficient explorative data analysis systems must take into account both what a user knows and wants to know. This paper proposes a principled framework for interactive visual exploration of relations in data, through views most informative given the user's current knowledge and objectives. The user can input pre-existing knowledge of relations in the data and also formulate specific exploration interests, which are then taken into account in the exploration. The idea is to steer the exploration process towards the interests of the user, instead of showing uninteresting or already known relations. The user's knowledge is modelled by a distribution over data sets parametrised by subsets of rows and columns of data, called tile constraints. We provide a computationally efficient implementation of this concept based on constrained randomisation. Furthermore, we describe a novel dimensionality reduction method for finding the views most informative to the user, which at the limit of no background knowledge and with generic objectives reduces to PCA. We show that the method is suitable for interactive use and is robust to noise, outperforms standard projection pursuit visualisation methods, and gives understandable and useful results in analysis of real-world data. We provide an open-source implementation of the framework.
Pages: 32
Related papers
  • [1] Guided visual exploration of relations in data sets
    Puolamäki, Kai
    Oikarinen, Emilia
    Henelius, Andreas
    Journal of Machine Learning Research, 2021, 22
  • [2] Visual exploration of large data sets
    Livny, M
    Ramakrishnan, R
    Myllymaki, J
    HUMAN VISION AND ELECTRONIC IMAGING, 1996, 2657 : 263 - 274
  • [3] Visual exploration of large data sets
    Keim, D
    COMMUNICATIONS OF THE ACM, 2001, 44 (08) : 38 - 44
  • [4] Visual exploration of large data sets
    Keim, Daniel A.
    2001, Association for Computing Machinery (44)
  • [5] Visual-interactive Exploration of Interesting Multivariate Relations in Mixed Research Data Sets
    Bernard, Juergen
    Steiger, Martin
    Widmer, Sven
    Luecke-Tieke, Hendrik
    May, Thorsten
    Kohlhammer, Joern
    COMPUTER GRAPHICS FORUM, 2014, 33 (03) : 291 - 300
  • [6] Visual exploration of large structured data sets
    Wills, GJ
    NEW TECHNIQUES AND TECHNOLOGIES FOR STATISTICS II, 1997, : 237 - 245
  • [7] Analysis guided visual exploration of multivariate data
    Yang, Di
    Rundensteiner, Elke A.
    Ward, Matthew O.
    VAST: IEEE SYMPOSIUM ON VISUAL ANALYTICS SCIENCE AND TECHNOLOGY 2007, PROCEEDINGS, 2007, : 83 - 90
  • [8] A visual and interactive data exploration method for large data sets and clustering
    Da Costa, David
    Venturini, Gilles
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2007, 4632 : 553 - +
  • [9] Scalable visual data exploration of large data sets via multiresolution
    Keim, DA
    Schneidewind, J
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2005, 11 (11) : 1766 - 1779
  • [10] Parallel sets: Interactive exploration and visual analysis of categorical data
    Kosara, R
    Bendix, F
    Hauser, H
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2006, 12 (04) : 558 - 568