Statistical and Knowledge Supported Visualization of Multivariate Data

Cited by: 1
Author
Fontes, Magnus [1]
Affiliation
[1] Lund Univ, Ctr Math Sci, Box 118, SE-22100 Lund, Sweden
Keywords
FALSE DISCOVERY RATE; WIDE EXPRESSION DATA; GENE SET ENRICHMENT; LARGEST EIGENVALUE; MICROARRAY DATA; MATRICES; CLASSIFICATION; NORMALIZATION; VARIABLES; SURVIVAL;
DOI
10.1007/978-3-642-20236-0_6
Chinese Library Classification number
O29 [Applied Mathematics];
Discipline classification code
070104;
Abstract
In the present work we have selected a collection of statistical and mathematical tools useful for the exploration of multivariate data, and we present them in a form that is meant to be particularly accessible to a classically trained mathematician. We give self-contained and streamlined introductions to principal component analysis, multidimensional scaling and statistical hypothesis testing. Within the presented mathematical framework we then propose a general exploratory methodology for the investigation of real-world high-dimensional datasets that builds on statistical and knowledge-supported visualizations. We exemplify the proposed methodology by applying it to several different genome-wide DNA microarray datasets. The exploratory methodology should be seen as an embryo that can be expanded and developed in many directions. As an example, we point out some recent promising advances in the theory of random matrices that, if further developed, could potentially provide practically useful and theoretically well-founded estimates of the information content in dimension-reducing visualizations. We hope that the present work can serve as an introduction to, and help to stimulate more research within, the interesting and rapidly expanding field of data exploration.
Pages: 143-173
Page count: 31