Statistical and Knowledge Supported Visualization of Multivariate Data

被引:1
|
作者
Fontes, Magnus [1 ]
机构
[1] Lund Univ, Ctr Math Sci, Box 118, SE-22100 Lund, Sweden
关键词
FALSE DISCOVERY RATE; WIDE EXPRESSION DATA; GENE SET ENRICHMENT; LARGEST EIGENVALUE; MICROARRAY DATA; MATRICES; CLASSIFICATION; NORMALIZATION; VARIABLES; SURVIVAL;
D O I
10.1007/978-3-642-20236-0_6
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
In the present work we have selected a collection of statistical and mathematical tools useful for the exploration of multivariate data and we present them in a form that is meant to be particularly accessible to a classically trained mathematician. We give self contained and streamlined introductions to principal component analysis, multidimensional scaling and statistical hypothesis testing. Within the presented mathematical framework we then propose a general exploratory methodology for the investigation of real world high dimensional datasets that builds on statistical and knowledge supported visualizations. We exemplify the proposed methodology by applying it to several different genomewide DNA-microarray datasets. The exploratory methodology should be seen as an embryo that can be expanded and developed in many directions. As an example we point out some recent promising advances in the theory for random matrices that, if further developed, potentially could provide practically useful and theoretically well founded estimations of information content in dimension reducing visualizations. We hope that the present work can serve as an introduction to, and help to stimulate more research within, the interesting and rapidly expanding field of data exploration.
引用
收藏
页码:143 / 173
页数:31
相关论文
共 50 条
  • [1] VISUALIZATION INVESTIGATION ON THE MARINE DATA WITH MULTIVARIATE STATISTICAL ANALYSIS METHODS
    Li Yajie
    Lv Zhengdong
    Wang Maonan
    POLISH MARITIME RESEARCH, 2017, 24 : 89 - 94
  • [2] Data visualization and knowledge visualization
    Lou, WJ
    Kong, FS
    Cao, YS
    PROCEEDINGS OF THE 3RD WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-5, 2000, : 2523 - 2525
  • [3] Visualization of Multivariate Metabolomic Data
    AA Ji-ye
    Chinese Herbal Medicines, 2011, 3 (04) : 285 - 289
  • [4] On the Visualization of Hierarchical Multivariate Data
    Zheng, Boyan
    Sadlo, Filip
    2021 IEEE 14TH PACIFIC VISUALIZATION SYMPOSIUM (PACIFICVIS 2021), 2021, : 136 - 145
  • [5] Multivariate visualization of particle data
    Liang Zhou
    Daniel Weiskopf
    The European Physical Journal Special Topics, 2019, 227 : 1741 - 1755
  • [6] Multivariate visualization of particle data
    Zhou, Liang
    Weiskopf, Daniel
    EUROPEAN PHYSICAL JOURNAL-SPECIAL TOPICS, 2019, 227 (14): : 1741 - 1755
  • [7] Constructing knowledge from multivariate spatiotemporal data: integrating geographical visualization with knowledge discovery in database methods
    Maceachren, AM
    Wachowicz, M
    Edsall, R
    Haug, D
    Masters, R
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 1999, 13 (04) : 311 - 334
  • [8] Lattice: Multivariate Data Visualization with R
    Cheshire, James
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2010, 173 : 275 - 276
  • [9] An Improved Multivariate Data Visualization Technique
    Sun, Yang
    Yuan, Jinping
    Hu, Yanli
    Xiao, Weidong
    2008 INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, VOLS 1-4, 2008, : 1525 - 1530
  • [10] Multivariate spatial data visualization: a survey
    Xiangyang He
    Yubo Tao
    Qirui Wang
    Hai Lin
    Journal of Visualization, 2019, 22 : 897 - 912