Statistical and Knowledge Supported Visualization of Multivariate Data

被引:1
|
作者
Fontes, Magnus [1 ]
机构
[1] Lund Univ, Ctr Math Sci, Box 118, SE-22100 Lund, Sweden
关键词
FALSE DISCOVERY RATE; WIDE EXPRESSION DATA; GENE SET ENRICHMENT; LARGEST EIGENVALUE; MICROARRAY DATA; MATRICES; CLASSIFICATION; NORMALIZATION; VARIABLES; SURVIVAL;
D O I
10.1007/978-3-642-20236-0_6
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
In the present work we have selected a collection of statistical and mathematical tools useful for the exploration of multivariate data and we present them in a form that is meant to be particularly accessible to a classically trained mathematician. We give self contained and streamlined introductions to principal component analysis, multidimensional scaling and statistical hypothesis testing. Within the presented mathematical framework we then propose a general exploratory methodology for the investigation of real world high dimensional datasets that builds on statistical and knowledge supported visualizations. We exemplify the proposed methodology by applying it to several different genomewide DNA-microarray datasets. The exploratory methodology should be seen as an embryo that can be expanded and developed in many directions. As an example we point out some recent promising advances in the theory for random matrices that, if further developed, potentially could provide practically useful and theoretically well founded estimations of information content in dimension reducing visualizations. We hope that the present work can serve as an introduction to, and help to stimulate more research within, the interesting and rapidly expanding field of data exploration.
引用
收藏
页码:143 / 173
页数:31
相关论文
共 50 条
  • [21] Statistical Graphics for Visualizing Multivariate Data
    Williams, R
    CONTEMPORARY SOCIOLOGY-A JOURNAL OF REVIEWS, 1999, 28 (05) : 633 - 634
  • [22] Statistical characterization of multivariate geotechnical data
    Department of Civil Engineering, National Taiwan University , China
    不详
    不详
    Reliab. of Geotech. Struct. in ISO2394, 1600, (89-126):
  • [23] Multivariate statistical analysis of environmental data
    Brzezinska, Justyna
    Rybicka, Aneta
    Pelka, Marcin
    12TH PROFESSOR ALEKSANDER ZELIAS INTERNATIONAL CONFERENCE ON MODELLING AND FORECASTING OF SOCIO-ECONOMIC PHENOMENA, 2018, 1 : 40 - 49
  • [24] Multivariate Statistical Analyses for Neuroimaging Data
    McIntosh, Anthony R.
    Misic, Bratislav
    ANNUAL REVIEW OF PSYCHOLOGY, VOL 64, 2013, 64 : 499 - +
  • [25] An Application of Multivariate Statistical Analysis for Query-Driven Visualization
    Gosink, Luke J.
    Garth, Christoph
    Anderson, John C.
    Bethel, E. Wes
    Joy, Kenneth I.
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2011, 17 (03) : 264 - 275
  • [26] Fault Detection Based on Statistical Multivariate Analysis and Microarray Visualization
    Ma, Ming-Da
    Wong, David Shan-Hill
    Jang, Shi-Shang
    Tseng, Sheng-Tsaing
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2010, 6 (01) : 18 - 24
  • [27] Parallel dual visualization of multidimensional Multivariate data
    Xu, Yonghong
    Hong, Wenxue
    Li, Xin
    Song, Jialin
    2007 IEEE INTERNATIONAL CONFERENCE ON INTEGRATION TECHNOLOGY, PROCEEDINGS, 2007, : 263 - +
  • [28] Multivariate Functional Data Visualization and Outlier Detection
    Dai, Wenlin
    Genton, Marc G.
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2018, 27 (04) : 923 - 934
  • [29] Dual multiresolution HyperSlice for multivariate data visualization
    Wong, PC
    Crabb, AH
    Bergeron, RD
    IEEE SYMPOSIUM ON INFORMATION VISUALIZATION '96, PROCEEDINGS, 1996, : 74 - +
  • [30] Variable contribution identification and visualization in multivariate statistical process monitoring
    Rossouw, R. F.
    Coetzer, R. L. J.
    Le Roux, N. J.
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2020, 196