Targeted projection pursuit for interactive exploration of high-dimensional data sets

被引:0
|
作者
Faith, Joe [1 ]
机构
[1] Northumbria Univ, Newcastle Upon Tyne NE1 8ST, Tyne & Wear, England
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
High-dimensional data is, by its nature, difficult to visualise. Many current techniques involve reducing the dimensionality of the data, which results in a loss of information. Targeted Projection Pursuit is a novel method for visualising high-dimensional datasets which allows the user to interactively explore the space of possible views to find those that meet their requirements. A prototype tool that utilises this method is introduced, and is shown to allow users to explore data through an interface that is transparent and efficient. The tool and underlying technique are general purpose - applicable to any high-dimensional numeric data, and supporting a wide range of exploratory data analysis activities - but are evaluated on three particular tasks using gene expression data: identifiying discriminatory genes, visualising diagnostic classes, and detecting misdiagnosed samples. It is found to perform well in comparison with standard techniques.
引用
收藏
页码:286 / 292
页数:7
相关论文
共 50 条
  • [41] Training neural networks on high-dimensional data using random projection
    Wojcik, Piotr Iwo
    Kurdziel, Marcin
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2019, 22 (03) : 1221 - 1231
  • [42] Training neural networks on high-dimensional data using random projection
    Piotr Iwo Wójcik
    Marcin Kurdziel
    [J]. Pattern Analysis and Applications, 2019, 22 : 1221 - 1231
  • [43] streamingRPHash: Random Projection Clustering of High-Dimensional Data in a MapReduce Framework
    Franklin, Jacob
    Wenke, Samuel
    Quasem, Sadiq
    Carraher, Lee A.
    Wilsey, Philip A.
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2016, : 168 - 169
  • [44] A SOM projection technique with the growing structure for visualizing high-dimensional data
    Wu, Z
    Yen, GG
    [J]. PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 1763 - 1768
  • [45] On eigenfunction approach to data mining: outlier detection in high-dimensional data sets
    Nagar, AK
    Muyeba, MK
    [J]. 8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL II, PROCEEDINGS: COMPUTING TECHNIQUES, 2004, : 251 - 256
  • [46] Interactive tools for explaining multidimensional projections for high-dimensional tabular data
    Thijssen, Julian
    Tian, Zonglin
    Telea, Alexandru
    [J]. COMPUTERS & GRAPHICS-UK, 2024, 122
  • [47] Representative Factor Generation for the Interactive Visual Analysis of High-Dimensional Data
    Turkay, Cagatay
    Lundervold, Arvid
    Lundervold, Astri Johansen
    Hauser, Helwig
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2012, 18 (12) : 2621 - 2630
  • [48] Interactive Pattern Discovery in High-Dimensional, Multimodal Data Using Manifolds
    Guo, Jinhong K.
    Hofmann, Martin O.
    [J]. COMPLEX ADAPTIVE SYSTEMS CONFERENCE WITH THEME: ENGINEERING CYBER PHYSICAL SYSTEMS, CAS, 2017, 114 : 258 - 265
  • [49] Pollux: Interactive Cluster-First Projections of High-Dimensional Data
    Wenskovitch, John
    North, Chris
    [J]. 2019 IEEE VISUALIZATION IN DATA SCIENCE (VDS), 2019, : 38 - 47
  • [50] MetaGate: Interactive analysis of high-dimensional cytometry data with metadata integration
    Ask, Eivind Heggernes
    Tschan-Plessl, Astrid
    Hoel, Hanna Julie
    Kolstad, Arne
    Holte, Harald
    Malmberg, Karl-Johan
    [J]. PATTERNS, 2024, 5 (07):