A Parallel Coordinates Plot Method Based on Unsupervised Feature Selection for High-Dimensional Data Visualization

被引:2
|
作者
Lou, Jiaqi [1 ]
Dong, Ke [2 ]
Wang, Maosen [1 ]
机构
[1] Cranfield Univ, Sch Aerosp Transport & Mfg, Bedford, England
[2] Hefei Univ Technol, Sch Comp Sci & Informat, Hefei, Peoples R China
关键词
High-Dimensional Data Visualization; PCP; Unsupervised Feature Selection; Laplacian Score; LAPLACIAN SCORE; CONSTRAINT;
D O I
10.1109/IWCMC51323.2021.9498694
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In the recent years, high-dimensional data visualization has become a challenging task in data science and machine learning. As one of the most effective methods for high-dimensional data visualization, Parallel Coordinates Plots (PCPs) demonstrate dimensional reduction by transforming features of multivariate data into 2D axes. Such approach, however, does not consider the irrelevant or redundant features such that each feature is projected into the axis in a fixed manner. This paper proposed a novel PCP introduced by an unsupervised feature selection called Laplacian Score, which can be used to improve the visualization performance of PCP by ranking the importance of attributes based on their locality preserving power. The experimental results demonstrated that the performance of PCP visualization can be improved by feature selection method. Furthermore, we proposed a flexible user interface based on PCP visualization and Laplacian Score.
引用
下载
收藏
页码:532 / 536
页数:5
相关论文
共 50 条
  • [1] Hybrid fast unsupervised feature selection for high-dimensional data
    Manbari, Zhaleh
    AkhlaghianTab, Fardin
    Salavati, Chiman
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 124 : 97 - 118
  • [2] A New Metric on Parallel Coordinates and Its Application for High-Dimensional Data Visualization
    Tran Van Long
    2015 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC), 2015, : 297 - 301
  • [3] A hybrid feature selection method for high-dimensional data
    Taheri, Nooshin
    Nezamabadi-pour, Hossein
    2014 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2014, : 141 - 145
  • [4] An information-theoretic approach to unsupervised feature selection for high-dimensional data
    Huang S.-L.
    Xu X.
    Zheng L.
    IEEE Journal on Selected Areas in Information Theory, 2020, 1 (01): : 157 - 166
  • [5] An Information-theoretic Approach to Unsupervised Feature Selection for High-Dimensional Data
    Huang, Shao-Lun
    Zhang, Lin
    Zheng, Lizhong
    2017 IEEE INFORMATION THEORY WORKSHOP (ITW), 2017, : 434 - 438
  • [6] Feature selection for high-dimensional data
    Bolón-Canedo V.
    Sánchez-Maroño N.
    Alonso-Betanzos A.
    Progress in Artificial Intelligence, 2016, 5 (2) : 65 - 75
  • [7] Feature selection for high-dimensional data
    Destrero A.
    Mosci S.
    De Mol C.
    Verri A.
    Odone F.
    Computational Management Science, 2009, 6 (1) : 25 - 40
  • [8] A hybrid feature selection approach based on ensemble method for high-dimensional data
    Rouhi, Amirreza
    Nezamabadi-pour, Hossein
    2017 2ND CONFERENCE ON SWARM INTELLIGENCE AND EVOLUTIONARY COMPUTATION (CSIEC), 2017, : 16 - 20
  • [9] An ensemble feature selection method for high-dimensional data based on sort aggregation
    Wang, Jie
    Xu, Jing
    Zhao, Chengan
    Peng, Yan
    Wang, Hongpeng
    SYSTEMS SCIENCE & CONTROL ENGINEERING, 2019, 7 (02) : 32 - 39
  • [10] Bifocal Parallel Coordinates Plot for Multivariate Data Visualization
    Kaur, Gurminder
    Karki, Bijaya B.
    VISIGRAPP 2018: PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS / INTERNATIONAL CONFERENCE ON INFORMATION VISUALIZATION THEORY AND APPLICATIONS (IVAPP), VOL 3, 2018, : 173 - 180