A New Supervised t-SNE with Dissimilarity Measure for Effective Data Visualization and Classification

被引:13
|
作者
Hajderanj, Laureta [1 ]
Weheliye, Isakh [1 ]
Chen, Daqing [1 ]
机构
[1] London South Bank Univ, Sch Engn, London, England
关键词
High dimensional data; dissimilarity measure; supervised dimensionality reduction; supervised t-SNE; k-NN classification; NONLINEAR DIMENSIONALITY REDUCTION;
D O I
10.1145/3328833.3328853
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper, a new version of the Supervised t-Stochastic Neighbor Embedding (S-tSNE) algorithm is proposed which introduces the use of a dissimilarity measure related to class information. The proposed S-tSNE can be applied in any high dimensional dataset for visualization or as a feature extraction for classification problems. In this study, the S-tSNE is applied to three datasets MNIST, Chest x-ray, and SEER Breast Cancer. The two-dimensional data generated by the S-tSNE showed better visualization and an improvement in terms of classification accuracy in comparison to the original t-Stochastic Neighbor Embedding(t-SNE) method. The results from k-nearest neighbors (k-NN) classification model which used the lower dimension space generated by the new S-tSNE method showed more than 20% improvement on average in accuracy in all the three datasets compared with the t-SNE method. In addition, the classification accuracy using the S-tSNE for feature extraction was even higher than classification accuracy obtained from the original high dimensional data.
引用
收藏
页码:232 / 236
页数:5
相关论文
共 50 条
  • [1] Classification of Categorical Data Based on the Chi-Square Dissimilarity and t-SNE
    Cardona, Luis Ariosto Serna
    Vargas-Cardona, Hernan Dario
    Navarro Gonzalez, Piedad
    Cardenas Pena, David Augusto
    Orozco Gutierrez, Alvaro Angel
    [J]. COMPUTATION, 2020, 8 (04) : 1 - 15
  • [2] Visualization of SNPs with t-SNE
    Platzer, Alexander
    [J]. PLOS ONE, 2013, 8 (02):
  • [3] Parallel t-SNE Applied to Data Visualization in Smart Cities
    Da Silva Lopes, Maximiliano Araujo
    Doria Neto, Adriao D.
    De Medeiros Martins, Allan
    [J]. IEEE ACCESS, 2020, 8 : 11482 - 11490
  • [4] A t-SNE Based Classification Approach to Compositional Microbiome Data
    Xu, Xueli
    Xie, Zhongming
    Yang, Zhenyu
    Li, Dongfang
    Xu, Ximing
    [J]. FRONTIERS IN GENETICS, 2020, 11
  • [5] An Empirical Evaluation of the t-SNE Algorithm for Data Visualization in Structural Engineering
    Hajibabaee, Parisa
    Pourkamali-Anaraki, Farhad
    Hariri-Ardebili, Mohammad Amin
    [J]. 20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021), 2021, : 1674 - 1680
  • [6] Deep Supervised t-SNE for SAR Target Recognition
    Yu, Meiting
    Zhang, Siqian
    Zhao, Lingjun
    Kuang, Gangyao
    [J]. 2017 2ND INTERNATIONAL CONFERENCE ON FRONTIERS OF SENSORS TECHNOLOGIES (ICFST), 2017, : 265 - 269
  • [7] Visualizing Data using t-SNE
    van der Maaten, Laurens
    Hinton, Geoffrey
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2008, 9 : 2579 - 2605
  • [8] A novel Supervised t-SNE based approach of viseme classification for automated lip reading
    Fenghour, Souheil
    Chen, Daqing
    Hajderanj, Laureta
    Weheliye, Isakh
    Xiao, Perry
    [J]. INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND ENERGY TECHNOLOGIES (ICECET 2021), 2021, : 1316 - 1322
  • [9] Generic Process Visualization Using Parametric t-SNE
    Zhu, Wenbo
    Webb, Zachary
    Han, Xianyao
    Mao, Kaitian
    Sun, Wei
    Romagnoli, Jose
    [J]. IFAC PAPERSONLINE, 2018, 51 (18): : 803 - 808
  • [10] Seeing data as t-SNE and UMAP do
    Marx, Vivien
    [J]. NATURE METHODS, 2024, 21 (06) : 930 - 933