An Empirical Evaluation of the t-SNE Algorithm for Data Visualization in Structural Engineering

Cited by: 13
Authors
Hajibabaee, Parisa [1]
Pourkamali-Anaraki, Farhad [1]
Hariri-Ardebili, Mohammad Amin [2]
Affiliations
[1] Univ Massachusetts, Comp Sci, Lowell, MA 01854 USA
[2] Univ Colorado, Civil Environm & Architectural Engn, Boulder, CO 80309 USA
Keywords
Classification algorithms; supervised learning; dimensionality reduction; feature extraction; oversampling; epistemic uncertainty; reliability analysis; SMOTE; challenges; reduction
DOI
10.1109/ICMLA52953.2021.00267
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A fundamental task in machine learning is visualizing the high-dimensional data sets that arise in high-impact application domains. The problem becomes much more challenging when the data are large and imbalanced. In this paper, the t-Distributed Stochastic Neighbor Embedding (t-SNE) algorithm is used to reduce the dimensions of an earthquake engineering data set for visualization purposes. Since imbalanced data sets greatly affect the accuracy of classifiers, we employ the Synthetic Minority Oversampling Technique (SMOTE) to tackle the imbalanced nature of this data set. We present the results obtained with t-SNE and SMOTE and compare them with the basic approaches in several respects. Considering four options and six classification algorithms, we show that when t-SNE is applied to the imbalanced data and SMOTE to the training data set, neural network classifiers give promising results without sacrificing accuracy. Hence, we can transform the studied scientific data into a two-dimensional (2D) space, enabling the visualization of the classifier and the resulting decision surface using a 2D plot.
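The oversampling step mentioned in the abstract can be illustrated with a minimal, standard-library-only sketch. It is not the authors' implementation: the function name `smote_sketch`, the toy 2D points, and the parameter values are all hypothetical, and the points merely stand in for a training split that has already been embedded in 2D. In practice one would more likely use an existing library such as imbalanced-learn. The sketch shows the core idea of SMOTE: each synthetic minority sample is placed at a random position on the line segment between a minority sample and one of its k nearest minority-class neighbours.

```python
import math
import random


def smote_sketch(minority, n_new, k=3, seed=0):
    """Create n_new synthetic points by interpolating between minority
    samples and their k nearest minority-class neighbours (simplified SMOTE)."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest neighbours of `base` within the minority class, excluding itself
        others = [p for p in minority if p is not base]
        neighbours = sorted(others, key=lambda p: math.dist(base, p))[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


# Toy 2D training split (hypothetical values): 6 majority vs 3 minority points.
majority = [(0.0, 0.0), (0.2, 0.1), (0.1, 0.3), (0.3, 0.2), (0.4, 0.0), (0.2, 0.4)]
minority = [(2.0, 2.0), (2.2, 2.1), (2.1, 2.4)]

# Oversample only the training minority class until the classes are balanced.
new_points = smote_sketch(minority, n_new=len(majority) - len(minority), k=2)
balanced_minority = minority + new_points
print(len(majority), len(balanced_minority))
```

After the call, both classes contain six points. Because every synthetic point is an interpolation between two existing minority points, it stays inside the region already occupied by the minority class, which is why the abstract's setup restricts oversampling to the training data and leaves the test data untouched.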
Pages: 1674-1680
Number of pages: 7