An Empirical Evaluation of the t-SNE Algorithm for Data Visualization in Structural Engineering

Cited by: 13
Authors
Hajibabaee, Parisa [1]
Pourkamali-Anaraki, Farhad [1]
Hariri-Ardebili, Mohammad Amin [2]
Affiliations
[1] Univ Massachusetts, Comp Sci, Lowell, MA 01854 USA
[2] Univ Colorado, Civil Environm & Architectural Engn, Boulder, CO 80309 USA
Keywords
Classification algorithms; supervised learning; dimensionality reduction; feature extraction; oversampling; epistemic uncertainty; reliability analysis; SMOTE; challenges; reduction
DOI
10.1109/ICMLA52953.2021.00267
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A fundamental task in machine learning is visualizing the high-dimensional data sets that arise in high-impact application domains. The problem becomes much more challenging when the data are large and imbalanced. In this paper, the t-Distributed Stochastic Neighbor Embedding (t-SNE) algorithm is used to reduce the dimensions of an earthquake-engineering data set for visualization purposes. Because imbalanced data sets greatly affect the accuracy of classifiers, we employ the Synthetic Minority Oversampling Technique (SMOTE) to address the imbalance in this data set. We present the results obtained with t-SNE and SMOTE and compare them with the basic approaches from several perspectives. Considering four options and six classification algorithms, we show that when t-SNE is applied to the imbalanced data and SMOTE to the training data set, neural network classifiers give promising results without sacrificing accuracy. Hence, we can transform the studied scientific data into a two-dimensional (2D) space, enabling visualization of the classifier and the resulting decision surface in a 2D plot.
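The oversampling step mentioned in the abstract can be illustrated with a minimal, standard-library-only sketch of SMOTE-style interpolation. This is not the authors' code and not the API of any particular library: the function name `smote_oversample`, its parameters, and the toy coordinates are all assumptions made for illustration. It only shows the core idea, namely that each synthetic point is placed on the line segment between a minority-class sample and one of its k nearest minority-class neighbours, and that this is done on the training split only.

```python
import math
import random


def smote_oversample(minority, n_new, k=5, seed=0):
    """Return n_new synthetic points interpolated between minority-class
    samples and their k nearest minority-class neighbours (SMOTE-style).

    Hypothetical helper for illustration; names and defaults are assumptions.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples to interpolate")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)  # cannot have more neighbours than other samples
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority samples by Euclidean distance to the base point.
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: math.dist(base, p))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


# Toy 2D training split (made-up values): 6 majority vs. 3 minority samples.
train_majority = [(0.0, 0.1), (0.2, 0.0), (0.1, 0.3),
                  (0.4, 0.2), (0.3, 0.5), (0.5, 0.4)]
train_minority = [(5.0, 5.0), (5.5, 4.8), (4.9, 5.6)]

# Oversample the minority class until both classes have the same size.
new_points = smote_oversample(
    train_minority, n_new=len(train_majority) - len(train_minority), k=2
)
balanced_minority = train_minority + new_points
```

Only the training samples are passed to the helper, which mirrors the abstract's description of applying SMOTE to the training data set; the held-out data would be left untouched so that evaluation reflects the original class balance.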
Pages: 1674-1680
Page count: 7