An Empirical Evaluation of the t-SNE Algorithm for Data Visualization in Structural Engineering

被引:13
|
作者
Hajibabaee, Parisa [1 ]
Pourkamali-Anaraki, Farhad [1 ]
Hariri-Ardebili, Mohammad Amin [2 ]
机构
[1] Univ Massachusetts, Comp Sci, Lowell, MA 01854 USA
[2] Univ Colorado, Civil Environm & Architectural Engn, Boulder, CO 80309 USA
关键词
Classification algorithms; supervised learning; dimensionality reduction; feature extraction; oversampling; EPISTEMIC UNCERTAINTY; RELIABILITY-ANALYSIS; SMOTE; CHALLENGES; REDUCTION;
D O I
10.1109/ICMLA52953.2021.00267
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A fundamental task in machine learning involves visualizing high-dimensional data sets that arise in high-impact application domains. When considering the context of large imbalanced data, this problem becomes much more challenging. In this paper, the t-Distributed Stochastic Neighbor Embedding (tSNE) algorithm is used to reduce the dimensions of an earthquake engineering related data set for visualization purposes. Since imbalanced data sets greatly affect the accuracy of classifiers, we employ Synthetic Minority Oversampling Technique (SMOTE) to tackle the imbalanced nature of such data set. We present the result obtained from t-SNE and SMOTE and compare it to the basic approaches with various aspects. Considering four options and six classification algorithms, we show that using t-SNE on the imbalanced data and SMOTE on the training data set, neural network classifiers have promising results without sacrificing accuracy. Hence, we can transform the studied scientific data into a two-dimensional (2D) space, enabling the visualization of the classifier and the resulting decision surface using a 2D plot.
引用
收藏
页码:1674 / 1680
页数:7
相关论文
共 50 条
  • [21] A t-SNE Based Classification Approach to Compositional Microbiome Data
    Xu, Xueli
    Xie, Zhongming
    Yang, Zhenyu
    Li, Dongfang
    Xu, Ximing
    [J]. FRONTIERS IN GENETICS, 2020, 11
  • [22] Using Global t-SNE to Preserve Intercluster Data Structure
    Zhou, Yuansheng
    Sharpee, Tatyana O.
    [J]. NEURAL COMPUTATION, 2022, 34 (08) : 1637 - 1651
  • [23] TWO-DIMENSIONAL VISUALIZATION OF LARGE DOCUMENT LIBRARIES USING T-SNE
    Gonzalez-Marquez, Rita
    Berens, Philipp
    Kobak, Dmitry
    [J]. TOPOLOGICAL, ALGEBRAIC AND GEOMETRIC LEARNING WORKSHOPS 2022, VOL 196, 2022, 196
  • [24] t-SNE Manifold Learning Based Visualization: A Human Activity Recognition Approach
    Dharavath, Ramesh
    MadhukarRao, G.
    Khurana, Himanshu
    Edla, Damodar Reddy
    [J]. ADVANCES IN DATA SCIENCE AND MANAGEMENT, 2020, 37 : 33 - 43
  • [25] Data Segmentation via t-SNE, DBSCAN, and Random Forest
    DeLise, Timothy
    [J]. INTELLIGENT COMPUTING, VOL 2, 2021, 284 : 139 - 151
  • [26] Speaker Recognition System Based on Identity Vector Using t-SNE Visualization and Mean-shift Algorithm
    Kiani, Kourosh
    Baniasadi, Atefeh
    [J]. 2019 5TH IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS 2019), 2019,
  • [27] t-SNE-CUDA: GPU-Accelerated t-SNE and its Applications to Modern Data
    Chan, David M.
    Rao, Roshan
    Huang, Forrest
    Canny, John F.
    [J]. 2018 30TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD 2018), 2018, : 330 - 338
  • [28] Examining Intermediate Data Reduction Algorithms for use with t-SNE
    Campbell, Aaron
    Caudle, Kyle
    Hoover, Randy C.
    [J]. PROCEEDINGS OF THE 2019 THE 3RD INTERNATIONAL CONFERENCE ON COMPUTE AND DATA ANALYSIS (ICCDA 2019), 2019, : 36 - 42
  • [29] MetGem Software for the Generation of Molecular Networks Based on the t-SNE Algorithm
    Olivon, Florent
    Elie, Nicolas
    Grelier, Gwendal
    Roussi, Fanny
    Litaudon, Marc
    Touboul, David
    [J]. ANALYTICAL CHEMISTRY, 2018, 90 (23) : 13900 - 13908
  • [30] S plus t-SNE - Bringing Dimensionality Reduction to Data Streams
    Vieira, Pedro C.
    Montrezol, Joao P.
    Vieira, Joao T.
    Gama, Joao
    [J]. ADVANCES IN INTELLIGENT DATA ANALYSIS XXII, PT II, IDA 2024, 2024, 14642 : 95 - 106