Visualization Framework for High-Dimensional Spatio-Temporal Hydrological Gridded Datasets using Machine-Learning Techniques

被引:20
|
作者
Mazher, Abeer [1 ]
机构
[1] CSIRO, DEI FSP, Melbourne, Vic, Australia
关键词
machine learning; spatio-temporal gridded datasets; dimensionality reduction; color maps; spatial visualization; quality assessment; CLASSIFICATION; WATER; CHEMISTRY; QUALITY;
D O I
10.3390/w12020590
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Numerical modelling increasingly generates massive, high-dimensional spatio-temporal datasets. Exploring such datasets relies on effective visualization. This study presents a generic workflow to (i) project high-dimensional spatio-temporal data on a two-dimensional (2D) plane accurately (ii) compare dimensionality reduction techniques (DRTs) in terms of resolution and computational efficiency (iii) represent 2D projection spatially using a 2D perceptually uniform background color map. Machine learning (ML) based DRTs for data visualization i.e., principal component analysis (PCA), generative topographic mapping (GTM), t-distributed stochastic neighbor embedding (t-SNE) and uniform manifold approximation and projection (UMAP) are compared in terms of accuracy, resolution and computational efficiency to handle massive datasets. The accuracy of visualization is evaluated using a quality metric based on a co-ranking framework. The workflow is applied to an output of an Australian Water Resource Assessment (AWRA) model for Tasmania, Australia. The dataset consists of daily time series of nine components of the water balance at a 5 km grid cell resolution for the year 2017. The case study shows that PCA allows rapid visualization of global data structures, while t-SNE and UMAP allows more accurate representation of local trends. Furthermore, UMAP is computationally more efficient than t-SNE and least affected by the outliers compared to GTM.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Visualization strategies and techniques for high-dimensional spatio-temporal data
    Schmidt, B
    Streit, U
    Uhlenkuken, C
    [J]. GEOGRAPHICAL INFORMATION '97: FROM RESEARCH TO APPLICATION THROUGH COOPERATION, VOLS 1 AND 2, 1997, : 248 - 253
  • [2] An Optimized Interestingness Hotspot Discovery Framework for Large Gridded Spatio-temporal Datasets
    Akdag, Fatih
    Eick, Christoph F.
    [J]. PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 2010 - 2019
  • [3] MACHINE-LEARNING BASED RETRIEVAL OF SOIL MOISTURE AT HIGH SPATIO-TEMPORAL SCALES USING CYGNSS AND SMAP OBSERVATIONS
    Lei, Fangni
    Senyurek, Volkan
    Kurum, Mehmet
    Gurbuz, Ali
    Moorhead, Robert
    Boyd, Dylan
    [J]. IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 4470 - 4473
  • [4] Adaptive surrogate modeling for high-dimensional spatio-temporal output
    Berkcan Kapusuzoglu
    Sankaran Mahadevan
    Shunsaku Matsumoto
    Yoshitomo Miyagi
    Daigo Watanabe
    [J]. Structural and Multidisciplinary Optimization, 2022, 65
  • [5] Adaptive surrogate modeling for high-dimensional spatio-temporal output
    Kapusuzoglu, Berkcan
    Mahadevan, Sankaran
    Matsumoto, Shunsaku
    Miyagi, Yoshitomo
    Watanabe, Daigo
    [J]. STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2022, 65 (10)
  • [6] Gaidai reliability method for high-dimensional spatio-temporal biosystems
    Gaidai, Oleg
    Yakimov, Vladimir
    Niu, Yuhao
    Liu, Zirui
    [J]. BIOSYSTEMS, 2024, 235
  • [7] A high throughput machine-learning driven analysis of Ca2+ spatio-temporal maps
    Leigh, Wesley A.
    Del Valle, Guillermo
    Kamran, Sharif Amit
    Drumm, Bernard T.
    Tavakkoli, Alireza
    Sanders, Kenton M.
    Baker, Salah A.
    [J]. CELL CALCIUM, 2020, 91
  • [8] Spatio-temporal downscaling of gridded crop model yield estimates based on machine learning
    Folberth, C.
    Baklanov, A.
    Balkovic, J.
    Skalsky, R.
    Khabarov, N.
    Obersteiner, M.
    [J]. AGRICULTURAL AND FOREST METEOROLOGY, 2019, 264 : 1 - 15
  • [9] Scalable spatio-temporal Bayesian analysis of high-dimensional electroencephalography data
    Mohammed, Shariq
    Dey, Dipak K.
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2021, 49 (01): : 107 - 128
  • [10] Comparison of classical and machine-learning methods on spatio-temporal modeling of daily Ozone concentrations
    Gualan, Ronald
    Saquicela, Victor
    Long Tran-Thanh
    [J]. 2020 XLVI LATIN AMERICAN COMPUTING CONFERENCE (CLEI 2020), 2021, : 56 - 65