Gaussian Cubes: Real-Time Modeling for Visual Exploration of Large Multidimensional Datasets

被引:37
|
作者
Wang, Zhe [1 ]
Ferreira, Nivan [2 ]
Wei, Youhao [1 ]
Bhaskar, Aarthy Sankari [1 ]
Scheidegger, Carlos [1 ]
机构
[1] Univ Arizona, Tucson, AZ 85721 USA
[2] Univ Fed Pernambuco, Recife, PE, Brazil
基金
美国国家科学基金会;
关键词
Data modeling; dimensionality reduction; interactive visualization; data cubes; VISUALIZATION; REGRESSION; AGGREGATION; COMPRESSION;
D O I
10.1109/TVCG.2016.2598694
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Recently proposed techniques have finally made it possible for analysts to interactively explore very large datasets in real time. However powerful, the class of analyses these systems enable is somewhat limited: specifically, one can only quickly obtain plots such as histograms and heatmaps. In this paper, we contribute Gaussian Cubes, which significantly improves on state-of-the-art systems by providing interactive modeling capabilities, which include but are not limited to linear least squares and principal components analysis (PCA). The fundamental insight in Gaussian Cubes is that instead of precomputing counts of many data subsets (as state-of-the-art systems do), Gaussian Cubes precomputes the best multivariate Gaussian for the respective data subsets. As an example, Gaussian Cubes can fit hundreds of models over millions of data points in well under a second, enabling novel types of visual exploration of such large datasets. We present three case studies that highlight the visualization and analysis capabilities in Gaussian Cubes, using earthquake safety simulations, astronomical catalogs, and transportation statistics. The dataset sizes range around one hundred million elements and 5 to 10 dimensions. We present extensive performance results, a discussion of the limitations in Gaussian Cubes, and future research directions.
引用
收藏
页码:681 / 690
页数:10
相关论文
共 50 条
  • [1] Real-Time Exploration of Large Spatiotemporal Datasets Based on Order Statistics
    Pahins, Cicero A. L.
    Ferreira, Nivan
    Comba, Joao L.
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2020, 26 (11) : 3314 - 3326
  • [2] Nanocubes for Real-Time Exploration of Spatiotemporal Datasets
    Lins, Lauro
    Klosowski, James T.
    Scheidegger, Carlos
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2013, 19 (12) : 2456 - 2465
  • [3] AVIST: A GPU-Centric Design for Visual Exploration of Large Multidimensional Datasets
    Mi, Peng
    Sun, Maoyuan
    Masiane, Moeti
    Cao, Yong
    North, Chris
    [J]. INFORMATICS-BASEL, 2016, 3 (04):
  • [4] Similarity-based visual exploration of very large georeferenced multidimensional datasets
    Peralta-Aranibar, Roger
    Pahins, Cicero A. L.
    Comba, Joao L. D.
    Gomez-Nieto, Erick
    [J]. SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, : 683 - 686
  • [5] A Hierarchical 3D Gaussian Representation for Real-Time Rendering of Very Large Datasets
    Kerbl, Bernhard
    Meuleman, Andreas
    Kopanas, Georgios
    Wimmer, Michael
    Lanvin, Alexandre
    Drettakis, George
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2024, 43 (04):
  • [6] An Introduction to the Multidimensional Real-Time Economic Modeling
    Estrada, Mario Arturo Ruiz
    Chandran, Vgr
    Tahir, Muhammad
    [J]. CONTEMPORARY ECONOMICS, 2016, 10 (01) : 55 - 70
  • [7] Real-Time Visual Saliency Detection Using Gaussian Distribution
    Wang, Haoqian
    Zhang, Chunlong
    Wang, Xingzheng
    [J]. 2014 INTERNATIONAL CONFERENCE ON MEDICAL BIOMETRICS (ICMB 2014), 2014, : 200 - 203
  • [8] Real-time Large-scale Deformation of Gaussian Splatting
    Gao, Lin
    Yang, Jie
    Zhang, Bo-Tao
    Sun, Jia-Mu
    Yuan, Yu-Jie
    Fu, Hongbo
    Lai, Yu-Kun
    [J]. ACM Transactions on Graphics, 2024, 43 (06):
  • [9] Real-time visualization of large volume datasets on standard PC hardware
    Xie, Kai
    Yang, Jie
    Zhu, Y. M.
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2008, 90 (02) : 117 - 123
  • [10] Real-time outlier detection for large datasets by RT-DetMCD
    De Ketelaere, Bart
    Hubert, Mia
    Raymaekers, Jakob
    Rousseeuw, Peter J.
    Vranckx, Iwein
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2020, 199