Probabilistic summarization via importance-driven sampling for large-scale patch-based scientific data visualization

被引:1
|
作者
Yang Y. [1 ]
Wu Y. [1 ]
Cao Y. [1 ]
机构
[1] Institute of Applied Physics and Computational Mathematics, Beijing
来源
基金
中国博士后科学基金;
关键词
Information entropy; Large-scale patch-based data; Parallel numerical simulation; Probabilistic summarization; Scientific visualization;
D O I
10.1016/j.cag.2022.06.004
中图分类号
学科分类号
摘要
Probabilistic summarization is the process of creating compact statistical representations of the original data. It is used for data reduction, and to facilitate efficient post-hoc visualization for large-scale patch-based data generated in parallel numerical simulation. To ensure high reconstruction accuracy, existing methods typically merge and repartition data patches stored across multiple processor cores, which introduces time-consuming processing. Therefore, this paper proposes a novel probabilistic summarization method for large-scale patch-based scientific data. It considers neighborhood statistical properties by importance-driven sampling guided by the information entropy, thus eliminating the requirement of patch merging and repartitioning. In addition, the reconstruction value of a given spatial location is estimated by coupling the statistical representations of each data patch and the sampling results, thereby maintaining high reconstruction accuracy. We demonstrate the effectiveness of our method using five datasets, with a maximum grid size of one billion. The experimental results show that the method presented in this paper reduced the amount of data by about one order of magnitude. Compared with the current state-of-the-art methods, our method had higher reconstruction accuracy and lower computational cost. © 2022 Elsevier Ltd
引用
收藏
页码:119 / 129
页数:10
相关论文
共 50 条
  • [31] Accurate Multiple View 3D Reconstruction Using Patch-Based Stereo for Large-Scale Scenes
    Shen, Shuhan
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (05) : 1901 - 1914
  • [32] SOM-based Visualization for Classifying Large-scale Sensing Data of Moonquakes
    Goto, Yasumichi
    Yamada, Ryuhei
    Yamamoto, Yukio
    Yokoyama, Shohei
    Ishikawa, Hiroshi
    2013 EIGHTH INTERNATIONAL CONFERENCE ON P2P, PARALLEL, GRID, CLOUD AND INTERNET COMPUTING (3PGCIC 2013), 2013, : 630 - 634
  • [33] Presentation and Expression of Large-Scale Public Building Structures Based on Data Visualization
    Sun, Yong
    Jiang, Liwen
    Zhong, Jie
    INTERNATIONAL JOURNAL OF MULTIPHYSICS, 2024, 18 (03) : 1 - 12
  • [34] GPU-based adaptive data reconstruction for large-scale statistical visualization
    Wu, Yu
    Yang, Yang
    Cao, Yi
    JOURNAL OF VISUALIZATION, 2023, 26 (04) : 899 - 915
  • [35] Visualization of Large-Scale Power Plant Control Data Based on Condition Division
    Ji L.
    Chen Z.
    Huang K.
    Zhao N.
    Kong Y.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2019, 31 (02): : 229 - 240
  • [36] Tasuke: a web-based visualization program for large-scale resequencing data
    Kumagai, Masahiko
    Kim, Jungsok
    Itoh, Ryutaro
    Itoh, Takeshi
    BIOINFORMATICS, 2013, 29 (14) : 1806 - 1808
  • [37] GPU-based adaptive data reconstruction for large-scale statistical visualization
    Yu Wu
    Yang Yang
    Yi Cao
    Journal of Visualization, 2023, 26 : 899 - 915
  • [38] MODEL CHECKING IN LARGE-SCALE DATA SET VIA STRUCTURE-ADAPTIVE-SAMPLING
    Han, Yixin
    Ma, Ping
    Ren, Haojie
    Wang, Zhaojun
    STATISTICA SINICA, 2023, 33 (01) : 303 - 329
  • [39] Data-driven robust optimization for the itinerary planning via large-scale GPS data
    Wu, Lei
    Hifi, Mhand
    KNOWLEDGE-BASED SYSTEMS, 2021, 231
  • [40] Sub-Linear Time Sampling Approach for Large-Scale Data Visualization Using Reinforcement Learning
    Biswas, Ayan
    Bhattacharya, Arindam
    Chen, Yi-Tang
    Shen, Han-Wei
    2023 IEEE 13TH SYMPOSIUM ON LARGE DATA ANALYSIS AND VISUALIZATION, LDAV, 2023, : 12 - 16