Probabilistic summarization via importance-driven sampling for large-scale patch-based scientific data visualization

被引:1
|
作者
Yang Y. [1 ]
Wu Y. [1 ]
Cao Y. [1 ]
机构
[1] Institute of Applied Physics and Computational Mathematics, Beijing
来源
基金
中国博士后科学基金;
关键词
Information entropy; Large-scale patch-based data; Parallel numerical simulation; Probabilistic summarization; Scientific visualization;
D O I
10.1016/j.cag.2022.06.004
中图分类号
学科分类号
摘要
Probabilistic summarization is the process of creating compact statistical representations of the original data. It is used for data reduction, and to facilitate efficient post-hoc visualization for large-scale patch-based data generated in parallel numerical simulation. To ensure high reconstruction accuracy, existing methods typically merge and repartition data patches stored across multiple processor cores, which introduces time-consuming processing. Therefore, this paper proposes a novel probabilistic summarization method for large-scale patch-based scientific data. It considers neighborhood statistical properties by importance-driven sampling guided by the information entropy, thus eliminating the requirement of patch merging and repartitioning. In addition, the reconstruction value of a given spatial location is estimated by coupling the statistical representations of each data patch and the sampling results, thereby maintaining high reconstruction accuracy. We demonstrate the effectiveness of our method using five datasets, with a maximum grid size of one billion. The experimental results show that the method presented in this paper reduced the amount of data by about one order of magnitude. Compared with the current state-of-the-art methods, our method had higher reconstruction accuracy and lower computational cost. © 2022 Elsevier Ltd
引用
收藏
页码:119 / 129
页数:10
相关论文
共 50 条
  • [41] A Sampling-Based Density Peaks Clustering Algorithm for Large-Scale Data
    Ding, Shifei
    Li, Chao
    Xu, Xiao
    Ding, Ling
    Zhang, Jian
    Guo, Lili
    Shi, Tianhao
    PATTERN RECOGNITION, 2023, 136
  • [42] Information-Importance Based Communication for Large-Scale WSN Data Processing
    Sardouk, Ahmad
    Rahim-Amoud, Rana
    Merghem-Boulahia, Leila
    Gaiti, Dominique
    WIRELESS AND MOBILE NETWORKING, PROCEEDINGS, 2009, 308 : 297 - 308
  • [43] Building large-scale density model via a deep-learning-based data-driven method
    Gao, Zhaoqi
    Li, Chuang
    Zhang, Bing
    Jiang, Xiudi
    Pan, Zhibin
    Gao, Jinghuai
    Xu, Zongben
    GEOPHYSICS, 2021, 86 (01) : M1 - M15
  • [44] SalienTime: User-driven Selection of Salient Time Steps for Large-Scale Geospatial Data Visualization
    Chen, Juntong
    Huang, Haiwen
    Ye, Huayuan
    Peng, Zhong
    Li, Chenhui
    Wang, Changbo
    PROCEEDINGS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYTEMS (CHI 2024), 2024,
  • [45] SLIDE - a web-based tool for interactive visualization of large-scale - omics data
    Ghosh, Soumita
    Datta, Abhik
    Tan, Kaisen
    Choi, Hyungwon
    BIOINFORMATICS, 2019, 35 (02) : 346 - 348
  • [46] Data-Driven Sensor Selection using Gumbel-max Sampling for Large-Scale IoT
    Chen, Yuxuan
    Chen, Yuan
    Li, Guobing
    2023 IEEE 97TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-SPRING, 2023,
  • [47] Deep ensemble approach for pathogen classification in large-scale images using patch-based training and hyper-parameter optimization
    Ahmad, Fareed
    Khan, Muhammad Usman Ghani
    Tahir, Ahsen
    Masud, Farhan
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [48] Deep ensemble approach for pathogen classification in large-scale images using patch-based training and hyper-parameter optimization
    Fareed Ahmad
    Muhammad Usman Ghani Khan
    Ahsen Tahir
    Farhan Masud
    BMC Bioinformatics, 24
  • [49] JS']JSweep: A Patch-centric Data-driven Approach for Parallel Sweeps on Large-scale Meshes
    Yan, Jie
    Yang, Zhang
    Zhang, Aiqing
    Mo, Zeyao
    PROCEEDINGS OF THE 52ND INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2023, 2023, : 776 - 785
  • [50] The Effect of Pets on Happiness: A Data-Driven Approach via Large-Scale Social Media
    Wu, Yuchen
    Yuan, Jianbo
    You, Quanzeng
    Luo, Jiebo
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 1889 - 1894