DeStager: feature guided in-situ data management in distributed deep memory hierarchies

被引:2
|
作者
Zhang, Xuechen [1 ]
Zheng, Fang [2 ]
Bao Nguyen [1 ]
机构
[1] Washington State Univ, Sch Engn & Comp Sci, Vancouver, WA 98686 USA
[2] IBM TJ Watson Res Ctr, New York, NY USA
关键词
Indexing; R-tree; Octree; In-situ Analytics; SSDs; SIMULATION; COMBUSTION;
D O I
10.1007/s10619-018-7235-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In-situ analytics have been increasingly adopted by leadership scientific applications to gain fast insights into massive output data of simulations. With the current practice, systems buffer the output data in DRAM for analytics processing, constraining it to DRAM capacity un-used by the simulation. The rapid growth of data size requires alternative approaches to accommodating data-rich analytics, such as using solid-state disks to increase effective memory capacity. For this purpose, this paper explores software solutions for exploring the deep memory hierarchies expected on future high-end machines. Leveraging the fact that many analytics are sensitive to data features (regions-of-interest) hidden in the data being processed, the approach incorporates the knowledge of the data features into in-situ data management. It uses adaptive index creation/refinement to reduce the overhead of index management. In addition, it uses data features to predict data skew and improve load balance through controlling data distribution and placement on distributed staging servers. The experimental results show that such feature-guided optimizations achieve substantial improvements over state-of-the-art approaches for managing output data in-situ.
引用
收藏
页码:209 / 231
页数:23
相关论文
共 50 条
  • [41] Flood forecast in complex orography coupling distributed hydro-meteorological models and in-situ and remote sensing data
    M. Verdecchia
    E. Coppola
    C. Faccani
    R. Ferretti
    A. Memmo
    M. Montopoli
    G. Rivolta
    T. Paolucci
    E. Picciotti
    A. Santacasa
    B. Tomassetti
    G. Visconti
    F. S. Marzano
    Meteorology and Atmospheric Physics, 2008, 101 : 267 - 285
  • [42] Benchmark Non-volatile and Volatile Memory Based Hybrid Precision Synapses for In-situ Deep Neural Network Training
    Luo, Yandong
    Yu, Shimeng
    2020 25TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2020, 2020, : 422 - 427
  • [43] Accelerating Deep Neural Network In-Situ Training With Non-Volatile and Volatile Memory Based Hybrid Precision Synapses
    Luo, Yandong
    Yu, Shimeng
    IEEE TRANSACTIONS ON COMPUTERS, 2020, 69 (08) : 1113 - 1127
  • [44] Feature Analysis, Tracking, and Data Reduction: An Application to Multiphase Reactor Simulation MFiX-Exa for In-Situ Use Case
    Biswas, Ayan
    Ahrens, James P.
    Dutta, Soumya
    Musser, Jordan M.
    Almgren, Ann S.
    Turton, Terece L.
    COMPUTING IN SCIENCE & ENGINEERING, 2021, 23 (01) : 75 - 82
  • [45] A framework to nowcast soil moisture with NASA SMAP level 4 data using in-situ measurements and deep learning
    Dashtian, Hassan
    Young, Michael H.
    Young, Bissett E.
    McKinney, Tyson
    Rateb, Ashraf M.
    Niyogi, Dev
    Kumar, Sujay V.
    JOURNAL OF HYDROLOGY-REGIONAL STUDIES, 2024, 56
  • [46] A Distributed Memory Hierarchy and Data Management for Interactive Scene Navigation and Modification on Tiled Display Walls
    Duy-Quoc Lai
    Sajadi, Behzad
    Jiang, Shan
    Meenakshisundaram, Gopi
    Majumder, Aditi
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2015, 21 (06) : 714 - 729
  • [47] Soil property recovery from incomplete in-situ geotechnical test data using a hybrid deep generative framework
    Chen, Weihang
    Ding, Jianwen
    Wang, Tengfei
    Connolly, David P.
    Wan, Xing
    ENGINEERING GEOLOGY, 2023, 326
  • [48] DeepQC: A deep learning system for automatic quality control of in-situ soil moisture sensor time series data
    Bandaru, Lahari
    Irigireddy, Bharat C.
    Koutilya, P. V. N. R.
    Davis, Brian
    SMART AGRICULTURAL TECHNOLOGY, 2024, 8
  • [49] SparkRDF: In-Memory Distributed RDF Management Framework for Large-Scale Social Data
    Xu, Zhichao
    Chen, Wei
    Gai, Lei
    Wang, Tengjiao
    WEB-AGE INFORMATION MANAGEMENT (WAIM 2015), 2015, 9098 : 337 - 349
  • [50] Integration of Earth observation and in-situ spatial data for the development of a decision support tool for technological risk management
    Chrysoulakis, N
    REMOTE SENSING FOR ENVIRONMENTAL MONITORING, GIS APPLICATIONS, AND GEOLOGY II, 2003, 4886 : 448 - 458