DeStager: feature guided in-situ data management in distributed deep memory hierarchies

Cited by: 2
Authors
Zhang, Xuechen [1]
Zheng, Fang [2]
Bao Nguyen [1]
Affiliations
[1] Washington State Univ, Sch Engn & Comp Sci, Vancouver, WA 98686 USA
[2] IBM TJ Watson Res Ctr, New York, NY USA
Keywords
Indexing; R-tree; Octree; In-situ Analytics; SSDs; Simulation; Combustion
DOI
10.1007/s10619-018-7235-3
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
In-situ analytics have been increasingly adopted by leadership scientific applications to gain fast insights into the massive output data of simulations. In current practice, systems buffer the output data in DRAM for analytics processing, which constrains the analytics to the DRAM capacity left unused by the simulation. The rapid growth of data size calls for alternative approaches to accommodating data-rich analytics, such as using solid-state disks to increase effective memory capacity. To this end, this paper explores software solutions for exploiting the deep memory hierarchies expected on future high-end machines. Leveraging the fact that many analytics are sensitive to data features (regions of interest) hidden in the data being processed, the approach incorporates knowledge of those features into in-situ data management. It uses adaptive index creation and refinement to reduce the overhead of index management. In addition, it uses data features to predict data skew and to improve load balance by controlling data distribution and placement on distributed staging servers. The experimental results show that such feature-guided optimizations achieve substantial improvements over state-of-the-art approaches for managing output data in-situ.
Pages: 209-231
Number of pages: 23
Related papers
50 items in total
  • [31] Noise attenuation in distributed acoustic sensing data using a guided unsupervised deep learning network
    Saad, Omar M.
    Ravasi, Matteo
    Alkhalifah, Tariq
    GEOPHYSICS, 2024, 89 (06) : V573 - V587
  • [32] Leveraging on Deep Memory Hierarchies to Minimize Energy Consumption and Data Access Latency on Single-Chip Cloud Computers
    Maqsood, Tahir
    Tziritas, Nikos
    Loukopoulos, Thanasis
    Madani, Sajjad A.
    Khan, Samee U.
    Xu, Cheng-Zhong
    IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, 2017, 2 (02) : 154 - 166
  • [33] Scalable Data Management on Hybrid Memory System for Deep Neural Network Applications
    Rang, Wei
    Yang, Donglin
    Li, Zhimin
    Cheng, Dazhao
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021 : 1470 - 1480
  • [34] Distributed Range-Based Meta-Data Management for an In-Memory Storage
    Klein, Florian
    Beineke, Kevin
    Schoettner, Michael
    EURO-PAR 2015: PARALLEL PROCESSING WORKSHOPS, 2015, 9523 : 3 - 15
  • [35] Deep learning-based retrieval of cyanobacteria pigment in inland water for in-situ and airborne hyperspectral data
    Yim, Inhyeok
    Shin, Jihoon
    Lee, Hyuk
    Park, Sanghyun
    Nam, Gibeom
    Kang, Taegu
    Cho, Kyung Hwa
    Cha, YoonKyung
    ECOLOGICAL INDICATORS, 2020, 110
  • [36] Prediction of deep soil water content (0-5 m) with in-situ and remote sensing data
    Zhu, Zhaocen
    Zhao, Chunlei
    Jia, Xiaoxu
    Wang, Jiao
    Shao, Mingan
    CATENA, 2023, 222
  • [37] DEEP LEARNING-BASED DATA FUSION METHOD FOR IN-SITU POROSITY DETECTION IN LASER-BASED ADDITIVE MANUFACTURING
    Tian, Qi
    Guo, Shenghan
    Guo, Weihong
    Bian, Linkan
    PROCEEDINGS OF THE ASME 2020 15TH INTERNATIONAL MANUFACTURING SCIENCE AND ENGINEERING CONFERENCE (MSEC2020), VOL 2B, 2020
  • [38] Remote sensing, mathematical modeling and in-situ data for improving coastal management supporting tools Preface
    Neves, Ramiro
    Mateus, Marcos D.
    JOURNAL OF MARINE SYSTEMS, 2012, 94 : S1 - S1
  • [39] Remote Sensing and GIS in Natural Resource Management: Comparing Tools and Emphasizing the Importance of In-Situ Data
    Sharma, Sanjeev
    Beslity, Justin O.
    Rustad, Lindsey
    Shelby, Lacy J.
    Manos, Peter T.
    Khanal, Puskar
    Reinmann, Andrew B.
    Khanal, Churamani
    REMOTE SENSING, 2024, 16 (22)
  • [40] Flood forecast in complex orography coupling distributed hydro-meteorological models and in-situ and remote sensing data
    Verdecchia, M.
    Coppola, E.
    Faccani, C.
    Ferretti, R.
    Memmo, A.
    Montopoli, M.
    Rivolta, G.
    Paolucci, T.
    Picciotti, E.
    Santacasa, A.
    Tomassetti, B.
    Visconti, G.
    Marzano, F. S.
    METEOROLOGY AND ATMOSPHERIC PHYSICS, 2008, 101 (3-4) : 267 - 285