EMPRESS: Accelerating Scientific Discovery through Descriptive Metadata Management

被引:2
|
作者
Lawson, Margaret [1 ,2 ]
Gropp, William [1 ]
Lofstead, Jay [3 ]
机构
[1] Univ Illinois, 1205 W Clark St, Urbana, IL 61801 USA
[2] 747 6th St S, Kirkland, WA 98033 USA
[3] Sandia Natl Labs, POB 5800 MS 1319, Albuquerque, NM 87185 USA
基金
美国能源部;
关键词
Descriptive metadata; data tagging; high-level indexing; EMPRESS; HDF5; accelerating scientific discovery; Decaf; ATDM; IN-SITU VISUALIZATION; TEMPESTEXTREMES; OPTIMIZATION; EXTRACTION; FRAMEWORK; EFFICIENT; SELECTION; TRACKING; MAP;
D O I
10.1145/3523698
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
High-performance computing scientists are producing unprecedented volumes of data that take a long time to load for analysis. However, many analyses only require loading in the data containing particular features of interest and scientists have many approaches for identifying these features. Therefore, if scientists store information (descriptive metadata) about these identified features, then for subsequent analyses they can use this information to only read in the data containing these features. This can greatly reduce the amount of data that scientists have to read in, thereby accelerating analysis. Despite the potential benefits of descriptive metadata management, no prior work has created a descriptive metadata system that can help scientists working with a wide range of applications and analyses to restrict their reads to data containing features of interest. In this article, we present EMPRESS, the first such solution. EMPRESS offers all of the features needed to help accelerate discovery: It can accelerate analysis by up to 300x, supports a wide range of applications and analyses, is high-performing, is highly scalable, and requires minimal storage space. In addition, EMPRESS offers features required for a production-oriented system: scalable metadata consistency techniques, flexible system configurations, fault tolerance as a service, and portability.
引用
收藏
页数:49
相关论文
共 50 条
  • [1] Accelerating scientific discovery through computation and visualization
    Sims, JS
    Hagedorn, JG
    Ketcham, PM
    Satterfield, SG
    Griffin, TJ
    George, WL
    Fowler, HA
    am Ende, BA
    Hung, HK
    Bohn, RB
    Koontz, JE
    Martys, NS
    Bouldin, CE
    Warren, JA
    Feder, DL
    Clark, CW
    Filla, BJ
    Devaney, JE
    JOURNAL OF RESEARCH OF THE NATIONAL INSTITUTE OF STANDARDS AND TECHNOLOGY, 2000, 105 (06): : 875 - 894
  • [2] Accelerating scientific discovery through crowdsourced computing
    Hindo, Juan
    Pyzer-Knapp, Edward
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2016, 252
  • [3] Accelerating scientific discovery through computation and visualization II
    Sims, JS
    George, WL
    Satterfield, SG
    Hung, HK
    Hagedorn, JG
    Ketcham, PM
    Griffin, TJ
    Hagstrom, SA
    Franiatte, JC
    Bryant, GW
    Jaskólski, W
    Martys, NS
    Bouldin, CE
    Simmons, V
    Nicolas, OP
    Warren, JA
    Ende, BAA
    Koontz, JE
    Filla, BJ
    Pourprix, VG
    Copley, SR
    Bohn, RB
    Peskin, AP
    Parker, YM
    Devaney, JE
    JOURNAL OF RESEARCH OF THE NATIONAL INSTITUTE OF STANDARDS AND TECHNOLOGY, 2002, 107 (03): : 223 - 245
  • [4] Accelerating discovery through better supply management
    Kulkarni, Jayant
    American Laboratory, 2019, 51
  • [5] Accelerating Discovery Through Better Supply Management
    Kulkarni, Jayant
    AMERICAN LABORATORY, 2019, 51 (01) : 18 - 19
  • [6] Accelerating Data-Driven Discovery With Scientific Asset Management
    Schuler, Robert E.
    Kesselman, Carl
    Czajkowski, Karl
    PROCEEDINGS OF THE 2016 IEEE 12TH INTERNATIONAL CONFERENCE ON E-SCIENCE (E-SCIENCE), 2016, : 31 - 40
  • [7] XSEDE: Accelerating Scientific Discovery
    Towns, John
    Cockerill, Tim
    Dahan, Maytal
    Foster, Ian
    Gaither, Kelly
    Grimshaw, Andrew
    Hazlewood, Victor
    Lathrop, Scott
    Lifka, Dave
    Peterson, Gregory D.
    Roskies, Ralph
    Scott, J. Ray
    Wilkins-Diehr, Nancy
    COMPUTING IN SCIENCE & ENGINEERING, 2014, 16 (05) : 62 - 74
  • [8] Using a Robust Metadata Management System to Accelerate Scientific Discovery at Extreme Scales
    Lawson, Margaret
    Lofstead, Jay
    PROCEEDINGS OF 2018 IEEE/ACM 3RD JOINT INTERNATIONAL WORKSHOP ON PARALLEL DATA STORAGE & DATA INTENSIVE SCALABLE COMPUTING SYSTEMS (PDSW-DISCS), 2018, : 13 - 23
  • [9] Metadata management for scientific databases
    Pinoli, Pietro
    Ceri, Stefano
    Martinenghi, Davide
    Nanni, Luca
    INFORMATION SYSTEMS, 2019, 81 : 1 - 20
  • [10] EMPRESS-Extensible Metadata PRovider for Extreme-scale Scientific Simulations
    Lawson, Margaret
    Ulmer, Craig
    Mukherjee, Shyamali
    Templet, Gary
    Lofstead, Jay
    Levy, Scott
    Widener, Patrick
    Kordenbrock, Todd
    PROCEEDINGS OF PDSW-DISCS 2017: 2ND JOINT INTERNATIONAL WORKSHOP ON PARALLEL DATA STORAGE & DATA INTENSIVE SCALABLE COMPUTING SYSTEMS, 2017, : 19 - 24