EMPRESS: Accelerating Scientific Discovery through Descriptive Metadata Management

被引:2
|
作者
Lawson, Margaret [1 ,2 ]
Gropp, William [1 ]
Lofstead, Jay [3 ]
机构
[1] Univ Illinois, 1205 W Clark St, Urbana, IL 61801 USA
[2] 747 6th St S, Kirkland, WA 98033 USA
[3] Sandia Natl Labs, POB 5800 MS 1319, Albuquerque, NM 87185 USA
基金
美国能源部;
关键词
Descriptive metadata; data tagging; high-level indexing; EMPRESS; HDF5; accelerating scientific discovery; Decaf; ATDM; IN-SITU VISUALIZATION; TEMPESTEXTREMES; OPTIMIZATION; EXTRACTION; FRAMEWORK; EFFICIENT; SELECTION; TRACKING; MAP;
D O I
10.1145/3523698
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
High-performance computing scientists are producing unprecedented volumes of data that take a long time to load for analysis. However, many analyses only require loading in the data containing particular features of interest and scientists have many approaches for identifying these features. Therefore, if scientists store information (descriptive metadata) about these identified features, then for subsequent analyses they can use this information to only read in the data containing these features. This can greatly reduce the amount of data that scientists have to read in, thereby accelerating analysis. Despite the potential benefits of descriptive metadata management, no prior work has created a descriptive metadata system that can help scientists working with a wide range of applications and analyses to restrict their reads to data containing features of interest. In this article, we present EMPRESS, the first such solution. EMPRESS offers all of the features needed to help accelerate discovery: It can accelerate analysis by up to 300x, supports a wide range of applications and analyses, is high-performing, is highly scalable, and requires minimal storage space. In addition, EMPRESS offers features required for a production-oriented system: scalable metadata consistency techniques, flexible system configurations, fault tolerance as a service, and portability.
引用
收藏
页数:49
相关论文
共 50 条
  • [21] Accelerating material design with the generative toolkit for scientific discovery
    Matteo Manica
    Jannis Born
    Joris Cadow
    Dimitrios Christofidellis
    Ashish Dave
    Dean Clarke
    Yves Gaetan Nana Teukam
    Giorgio Giannone
    Samuel C. Hoffman
    Matthew Buchan
    Vijil Chenthamarakshan
    Timothy Donovan
    Hsiang Han Hsu
    Federico Zipoli
    Oliver Schilter
    Akihiro Kishimoto
    Lisa Hamada
    Inkit Padhi
    Karl Wehden
    Lauren McHugh
    Alexy Khrabrov
    Payel Das
    Seiji Takeda
    John R. Smith
    npj Computational Materials, 9
  • [22] Accelerating material design with the generative toolkit for scientific discovery
    Manica, Matteo
    Born, Jannis
    Cadow, Joris
    Christofidellis, Dimitrios
    Dave, Ashish
    Clarke, Dean
    Teukam, Yves Gaetan Nana
    Giannone, Giorgio
    Hoffman, Samuel C.
    Buchan, Matthew
    Chenthamarakshan, Vijil
    Donovan, Timothy
    Hsu, Hsiang Han
    Zipoli, Federico
    Schilter, Oliver
    Kishimoto, Akihiro
    Hamada, Lisa
    Padhi, Inkit
    Wehden, Karl
    McHugh, Lauren
    Khrabrov, Alexy
    Das, Payel
    Takeda, Seiji
    Smith, John R.
    NPJ COMPUTATIONAL MATERIALS, 2023, 9 (01)
  • [23] Accelerating Scientific Discovery With AI-Aided Automation
    Schneider, Tapio
    Altintas, Ilkay
    Atkins, Daniel
    COMPUTING IN SCIENCE & ENGINEERING, 2023, 25 (05) : 27 - 30
  • [24] Exploring Metadata Search Essentials for Scientific Data Management
    Zhang, Wei
    Byna, Suren
    Niu, Chenxu
    Chen, Yong
    2019 IEEE 26TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC), 2019, : 83 - 92
  • [25] Statistical integration through metadata management
    Colledge, MJ
    INTERNATIONAL STATISTICAL REVIEW, 1999, 67 (01) : 79 - 98
  • [26] Accelerating antibiotic discovery through artificial intelligence
    Marcelo C. R. Melo
    Jacqueline R. M. A. Maasch
    Cesar de la Fuente-Nunez
    Communications Biology, 4
  • [27] Accelerating Neoproterozoic research through scientific drilling
    Condon, D. J.
    Boggiani, P.
    Fike, D.
    Halverson, G. P.
    Kasemann, S.
    Knoll, A. H.
    Macdonald, F. A.
    Prave, A. R.
    Zhu, M.
    SCIENTIFIC DRILLING, 2015, 19 : 17 - 25
  • [28] Learning to Rank Complex Biomedical Hypotheses for Accelerating Scientific Discovery
    Ding, Juncheng
    Dahal, Shailesh
    Adhikari, Bijaya
    Jha, Kishlay
    2024 IEEE 12TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS, ICHI 2024, 2024, : 285 - 293
  • [29] Accelerating antibiotic discovery through artificial intelligence
    Melo, Marcelo C. R.
    Maasch, Jacqueline R. M. A.
    de la Fuente-Nunez, Cesar
    COMMUNICATIONS BIOLOGY, 2021, 4 (01)
  • [30] Scientific Discovery Framework Accelerating Advanced Polymeric Materials Design
    Wang, Ran
    Fu, Teng
    Yang, Ya-Jie
    Song, Xuan
    Wang, Xiu-Li
    Wang, Yu-Zhong
    RESEARCH, 2024, 7