The data grid: Towards an architecture for the distributed management and analysis of large scientific datasets

被引:485
|
作者
Chervenak, A [1 ]
Foster, I
Kesselman, C
Salisbury, C
Tuecke, S
机构
[1] Univ So Calif, Inst Informat Sci, Los Angeles, CA 90089 USA
[2] Argonne Natl Lab, Div Math & Comp Sci, Argonne, IL 60439 USA
[3] Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USA
关键词
D O I
10.1006/jnca.2000.0110
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In an increasing number of scientific disciplines, large data collections are emerging as important community resources. In this paper, we introduce design principles for a data management architecture called the data grid. We describe two basic services that we believe are fundamental to the design of a data grid, namely, storage systems and metadata management. Next, we explain how these services can be used to develop higher-level services for replica management and replica selection. We conclude by describing our initial implementation of data grid functionality. (C) 2000 Academic Press.
引用
收藏
页码:187 / 200
页数:14
相关论文
共 50 条
  • [1] MANAGEMENT AND ANALYSIS OF LARGE SCIENTIFIC DATASETS
    SIROVICH, L
    EVERSON, R
    [J]. INTERNATIONAL JOURNAL OF SUPERCOMPUTER APPLICATIONS AND HIGH PERFORMANCE COMPUTING, 1992, 6 (01): : 50 - 68
  • [2] Scientific data management architecture for grid computing environments
    No, J
    Cuong, NT
    Park, SS
    [J]. GRID AND COOPERATIVE COMPUTING - GCC 2005, PROCEEDINGS, 2005, 3795 : 541 - 546
  • [3] Distributed large data-object management architecture
    Johnston, W
    Jin, GJ
    Lee, J
    Thompson, M
    Tierney, B
    [J]. STORAGE AND RETRIEVAL FOR IMAGE AND VIDEO DATABASES V, 1997, 3022 : 478 - 482
  • [4] Grid-based architecture for sharing distributed massive datasets
    Bashir, Mohammed Bakri
    Abd Latiff, Muhammad Shafie
    Yousif, Adil
    [J]. INTERNATIONAL JOURNAL OF COMMUNICATION NETWORKS AND DISTRIBUTED SYSTEMS, 2015, 15 (2-3) : 248 - 264
  • [5] TOWARDS UNIVERSAL CLOUD SERVICE FOR DISTRIBUTED LARGE SCALE SCIENTIFIC DATA
    Xie, Jianjun
    Huang, Junling
    Qian, Fang
    Yu, Jianjun
    [J]. 2012 IEEE 2nd International Conference on Cloud Computing and Intelligent Systems (CCIS) Vols 1-3, 2012, : 323 - 327
  • [6] Developing metadata standards for scientific data reuse in NCSA's distributed grid architecture
    Futrelle, J
    [J]. IGARSS 2000: IEEE 2000 INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOL I - VI, PROCEEDINGS, 2000, : 1217 - 1219
  • [7] Towards autonomic Grid data management with virtualized distributed file systems
    Zhao, Ming
    Xu, Jing
    Figueiredo, Renato J.
    [J]. 3RD INTERNATIONAL CONFERENCE ON AUTONOMIC COMPUTING, PROCEEDINGS, 2005, : 209 - 218
  • [8] Scalable Distributed Data Anonymization for Large Datasets
    di Vimercati, Sabrina De Capitani
    Facchinetti, Dario
    Foresti, Sara
    Livraga, Giovanni
    Oldani, Gianluca
    Paraboschi, Stefano
    Rossi, Matthew
    Samarati, Pierangela
    [J]. IEEE TRANSACTIONS ON BIG DATA, 2023, 9 (03) : 818 - 831
  • [9] Towards distributed visualization and analysis of large flow data
    Hege, HC
    Weinkauf, T
    Prohaska, S
    Hutanu, A
    [J]. JSME INTERNATIONAL JOURNAL SERIES B-FLUIDS AND THERMAL ENGINEERING, 2005, 48 (02) : 241 - 246
  • [10] Towards a Distributed Intelligent ICT Architecture for the Smart Grid
    Ortega de Mues, Mariano
    Alvarez, Alejandro
    Espinoza, Angelina
    Garbajosa, Juan
    [J]. 2011 9TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2011,