Scientific data management in a Grid environment

Authors
James H.A. [1]
Hawick K.A. [1]
Affiliation
[1] Institute of Information and Mathematical Sciences, Massey University, Auckland
Keywords
Data Management; Data mining; Grid systems; Metadata; Parameter cross-products
DOI
10.1007/s10723-005-5464-y
Abstract
Managing scientific data is by no means a trivial task, even in a single-site environment with a small number of researchers involved. We discuss some issues concerned with posing well-specified experiments in terms of parameters or instrument settings and the metadata framework that arises from doing so. We are particularly interested in parallel computer simulation experiments, where very large quantities of warehouse-able data are involved, run in a multi-site Grid environment. We consider SQL databases and other framework technologies for manipulating experimental data. Our framework manages the outputs from parallel runs that arise from large cross-products of parameter combinations. Considerable useful experiment planning and analysis can be done with the sparse metadata without fully expanding the parameter cross-products. Extra value can be obtained from simulation output that can subsequently be data-mined. We have particular interests in running large-scale Monte Carlo physics model simulations. Finding ourselves overwhelmed by the problems of managing data and compute resources, we have built a prototype tool using Java and MySQL that addresses these issues. We use this example to discuss type-space management and other fundamental ideas for implementing a laboratory information management system. © Springer 2005.
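The abstract's remark about working with sparse metadata rather than a fully expanded parameter cross-product can be illustrated with a minimal Java sketch. It is not the authors' prototype: the class `ParameterSweep`, its methods, and the parameter names (`temperature`, `latticeSize`, `seed`) are hypothetical, chosen only for illustration. The sketch stores one list of values per parameter, computes the number of runs as a product, and decodes a single run index into its parameter combination on demand (mixed-radix decoding), so the full cross-product is never materialised.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical sketch of a parameter sweep held as sparse metadata. */
public class ParameterSweep {
    // Parameter name -> candidate values, kept in insertion order.
    private final Map<String, List<String>> axes = new LinkedHashMap<>();

    /** Registers one parameter axis (assumed name and values). */
    public ParameterSweep axis(String name, List<String> values) {
        if (values.isEmpty()) {
            throw new IllegalArgumentException("axis needs at least one value: " + name);
        }
        axes.put(name, new ArrayList<>(values));
        return this;
    }

    /** Size of the full cross-product, computed without expanding it. */
    public long size() {
        long n = 1;
        for (List<String> values : axes.values()) {
            n = Math.multiplyExact(n, (long) values.size()); // fails loudly on overflow
        }
        return n;
    }

    /** Decodes a run index (0 <= index < size()) into one parameter combination. */
    public Map<String, String> combination(long index) {
        if (index < 0 || index >= size()) {
            throw new IndexOutOfBoundsException("run index " + index);
        }
        Map<String, String> run = new LinkedHashMap<>();
        long rest = index;
        for (Map.Entry<String, List<String>> axis : axes.entrySet()) {
            List<String> values = axis.getValue();
            // The first axis varies fastest: take the remainder, then shift.
            run.put(axis.getKey(), values.get((int) (rest % values.size())));
            rest /= values.size();
        }
        return run;
    }

    public static void main(String[] args) {
        ParameterSweep sweep = new ParameterSweep()
            .axis("temperature", List.of("1.0", "2.0", "3.0"))
            .axis("latticeSize", List.of("32", "64"))
            .axis("seed", List.of("1", "2"));
        System.out.println(sweep.size() + " runs");
        System.out.println(sweep.combination(7));
    }
}
```

In a setup like this, only the per-parameter value lists would need to be stored (for example as rows in an SQL table), and a run index alone would be enough to recover the settings behind any output file.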
Pages: 39-51
Page count: 12