Scientific data management in a Grid environment

被引:0
|
作者
James H.A. [1 ]
Hawick K.A. [1 ]
机构
[1] Institute of Information and Mathematical Sciences, Massey University, Auckland
关键词
Data Management; Data mining; Grid systems; Metadata; Parameter cross-products;
D O I
10.1007/s10723-005-5464-y
中图分类号
学科分类号
摘要
Managing scientific data is by no means a trivial task even in a single site environment with a small number of researchers involved. We discuss some issues concerned with posing well-specified experiments in terms of parameters or instrument settings and the metadata framework that arises from doing so. We are particularly interested in parallel computer simulation experiments, where very large quantities of warehouse-able data are involved, run in a multi-site Grid environment. We consider SQL databases and other framework technologies for manipulating experimental data. Our framework manages the outputs from parallel runs that arise from large cross-products of parameter combinations. Considerable useful experiment planning and analysis can be done with the sparse metadata without fully expanding the parameter cross-products. Extra value can be obtained from simulation output that can subsequently be data-mined. We have particular interests in running large scale Monte Carlo physics model simulations. Finding ourselves overwhelmed by the problems of managing data and compute resources, we have built a prototype tool using Java and MySQL that addresses these issues. We use this example to discuss type-space management and other fundamental ideas for implementing a laboratory information management system. © Springer 2005.
引用
收藏
页码:39 / 51
页数:12
相关论文
共 50 条
  • [41] Distributed Data Mining in the Grid Environment
    SelvaLakshmi, C. B.
    Murali, S.
    Chanthiya, P.
    Karthikayan, P. N.
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON INTERNET COMPUTING AND INFORMATION COMMUNICATIONS (ICICIC GLOBAL 2012), 2014, 216 : 151 - 157
  • [42] A real time data visualization and analysis environment, scientific data management of large weather radar archives
    Toussaint, M
    Malkomes, M
    Hagen, M
    Höller, H
    Meischner, P
    PHYSICS AND CHEMISTRY OF THE EARTH PART B-HYDROLOGY OCEANS AND ATMOSPHERE, 2000, 25 (10-12): : 1001 - 1003
  • [43] Towards secure data management system for grid environment based on the cell broadband engine
    Wyrzykowski, Roman
    Kuczynski, Lukasz
    PARALLEL PROCESSING AND APPLIED MATHEMATICS, 2008, 4967 : 825 - 834
  • [44] Replica Management in Data Grid
    Al Mistarihi, Husni Hamad E.
    Yong, Chan Huah
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2008, 8 (06): : 22 - 32
  • [45] Data management for grid environments
    Stockinger, H
    Rana, OF
    Moore, R
    Merzky, A
    HIGH-PERFORMANCE COMPUTING AND NETWORKING, 2001, 2110 : 151 - 160
  • [46] A framework for data management in the grid
    Nguyen, Thi-Mai-Huong
    Magoules, Frederic
    DCABES 2007 PROCEEDINGS, VOLS I AND II, 2007, : 629 - 633
  • [47] A survey on data replication strategies in a Data Grid environment
    Naseera, Shaik
    MULTIAGENT AND GRID SYSTEMS, 2016, 12 (04) : 253 - 269
  • [48] Tasks Scheduling and Resource Allocation for High Data Management in Scientific Cloud Computing Environment
    Djebbar, Esma Insaf
    Belalem, Ghalem
    MOBILE, SECURE, AND PROGRAMMABLE NETWORKING (MSPN 2016), 2016, 10026 : 16 - 27
  • [49] Rucio: Scientific Data Management
    Barisits M.
    Beermann T.
    Berghaus F.
    Bockelman B.
    Bogado J.
    Cameron D.
    Christidis D.
    Ciangottini D.
    Dimitrov G.
    Elsing M.
    Garonne V.
    di Girolamo A.
    Goossens L.
    Guan W.
    Guenther J.
    Javurek T.
    Kuhn D.
    Lassnig M.
    Lopez F.
    Magini N.
    Molfetas A.
    Nairz A.
    Ould-Saada F.
    Prenner S.
    Serfon C.
    Stewart G.
    Vaandering E.
    Vasileva P.
    Vigne R.
    Wegner T.
    Computing and Software for Big Science, 2019, 3 (1)
  • [50] Active management of scientific data
    Plale, B
    Gannon, D
    Alameda, J
    Wilhelmson, B
    Hampton, S
    Rossi, A
    Droegemeier, K
    IEEE INTERNET COMPUTING, 2005, 9 (01) : 27 - 34