Enabling High Data Throughput in Desktop Grids through Decentralized Data and Metadata Management: The BlobSeer Approach

Cited by: 0
Authors
Nicolae, Bogdan [1]
Antoniu, Gabriel [2]
Bouge, Luc [3]
Affiliations
[1] Univ Rennes 1, IRISA, Rennes, France
[2] IRISA, Ctr Rennes Bretagne Atlant, INRIA, Rennes, France
[3] IRISA, Ecole Normale Super Cachan Brittany, Cachan, France
Keywords
STORAGE-SYSTEM;
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Whereas traditional Desktop Grids rely on centralized servers for data management, some recent progress has been made to enable distributed, large input data, using peer-to-peer (P2P) protocols and Content Distribution Networks (CDN). We go a step further and propose a generic, yet efficient data storage solution which enables the use of Desktop Grids for applications with high output data requirements, where the access grain and the access patterns may be random. Our solution builds on a blob management service enabling a large number of concurrent clients to efficiently read, write, and append huge data that are fragmented and distributed at a large scale. Scalability under heavy concurrency is achieved thanks to an original metadata scheme using a distributed segment tree built on top of a Distributed Hash Table (DHT). The proposed approach has been implemented and its benefits have been successfully demonstrated within our BlobSeer prototype on the Grid'5000 testbed.
Pages: 404 / +
Page count: 3