Federated Big Data for resource aggregation and load balancing with DIRAC

被引:2
|
作者
Fernandez, Victor [1 ]
Mendez, Victor [2 ]
Pena, Tomas F. [3 ]
机构
[1] Univ Santiago de Compostela, Dept Particle Phys, Santiago De Compostela, Spain
[2] Univ Autonoma Barcelona, CAOS, E-08193 Barcelona, Spain
[3] Univ Santiago de Compostela, Res Ctr Informat Technol CiTIUS, Santiago De Compostela, Spain
关键词
Big Data federation; DIRAC; MapReduce; Hadoop; Cloud Computing;
D O I
10.1016/j.procs.2015.05.430
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
BigDataDIRAC is a Big Data solution with a Distributed Infrastructure with Remote Agent Control (DIRAC) access point. Users have the opportunity to access multiple Big Data resources scattered in different geographical areas, such as access to grid resources. This approach opens the possibility of offering not only grid and cloud to the users, but also Big Data resources from the same DIRAC environment. In this work, we describe a system to allow access to a federation of Big Data resources, including load balancing, using DIRAC. Our results demonstrate the ability of BigDataDIRAC to manage jobs driven by dataset location stored in the Hadoop File System (HDFS) of the Hadoop distributed clusters. DIRAC is used to monitor the execution, collect the necessary statistical data, and upload the results from the remote HDFS to the SandBox Storage machine. Performance results demonstrate that BigDataDIRAC load balancing is able to aggregate resources in an efficient manner.
引用
收藏
页码:2769 / 2773
页数:5
相关论文
共 50 条
  • [1] Exploration on load balancing data aggregation algorithm in wireless sensor network based on big data artificial intelligence
    Jiang, Huilong
    [J]. INTERNET TECHNOLOGY LETTERS, 2024, 7 (02)
  • [2] I/O Load Balancing for Big Data HPC Applications
    Paul, Arnab K.
    Goyal, Arpit
    Wang, Feiyi
    Oral, Sarp
    Butt, Ali R.
    Brim, Michael J.
    Srinivasa, Sangeetha B.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 233 - 242
  • [3] Research on dynamic load balancing of data flow under big data platform
    Sun, Junlin
    Zhang, Yi
    [J]. INTERNATIONAL JOURNAL OF MODELING SIMULATION AND SCIENTIFIC COMPUTING, 2021, 12 (02)
  • [4] Resource augmentation in load balancing
    Azar, Y
    Epstein, L
    van Stee, R
    [J]. ALGORITHM THEORY - SWAT 2000, 2000, 1851 : 189 - 199
  • [5] A generic API for load balancing in distributed systems for big data management
    Antoine, Maeva
    Pellegrino, Laurent
    Huet, Fabrice
    Baude, Francoise
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2016, 28 (08): : 2440 - 2456
  • [6] A Study on Load Balancing Techniques for Task Allocation in Big Data Processing
    Jin Xiaohong
    Li Hui
    Liu Yanjun
    Fan Yanfang
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL FORUM ON MECHANICAL, CONTROL AND AUTOMATION (IFMCA 2016), 2017, 113 : 212 - 218
  • [7] Load Balancing for Privacy-Preserving Access to Big Data in Cloud
    Li, Peng
    Guo, Song
    [J]. 2014 IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2014, : 524 - 528
  • [8] Improving load balancing for data-duplication in big data cloud computing networks
    Javadpour, Amir
    Abadi, Ali Majed Hossein
    Rezaei, Samira
    Zomorodian, Mozhdeh
    Rostami, Ali Shokouhi
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2022, 25 (04): : 2613 - 2631
  • [9] Improving load balancing for data-duplication in big data cloud computing networks
    Amir Javadpour
    Ali Majed Hossein Abadi
    Samira Rezaei
    Mozhdeh Zomorodian
    Ali Shokouhi Rostami
    [J]. Cluster Computing, 2022, 25 : 2613 - 2631
  • [10] Dynamic Optical Data Center Network Load Balancing and Resource Allocation
    Huang, Henna
    Chan, Vincent W. S.
    [J]. 2016 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2016,