Maximizing data locality in distributed systems

Cited by: 6
Authors
Chung, Fan
Graham, Ronald
Bhagwan, Ranjita [1]
Savage, Stefan
Savage, Stefan
Affiliations
[1] IBM Corp, TJ Watson Res Ctr, Hawthorne, NY USA
[2] Univ Calif San Diego, Dept Comp Sci & Engn, San Diego, CA 92103 USA
Keywords
bin packing; distributed systems; combinatorial algorithms; approximation algorithms
DOI
10.1016/j.jcss.2006.07.001
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The effectiveness of a distributed system hinges on the manner in which tasks and data are assigned to the underlying system resources. Moreover, today's large-scale distributed systems must accommodate heterogeneity in both the offered load and the makeup of the available storage and compute capacity. The ideal resource assignment must balance the utilization of the underlying system against the loss of locality incurred when individual tasks or data objects are fragmented among several servers. In this paper we describe this locality-maximizing placement problem and show that finding an optimal solution is NP-hard. We then describe a polynomial-time algorithm that generates a placement within an additive constant of two from optimal. © 2006 Elsevier Inc. All rights reserved.
Pages: 1309-1316
Number of pages: 8