Maximizing data locality in distributed systems

被引:6
|
作者
Chung, Fan
Graharn, Ronald
Bhagwan, Ranjita [1 ]
Savage, Stefan
机构
[1] IBM Corp, TJ Watson Res Ctr, Hawthorne, NY USA
[2] Univ Calif San Diego, Dept Comp Sci & Engn, San Diego, CA 92103 USA
关键词
bin packing; distributed systems; combinatorial algorithms; approximation algorithms;
D O I
10.1016/j.jcss.2006.07.001
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The effectiveness of a distributed system hinges on the manner in which tasks and data are assigned to the underlying system resources. Moreover, today's large-scale distributed systems must accommodate heterogeneity in both the offered load and in the makeup of the available storage and compute capacity. The ideal resource assignment must balance the utilization of the underlying system against the loss of locality incurred when individual tasks or data objects are fragmented among several servers. In this paper we describe this locality-maximizing placement problem and show that an optimal solution is NP-hard. We then describe a polynomial-time algorithm that generates a placement within an additive constant of two from optimal. (C) 2006 Elsevier Inc. All rights reserved.
引用
收藏
页码:1309 / 1316
页数:8
相关论文
共 50 条
  • [41] A data-locality-aware task scheduler for distributed social graph queries
    Jin, Jiahui
    Luo, Junzhou
    Du, Mingyang
    Dang, Yongcheng
    Li, Feng
    Zhang, Jinghui
    Song, Aibo
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 93 : 1010 - 1022
  • [42] Locality and Availability in Distributed Storage
    Rawat, Ankit Singh
    Papailiopoulos, Dimitris S.
    Dimakis, Alexandros G.
    Vishwanath, Sriram
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2016, 62 (08) : 4481 - 4493
  • [43] Adapting to Access Locality via Live Data Migration in Globally Distributed Datastores
    Charapko, Aleksey
    Ailijiang, Ailidani
    Demirbas, Murat
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 3321 - 3330
  • [44] Distributed File System to Leverage Data Locality for Large-File Processing
    da Silva, Erico Correia
    Sato, Liria Matsumoto
    Midorikawa, Edson Toshimi
    ELECTRONICS, 2024, 13 (01)
  • [45] Data Locality Aware Algorithm for Task Execution on Distributed, Cloud Based Environments
    Bica, Mihai
    Gorgan, Dorian
    COMPLEX, INTELLIGENT, AND SOFTWARE INTENSIVE SYSTEMS, CISIS-2017, 2018, 611 : 557 - 566
  • [46] AN EFFICIENT DATA LOCALITY SYSTEM FOR BIG DATA PROCESSING OVER DISTRIBUTED DATA CENTRE BY USING SCHEDULING TECHNIQUE
    Kaur, Navneet
    Gupta, S. C.
    Kumar, Deepak
    ADVANCES AND APPLICATIONS IN MATHEMATICAL SCIENCES, 2019, 18 (08): : 651 - 664
  • [47] Scalable Family of Codes with Locality and Availability for Information Repair in Distributed Storage Systems
    Farkas, Peter
    2019 4TH INTERNATIONAL CONFERENCE ON SMART AND SUSTAINABLE TECHNOLOGIES (SPLITECH), 2019, : 156 - 159
  • [48] Locality preserving scheme of text databases representative in distributed information retrieval systems
    Hassan, Mohammad
    Al-Lahham, Yaser A.
    Journal of Digital Information Management, 2011, 9 (05): : 193 - 198
  • [49] Locality Preserving Scheme of Text Databases Representative in Distributed Information Retrieval Systems
    Hassan, Mohammad
    Hasan, Yaser
    NETWORKED DIGITAL TECHNOLOGIES, PT 2, 2010, 88 : 162 - 171
  • [50] A Library for Portable and Composable Data Locality Optimizations for NUMA Systems
    Majo Z.
    Gross T.R.
    2017, Association for Computing Machinery, 2 Penn Plaza, Suite 701, New York, NY 10121-0701, United States (03)