A data-locality-aware task scheduler for distributed social graph queries

被引:2
|
作者
Jin, Jiahui [1 ]
Luo, Junzhou [1 ]
Du, Mingyang [1 ]
Dang, Yongcheng [1 ]
Li, Feng [1 ]
Zhang, Jinghui [1 ]
Song, Aibo [1 ]
机构
[1] Southeast Univ, Sch Comp Sci & Engn, Nanjing, Jiangsu, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Task scheduling; Data locality; Graph query; Social graph; Distributed system; MAPREDUCE; FRAMEWORK;
D O I
10.1016/j.future.2018.04.086
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
For large-scale online social networks such as Facebook and Twitter, network analysis often uses graph queries to extract network information. Because of the work and memory required, usually such queries are performed in a distributed manner. However, most existing distributed graph computation systems optimize for offline graph analysis rather than online graph queries. The problem with this approach is that graph query tasks then must transfer a large volume of data and interactively answer queries within a short time frame. To resolve this, we propose a novel data-locality-aware task scheduling algorithm that optimizes interactive distributed graph queries. The scheduling algorithm jointly considers data placement and graph topology to reduce data transfer costs. After implementing the scheduling algorithm in a real-world distributed graph computation system, we evaluate the task scheduler's effectiveness through simulations and real-life social graph queries. Results show that our scheduler reduces the querying time by one order of magnitude. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:1010 / 1022
页数:13
相关论文
共 50 条
  • [1] An Enhanced Data-Locality-Aware Task Scheduling Algorithm for Hadoop Applications
    Choi, Dongjoo
    Jeon, Myunghoon
    Kim, Namgi
    Lee, Byoung-Dai
    [J]. IEEE SYSTEMS JOURNAL, 2018, 12 (04): : 3346 - 3357
  • [2] A highly efficient data locality aware task scheduler for cloud-based systems
    Ru, Jia
    Yang, Yun
    Grundy, John
    Keung, Jacky
    Hao, Li
    [J]. 2019 IEEE 12TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (IEEE CLOUD 2019), 2019, : 496 - 498
  • [3] Data-locality-aware mapreduce real-time scheduling framework
    Kao, Yu-Chon
    Chen, Ya-Shu
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2016, 112 : 65 - 77
  • [4] Data-Locality-Aware User Grouping in Cloud Radio Access Networks
    Ao, Weng Chon
    Psounis, Konstantinos
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2018, 17 (11) : 7295 - 7308
  • [5] LoadAtomizer: A Locality and I/O Load aware Task Scheduler for MapReduce
    Asahara, Masato
    Nakadai, Shinji
    Araki, Takuya
    [J]. 2012 IEEE 4TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM), 2012,
  • [6] Data Locality Aware Algorithm for Task Execution on Distributed, Cloud Based Environments
    Bica, Mihai
    Gorgan, Dorian
    [J]. COMPLEX, INTELLIGENT, AND SOFTWARE INTENSIVE SYSTEMS, CISIS-2017, 2018, 611 : 557 - 566
  • [7] Locality-Aware Dynamic Task Graph Scheduling
    Maglalang, Jordyn
    Krishnamoorthy, Sriram
    Agrawal, Kunal
    [J]. 2017 46TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2017, : 70 - 80
  • [8] Autonomous and Distributed Construction of Locality Aware Skip Graph
    Toda, Takahiro
    Tanigawa, Yosuke
    Tode, Hideki
    [J]. 2017 14TH IEEE ANNUAL CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE (CCNC), 2017, : 33 - 36
  • [9] A Predictive Map Task Scheduler for Optimizing Data Locality in MapReduce Clusters
    Merabet, Mohamed
    Benslimane, Sidi Mohamed
    Barhamgi, Mahmoud
    Bonnet, Christine
    [J]. INTERNATIONAL JOURNAL OF GRID AND HIGH PERFORMANCE COMPUTING, 2018, 10 (04) : 1 - 14
  • [10] GT-scheduler: a hybrid graph-partitioning and tabu-search based task scheduler for distributed data stream processing systems
    Hadian, Hamid
    Sharifi, Mohsen
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (05): : 5815 - 5832