HEPart: A balanced hypergraph partitioning algorithm for big data applications

被引:15
|
作者
Yang, Wenyin [1 ,3 ]
Wang, Guojun [2 ]
Choo, Kim-Kwang Raymond [4 ,5 ]
Chen, Shuhong [2 ]
机构
[1] Foshan Univ, Sch Elect Informat Engn, Foshan 528000, Peoples R China
[2] Guangzhou Univ, Sch Comp Sci & Educ Software, Guangzhou 510006, Guangdong, Peoples R China
[3] Cent South Univ, Sch Informat Sci & Engn, Changsha 410083, Hunan, Peoples R China
[4] Univ Texas San Antonio, Dept Informat Syst & Cyber Secur, San Antonio, TX 78249 USA
[5] Univ South Australia, Sch Informat Technol & Math Sci, Adelaide, SA 5095, Australia
基金
中国国家自然科学基金;
关键词
Hypergraph partitioning; Hyperedge partitioning; Load balancing; Big data;
D O I
10.1016/j.future.2018.01.009
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Minimizing the query cost among multi-hosts is important to data processing for big data applications. Hypergraph is good at modeling data and data relationships of complex networks, the typical big data applications, by representing multi-way relationships or interactions as hyperedges. Hypergraph partitioning (HP) helps to partition the query loads on several hosts, enabling the horizontal scaling of large-scale networks. Existing heuristic HP algorithms are generally vertex hypergraph partitioning, designed to minimize the number of cut hyperedges while satisfying the balance requirements of part weights regarding vertices. However, since workloads are mainly produced by group operations, minimizing query costs landing on hyperedges and balancing the workloads should be the objectives in horizontal scaling. We thus propose a heuristic hyperedge partitioning algorithm, HEPart. Specifically, HEPart directly partitions the hypergraph into K sub-hypergraphs with a minimum cutsize for vertices, while satisfying the balance constraint on hyperedge weights, based on the effective move of hyperedges. The I performance of HEPart is evaluated using several complex network datasets modeled by undirected hypergraphs, under different cutsize metrics. The partitioning quality of HEPart is then compared with alternative hyperedge partitioners and vertex hypergraph partitioning algorithms. The experimental findings demonstrate the utility of HEPart (e.g. low cut cost while keeping load balancing as required, especially over scale-free networks). (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:250 / 268
页数:19
相关论文
共 50 条
  • [21] A fast hypergraph min-cut algorithm for circuit partitioning
    Mak, WK
    Wong, DF
    [J]. INTEGRATION-THE VLSI JOURNAL, 2000, 30 (01) : 1 - 11
  • [22] A parallel algorithm for multilevel k-way hypergraph partitioning
    Trifunovic, A
    Knottenbelt, WJ
    [J]. ISPDC 2004: THIRD INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING/HETEROPAR '04: THIRD INTERNATIONAL WORKSHOP ON ALGORITHMS, MODELS AND TOOLS FOR PARALLEL COMPUTING ON HETEROGENEOUS NETWORKS, PROCEEDINGS, 2004, : 114 - 121
  • [23] Using Client-Side Access Partitioning for Data Clustering in Big Data Applications
    Liu, Dapeng
    Xu, Shaochun
    Cui, Zengdi
    [J]. 2014 15TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD), 2014, : 149 - 153
  • [24] Hypergraph-based locality-enhancing methods for graph operations in Big Data applications
    Akbudak, Kadir
    [J]. INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2024, 38 (03): : 210 - 224
  • [25] Research on Balanced Scheduling Algorithm of Big Data in Network Under Cloud Computing
    Ye, Lunqiang
    [J]. ADVANCED HYBRID INFORMATION PROCESSING, ADHIP 2019, PT I, 2019, 301 : 197 - 206
  • [26] Hypergraph-based locality-enhancing methods for graph operations in Big Data applications
    Akbudak, Kadir
    [J]. INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2024, 38 (03): : 210 - 224
  • [27] Sub-Grid Partitioning Algorithm for Distributed Outlier Detection on Big Data
    Sakr, Mohamed
    Atwa, Walid
    Keshk, Arabi
    [J]. PROCEEDINGS OF 2018 13TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND SYSTEMS (ICCES), 2018, : 252 - 257
  • [28] Generalizing p-Laplacian: spectral hypergraph theory and a partitioning algorithm
    Shota Saito
    Mark Herbster
    [J]. Machine Learning, 2023, 112 : 241 - 280
  • [29] Generalizing p-Laplacian: spectral hypergraph theory and a partitioning algorithm
    Saito, Shota
    Herbster, Mark
    [J]. MACHINE LEARNING, 2023, 112 (01) : 241 - 280
  • [30] Multi-Level Refinement Algorithm of Weighted Hypergraph Partitioning Problem
    [J]. Guo, Kai-Qiang (kaiqiangguo@qq.com), 1600, Walter de Gruyter GmbH (26):