HEPart: A balanced hypergraph partitioning algorithm for big data applications

被引:15
|
作者
Yang, Wenyin [1 ,3 ]
Wang, Guojun [2 ]
Choo, Kim-Kwang Raymond [4 ,5 ]
Chen, Shuhong [2 ]
机构
[1] Foshan Univ, Sch Elect Informat Engn, Foshan 528000, Peoples R China
[2] Guangzhou Univ, Sch Comp Sci & Educ Software, Guangzhou 510006, Guangdong, Peoples R China
[3] Cent South Univ, Sch Informat Sci & Engn, Changsha 410083, Hunan, Peoples R China
[4] Univ Texas San Antonio, Dept Informat Syst & Cyber Secur, San Antonio, TX 78249 USA
[5] Univ South Australia, Sch Informat Technol & Math Sci, Adelaide, SA 5095, Australia
基金
中国国家自然科学基金;
关键词
Hypergraph partitioning; Hyperedge partitioning; Load balancing; Big data;
D O I
10.1016/j.future.2018.01.009
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Minimizing the query cost among multi-hosts is important to data processing for big data applications. Hypergraph is good at modeling data and data relationships of complex networks, the typical big data applications, by representing multi-way relationships or interactions as hyperedges. Hypergraph partitioning (HP) helps to partition the query loads on several hosts, enabling the horizontal scaling of large-scale networks. Existing heuristic HP algorithms are generally vertex hypergraph partitioning, designed to minimize the number of cut hyperedges while satisfying the balance requirements of part weights regarding vertices. However, since workloads are mainly produced by group operations, minimizing query costs landing on hyperedges and balancing the workloads should be the objectives in horizontal scaling. We thus propose a heuristic hyperedge partitioning algorithm, HEPart. Specifically, HEPart directly partitions the hypergraph into K sub-hypergraphs with a minimum cutsize for vertices, while satisfying the balance constraint on hyperedge weights, based on the effective move of hyperedges. The I performance of HEPart is evaluated using several complex network datasets modeled by undirected hypergraphs, under different cutsize metrics. The partitioning quality of HEPart is then compared with alternative hyperedge partitioners and vertex hypergraph partitioning algorithms. The experimental findings demonstrate the utility of HEPart (e.g. low cut cost while keeping load balancing as required, especially over scale-free networks). (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:250 / 268
页数:19
相关论文
共 50 条
  • [1] Hypergraph Partitioning for Big Data Applications
    Yang, Wenyin
    Ma, Li
    Cui, Ruchun
    Wang, Guojun
    [J]. 2018 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2018, : 1705 - 1710
  • [2] A Distributed Algorithm for Balanced Hypergraph Partitioning
    Yang, Wenyin
    Wang, Guojun
    Ma, Li
    Wu, Shiyang
    [J]. ADVANCES IN SERVICES COMPUTING, 2016, 10065 : 477 - 490
  • [3] Approximate hypergraph partitioning and applications
    Fischer, Eldar
    Matsliah, Arie
    Shapira, Asaf
    [J]. 48TH ANNUAL IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, PROCEEDINGS, 2007, : 579 - +
  • [4] APPROXIMATE HYPERGRAPH PARTITIONING AND APPLICATIONS
    Fischer, Eldar
    Matsliah, Arie
    Shapira, Asaf
    [J]. SIAM JOURNAL ON COMPUTING, 2010, 39 (07) : 3155 - 3185
  • [5] A novel algorithm for multilevel hypergraph partitioning
    [J]. Liu, Y. (ysu678@hotmail.com), 1865, ICIC Express Letters Office, Tokai University, Kumamoto Campus, 9-1-1, Toroku, Kumamoto, 862-8652, Japan (06):
  • [6] An effective algorithm for multiway hypergraph partitioning
    Zhao, ZZ
    Tao, LX
    Zhao, YC
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2002, 49 (08) : 1079 - 1092
  • [7] PaPar: A Parallel Data Partitioning Framework for Big Data Applications
    Wang, Hao
    Zhang, Jing
    Zhang, Da
    Pumma, Sarunya
    Feng, Wu-chun
    [J]. 2017 31ST IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2017, : 605 - 614
  • [8] Partitioning the Impact of Mobile Applications on Big Data Cloud
    Ahmed, Fayyaz
    [J]. 8TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT-2017) AND THE 7TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT 2017), 2017, 109 : 1041 - 1046
  • [9] A Hybrid Hypergraph Partitioning Algorithm for Scientific Computing
    Zeng Yao-yuan
    Zhao Wen-tao
    Wang Zheng-hua
    [J]. MATERIALS PROCESSING AND MANUFACTURING III, PTS 1-4, 2013, 753-755 : 2900 - +
  • [10] Multilevel hypergraph partitioning: Applications in VLSI domain
    Karypis, G
    Aggarwal, R
    Kumar, V
    Shekhar, S
    [J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 1999, 7 (01) : 69 - 79