HyPR: Hybrid Page Ranking on Evolving Graphs

Cited by: 3
Authors
Giri, Hemant Kumar [1]
Haque, Mridul [1]
Banerjee, Dip Sankar [1]
Affiliations
[1] Indian Inst Informat Technol Guwahati Bongora, Dept Comp Sci & Engn, Gauhati 781015, Assam, India
Keywords
Heterogeneous Computing; PageRank; CPU plus GPU; Dynamic graphs;
DOI
10.1109/HiPC50609.2020.00020
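Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812 ;
Abstract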
PageRank (PR) is the standard metric used by the Google search engine to compute the importance of a web page by modeling the entire web as a first-order Markov chain. Several prior works have addressed the challenge of computing PR efficiently, with innovations in both algorithms and the use of parallel computing. The standard method of computing PR models the web as a graph. The fast-growing internet adds many new web pages every day, and hence more nodes (representing the web pages) and edges (the hyperlinks) are added to this graph incrementally. Computing PR on this evolving graph is an emerging challenge, since recomputing from scratch on the massive graph is time-consuming and does not scale. In this work, we propose Hybrid Page Rank (HyPR), which computes PR on evolving graphs using collaborative execution on multi-core CPUs and massively parallel GPUs. We exploit data parallelism by efficiently partitioning the graph into regions that are affected and unaffected by the new updates. The partitions are then processed in an overlapped manner for PR updates. The novelty of our technique lies in utilizing the hybrid platform to scale the solution to massive graphs. The technique also provides high performance by processing every batch of updates with a parallel algorithm. HyPR executes efficiently on an NVIDIA V100 GPU hosted on a 6th Gen Intel Xeon CPU and can update a graph with 640M edges with a single batch of 100,000 edges in 12 ms. HyPR outperforms other state-of-the-art techniques for computing PR on evolving graphs [1] by 4.8x. Additionally, HyPR provides a 1.2x speedup over GPU-only execution and a 95x speedup over CPU-only parallel execution.
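The idea of splitting the graph into regions affected and unaffected by a batch of updates can be illustrated with a minimal single-threaded Python sketch. This is not the HyPR implementation, which the abstract describes as a CPU plus GPU scheme; the function names, the damping factor of 0.85, the choice to drop the rank mass of dangling vertices, and the restriction to edge insertions between existing vertices are all assumptions made for illustration. Here a vertex counts as affected if it is reachable from the source of a newly inserted edge, and only those vertices are recomputed.

```python
from collections import deque

DAMPING = 0.85  # assumed damping factor, not taken from the paper


def iterate(n, edges, ranks, active, tol=1e-12, max_iter=1000):
    """Jacobi-style PageRank iteration that updates only vertices in `active`.

    Ranks of all other vertices are held fixed. Rank mass of dangling
    vertices (no out-edges) is simply dropped, a simplification.
    """
    in_nbrs = [[] for _ in range(n)]
    out_deg = [0] * n
    for u, v in edges:
        in_nbrs[v].append(u)
        out_deg[u] += 1
    ranks = list(ranks)
    for _ in range(max_iter):
        new = {}
        delta = 0.0
        for v in active:
            s = sum(ranks[u] / out_deg[u] for u in in_nbrs[v])
            new[v] = (1.0 - DAMPING) / n + DAMPING * s
            delta = max(delta, abs(new[v] - ranks[v]))
        for v, r in new.items():
            ranks[v] = r
        if delta < tol:
            break
    return ranks


def affected_vertices(n, edges, batch):
    """Vertices reachable from the source of any inserted edge (BFS)."""
    out_nbrs = [[] for _ in range(n)]
    for u, v in edges:
        out_nbrs[u].append(v)
    seen = {u for u, _ in batch}
    queue = deque(seen)
    while queue:
        u = queue.popleft()
        for v in out_nbrs[u]:
            if v not in seen:
                seen.add(v)
                queue.append(v)
    return seen


def update_pagerank(n, old_edges, old_ranks, batch):
    """Apply a batch of edge insertions and recompute only affected ranks."""
    edges = old_edges + batch
    affected = affected_vertices(n, edges, batch)
    return iterate(n, edges, old_ranks, affected), affected


# Two 3-cycles; vertex 2 links the first cycle to the second.
n = 6
old_edges = [(0, 1), (1, 2), (2, 0), (2, 3), (3, 4), (4, 5), (5, 3)]
old_ranks = iterate(n, old_edges, [1.0 / n] * n, range(n))
batch = [(4, 3)]  # one inserted edge inside the second cycle
new_ranks, affected = update_pagerank(n, old_edges, old_ranks, batch)
scratch = iterate(n, old_edges + batch, [1.0 / n] * n, range(n))
```

In this toy graph only the second cycle is reachable from the inserted edge, so the first three vertices keep their old ranks and the incremental result agrees with a full recomputation up to the convergence tolerance. Because the vertex count stays fixed, skipping the unaffected region is exact here; adding new vertices would change the teleport term for every vertex and would need separate handling.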
Pages: 62-71
Number of pages: 10