A heuristic search algorithm based on subspaces for PageRank computation

Cited by: 1
Author
Miyata, Takafumi [1]
Affiliation
[1] Fukuoka Inst Technol, Dept Comp Sci & Engn, Higashi Ku, 3-30-1 Wajiro Higashi, Fukuoka, Fukuoka 8110295, Japan
Source
JOURNAL OF SUPERCOMPUTING | 2018 / Vol. 74 / Issue 07
Keywords
PageRank; Google matrix; Power iteration; Krylov subspace; Residual minimization; Parallel computing;
DOI
10.1007/s11227-018-2383-9
Chinese Library Classification
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
We study a fast algorithm for large-scale PageRank computation. PageRank is the measure the Google search engine uses to model the importance of web pages. It is defined as the dominant eigenvector of a particular stochastic matrix, the Google matrix, derived from the link graph of web pages. The power method is the standard way to compute this eigenvector, but Krylov subspace methods converge faster. Such a method can be regarded as a two-step algorithm: the first step predicts the eigenvector, and the second step corrects the prediction. More precisely, the power method is first iterated to compute approximate eigenvectors. Then the Krylov subspace spanned by these approximations is searched for a better approximate eigenvector, one that minimizes the residual. To obtain a better approximation efficiently, we use subspaces not only in the second step but also in the first. Specifically, a Krylov subspace is first used to compute an approximate eigenvector, which is then used to expand another subspace. This second, non-Krylov subspace is then searched for a better approximate eigenvector that minimizes the residual over the subspace. This paper describes a heuristic search algorithm that alternates the two steps and presents an efficient implementation. Experiments with very large Google matrices show that the algorithm improves performance.
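The baseline predict-and-correct scheme that the abstract describes (power iterations followed by a residual-minimizing search over the subspace they span) can be illustrated with a minimal sketch. This is not the paper's algorithm or implementation: the function names `google_matrix` and `predict_correct`, the damping factor 0.85, the dense-matrix storage, and the use of an SVD to minimize the residual are all assumptions made here for illustration, and the non-Krylov subspace expansion proposed in the paper is not shown.

```python
import numpy as np


def google_matrix(adj, alpha=0.85):
    """Dense column-stochastic Google matrix (illustrative helper).

    adj[i, j] = 1 means page j links to page i. Columns without
    out-links (dangling pages) are replaced by the uniform vector.
    alpha = 0.85 is an assumed damping factor.
    """
    n = adj.shape[0]
    out_deg = adj.sum(axis=0)
    safe_deg = np.where(out_deg > 0, out_deg, 1.0)
    P = np.where(out_deg > 0, adj / safe_deg, 1.0 / n)
    return alpha * P + (1.0 - alpha) / n * np.ones((n, n))


def predict_correct(G, k=4, tol=1e-10, max_cycles=100):
    """Alternate k power steps (predict) with a residual-minimizing
    search over the span of the iterates (correct)."""
    n = G.shape[0]
    x = np.full(n, 1.0 / n)
    for cycle in range(1, max_cycles + 1):
        # Predict: power iterations, keeping every iterate.
        iterates = [x]
        for _ in range(k):
            iterates.append(G @ iterates[-1])
        # Orthonormal basis V whose range contains all iterates.
        V, _ = np.linalg.qr(np.column_stack(iterates))
        # Correct: minimize ||(G - I) V y||_2 subject to ||y||_2 = 1.
        # The minimizer is the right singular vector belonging to the
        # smallest singular value.
        _, _, Vt = np.linalg.svd(G @ V - V, full_matrices=False)
        x = V @ Vt[-1]
        x = x / x.sum()  # fix sign and scale so the entries sum to 1
        if np.linalg.norm(G @ x - x) < tol:
            return x, cycle
    return x, max_cycles


if __name__ == "__main__":
    # Hypothetical 6-page link graph; page 5 is dangling.
    adj = np.array([
        [0, 0, 1, 0, 0, 0],
        [1, 0, 0, 0, 0, 0],
        [1, 1, 0, 0, 0, 0],
        [0, 0, 1, 0, 1, 0],
        [0, 0, 0, 1, 0, 0],
        [0, 0, 0, 1, 0, 0],
    ], dtype=float)
    rank, cycles = predict_correct(google_matrix(adj))
    print("PageRank:", rank)
    print("cycles used:", cycles)
```

A dense matrix keeps the sketch short; at the scale the abstract targets, the matrix would be stored in sparse form and applied only through matrix-vector products.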
Pages: 3278-3294
Number of pages: 17