Incremental C-Rank: An effective and efficient ranking algorithm for dynamic Web environments

被引:6
|
作者
Koo, Jangwan [1 ]
Chae, Dong-Kyu [1 ]
Kim, Dong-Jin [2 ]
Kim, Sang-Wook [1 ]
机构
[1] Hanyang Univ, Seoul, South Korea
[2] Brainsoft Inc, Seongnam, South Korea
基金
新加坡国家研究基金会;
关键词
Information retrieval; Ranking algorithm; Dynamic ranking; INFORMATION; SIMILARITY; RETRIEVAL; SEARCH; LINKS;
D O I
10.1016/j.knosys.2019.03.034
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Web page ranking is one of the core components of search engines. Given a user query, ranking aims to provide a ranked list of Web pages that the user is likely to prefer the most. By and large, the ranking algorithms can be categorized into content-based approaches, link-based approaches, and hybrid approaches. Hybrid ranking algorithms, which exploit both the content and link information, are the most popular and extensively studied techniques. Among the hybrid algorithms, C-Rank combines content and link information in a very effective way using the concept of contribution. This algorithm is known to provide high performance in terms of both accurate and prompt responses to user queries. However, C-Rank suffers from very high costs to reflect the highly dynamic and extremely frequent changes in the World Wide Web, because it re-computes all of the C-Rank scores used for ranking from scratch to reflect the changes. As a result, C-Rank may be considered inappropriate to provide users with accurate and up-to-date search results. This paper aims to remedy this limitation of C-Rank. We propose incremental C-Rank, which is designed to update the C-Rank scores of only a carefully chosen portion of the Web pages rather than those of all of the Web pages without any accuracy loss. Our experimental results on a real-world dataset confirm both the effectiveness and efficiency of our proposed method. (C) 2019 Elsevier B.V. All rights reserved. Web page ranking is one of the core components of search engines. Given a user query, ranking aims to provide a ranked list of Web pages that the user is likely to prefer the most. By and large, the ranking algorithms can be categorized into content-based approaches, link-based approaches, and hybrid approaches. Hybrid ranking algorithms, which exploit both the content and link information, are the most popular and extensively studied techniques. Among the hybrid algorithms, C-Rank combines content and link information in a very effective way using the concept of contribution. This algorithm is known to provide high performance in terms of both accurate and prompt responses to user queries. However, C-Rank suffers from very high costs to reflect the highly dynamic and extremely frequent changes in the World Wide Web, because it re-computes all of the C-Rank scores used for ranking from scratch to reflect the changes. As a result, C-Rank may be considered inappropriate to provide users with accurate and up-to-date search results. This paper aims to remedy this limitation of C-Rank. We propose incremental C-Rank, which is designed to update the C-Rank scores of only a carefully chosen portion of the Web pages rather than those of all of the Web pages without any accuracy loss. Our experimental results on a real-world dataset confirm both the effectiveness and efficiency of our proposed method. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:147 / 158
页数:12
相关论文
共 46 条
  • [41] An effective and efficient approximate two-dimensional dynamic programming algorithm for supporting advanced computer vision applications
    Cuzzocrea, Alfredo
    Mumolo, Enzo
    Grasso, Giorgio Mario
    Vercelli, Gianni
    [J]. JOURNAL OF VISUAL LANGUAGES AND COMPUTING, 2017, 42 : 13 - 22
  • [42] An efficient dynamic point algorithm for line-based collision detection in real time virtual environments involving haptics
    Maciel, Anderson
    De, Suvranu
    [J]. COMPUTER ANIMATION AND VIRTUAL WORLDS, 2008, 19 (02) : 151 - 163
  • [43] An effective task scheduling algorithm based on dynamic energy management and efficient resource utilization in green cloud computing environment
    Yong Lu
    Na Sun
    [J]. Cluster Computing, 2019, 22 : 513 - 520
  • [44] Towards An Effective and Efficient Approximation Algorithm for Advanced Computer Vision Applications based on Two-Dimensional Dynamic Programming
    Cuzzocrea, Alfredo
    Mumolo, Enzo
    Grasso, Giorgio Mario
    Vercelli, Gianni
    [J]. DMS 2016: THE 22ND INTERNATIONAL CONFERENCE ON DISTRIBUTED MULTIMEDIA SYSTEMS, 2016, : 84 - 90
  • [45] An effective task scheduling algorithm based on dynamic energy management and efficient resource utilization in green cloud computing environment
    Lu, Yong
    Sun, Na
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 1): : 513 - 520
  • [46] Efficient Three-Step Amplifier Configuration Algorithm for Dynamic C plus L-Band Links in Presence of Stimulated Raman Scattering
    Song, Yuchen
    Fan, Qirui
    Lu, Chao
    Wang, Danshi
    Lau, Alan Pak Tao
    [J]. JOURNAL OF LIGHTWAVE TECHNOLOGY, 2023, 41 (05) : 1445 - 1453