C-Rank and its variants: A contribution-based ranking approach exploiting links and content

被引:3
|
作者
Kim, Dong-Jin [1 ]
Lee, Sang-Chul [2 ]
Son, Ho-Yong [2 ]
Kim, Sang-Wook [2 ]
Lee, Jae Bum [3 ]
机构
[1] NHN Inst Next Network, Songnam, South Korea
[2] Hanyang Univ, Dept Elect & Comp Engn, Seoul 133791, South Korea
[3] NHN Corp, Songnam, South Korea
基金
新加坡国家研究基金会;
关键词
Content and link ranking; contribution based ranking; contribution constraints; C-Rank; Web information retrieval; TEXT; INFORMATION;
D O I
10.1177/0165551514545429
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses the problem in Web page ranking of effectively combining link and content information with efficiency high enough to be applicable to real-world search engines. Unlike previous surfer models, our approach is based on the viewpoint of a Web page author. Based on this viewpoint, we formulate the concept of contribution score, which indicates the amount to which a term in each page is utilized by other pages. To improve efficiency without loss of effectiveness, we exploit the expectations of both a Web page author and a Web search engine user on retrieval results, and restrict candidate terms that can contribute to other pages to a set of keywords of each page. In this paper, we propose three contribution-based models: C-Rank, PC-Rank and HC-Rank. Experimental results show that C-Rank provides the best precision among the models and is very effective for topic distillation tasks on the .GOV collection in TREC. Most importantly, the proposed models are efficient enough to be applicable to real-world search engines.
引用
收藏
页码:761 / 778
页数:18
相关论文
共 1 条
  • [1] A Substrate-Based Approach to Skeletal Diversity from Dicobalt Hexacarbonyl (C1)-Alkynyl Glycals by Exploiting Its Combined Ferrier-Nicholas Reactivity
    Lobo, Fernando
    Gomez, Ana M.
    Miranda, Silvia
    Cristobal Lopez, J.
    [J]. CHEMISTRY-A EUROPEAN JOURNAL, 2014, 20 (33) : 10492 - 10502