A Probabilistic Approach for Distillation and Ranking of Web Pages

Cited by: 2
Authors
Greco G. [1]
Greco S. [1]
Zumpano E. [1]
Affiliation
[1] DEIS, Università della Calabria, Rende
Keywords
information retrieval on the Web; random walks; search engines; Web searching
DOI
10.1023/A:1013883717655
Abstract
A great number of recent papers have investigated the possibility of introducing more effective and efficient algorithms for search engines. Traditional search engines rank results using textual information only and, as several works have shown, such rankings are not very useful for extracting relevant information. Recent research instead takes a new approach, called topic distillation, whose main task is to find relevant documents using a different similarity criterion: the retrieved documents are those related to the query topic, even if they do not contain the query string. Current algorithms for topic distillation first compute a base set containing all the relevant pages and then apply an iterative procedure to obtain the authoritative pages. In this paper, we present a different approach, which computes the authoritative pages by analyzing the structure of the base set. The technique applies a statistical approach to the co-citation matrix of the base set to find the most co-cited pages, and combines link analysis with an evaluation of page content. Several experiments have shown the validity of our approach. © 2001, Kluwer Academic Publishers.
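As a rough illustration of the co-citation matrix mentioned in the abstract, the following Python sketch counts, for every pair of pages in a small base set, how many pages link to both. This is only an assumption-laden toy: the abstract does not specify the statistical procedure applied to the matrix, so the ranking by row sum used here is a stand-in, and the names `cocitation_matrix`, `rank_by_cocitation`, `pages`, and `links` are hypothetical.

```python
from itertools import combinations


def cocitation_matrix(links, pages):
    """Return C where C[i][j] counts the pages that link to both pages[i] and pages[j].

    links: dict mapping a source page to the collection of pages it links to.
    pages: ordered list of the pages in the base set (hypothetical input format).
    """
    index = {p: i for i, p in enumerate(pages)}
    n = len(pages)
    C = [[0] * n for _ in range(n)]
    for source, targets in links.items():
        # Keep only distinct targets inside the base set; ignore self-links.
        cited = sorted(t for t in set(targets) if t in index and t != source)
        for a, b in combinations(cited, 2):
            C[index[a]][index[b]] += 1
            C[index[b]][index[a]] += 1
    return C


def rank_by_cocitation(links, pages):
    """Order pages by total co-citation count (a stand-in score, not the paper's method)."""
    C = cocitation_matrix(links, pages)
    scores = {p: sum(C[i]) for i, p in enumerate(pages)}
    # Highest score first; ties broken alphabetically for a stable result.
    return sorted(pages, key=lambda p: (-scores[p], p))


# Toy base set: pages h1..h3 act as hubs citing a, b, c; d cites only a.
pages = ["a", "b", "c", "d"]
links = {
    "h1": {"a", "b"},
    "h2": {"a", "b", "c"},
    "h3": {"a", "c"},
    "d": {"a"},
}
ranking = rank_by_cocitation(links, pages)
```

In this toy input, page `a` is co-cited with `b` twice and with `c` twice, so it receives the highest total; page `d` is never co-cited with anything and ends up last.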
Pages: 189-207
Number of pages: 18