Content and link-structure perspective of ranking webpages: A review

被引:4
|
作者
Ali, Fayyaz [1 ]
Khusro, Shah [1 ]
机构
[1] Univ Peshawar, Dept Comp Sci, Peshawar 25120, Khyber Pakhtunk, Pakistan
关键词
Information retrieval; Web search engines; Ranking; Ranking algorithms; PageRank; HITS; WEB; PAGERANK; SEARCH; ALGORITHM; TIME; INFORMATION; COMPUTATION; RETRIEVAL; GOOGLE; MODEL;
D O I
10.1016/j.cosrev.2021.100397
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The delivery of ranked relevant results is probably the most important factor in making a web search engine acceptable to its users. This inspiration has led the search engine engineers and researchers to conceive ranking algorithms that can provide the most relevant results (webpages) at the top of the Search Engines Results Page (SERP). To rank webpages, several features are exploited in research studies related to the content and link structure of the web. This article discusses and assesses the webpage ranking algorithms proposed in the domains of content-based and link-based rankings in the past two decades. The assessment of these algorithms is done using features extracted from the relevant literature. The strengths and limitations of these features as well as the ranking algorithms are highlighted and discussed. The findings of this work suggest that the link-based ranking factors are still the dominant force in ranking webpages but these alone are by no means enough to fulfill the information needs of the users. An acceptable solution must contain features from both the content based and link-based ranking domains integrated with the temporal features and users' behavior information. Possible future directions are also highlighted.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Did online publishers "get it right"? Using a naturalistic search strategy to review cognitive health promotion content on internet webpages
    Hunter, P. V.
    Delbaere, M.
    O'Connell, M. E.
    Cammer, A.
    Seaton, J. X.
    Friedrich, T.
    Fick, F.
    [J]. BMC GERIATRICS, 2017, 17
  • [32] A method for focused crawling using combination of link structure and content similarity
    Jamali, Mohsen
    Sayyadi, Hassan
    Hariri, Babak Bagheri
    Abolhassani, Hassan
    [J]. 2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, (WI 2006 MAIN CONFERENCE PROCEEDINGS), 2006, : 753 - +
  • [33] Link prediction of the world container shipping network: A network structure perspective
    Ge, Jiawei
    Wang, Xuefeng
    Shi, Wenming
    [J]. CHAOS, 2021, 31 (11)
  • [34] Modeling content and structure for abstractive review summarization
    Gerani, Shima
    Carenini, Giuseppe
    Ng, Raymond T.
    [J]. COMPUTER SPEECH AND LANGUAGE, 2019, 53 : 302 - 331
  • [35] Information Content Estimate of Model Proteomes: A Primary Structure Perspective
    Eroglu, Sertac
    [J]. CURRENT BIOINFORMATICS, 2017, 12 (06) : 490 - 497
  • [36] Efficient Focused Crawling Strategy Using Combination of Link Structure and Content Similarity
    Qu Cheng
    Wang Beizhan
    Wei Pianpian
    [J]. 2008 IEEE INTERNATIONAL SYMPOSIUM ON IT IN MEDICINE AND EDUCATION, VOLS 1 AND 2, PROCEEDINGS, 2008, : 1045 - 1048
  • [37] Extracting Topic Maps from Web Pages by Web Link Structure and Content
    Mase, Motohiro
    Yamada, Seiji
    Nitta, Katsumi
    [J]. 2008 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-8, 2008, : 1232 - +
  • [38] Structure and content of phenolics in eggplant (Solanum melongena) - a review
    Nino-Medina, G.
    Urias-Orona, V.
    Muy-Rangel, M. D.
    Heredia, J. B.
    [J]. SOUTH AFRICAN JOURNAL OF BOTANY, 2017, 111 : 161 - 169
  • [39] Intra-Firm Information Flow: A Content-Structure Perspective
    Berchenko, Yakir
    Daliot, Or
    Brueller, Nir N.
    [J]. ADVANCES IN INTELLIGENT DATA ANALYSIS X: IDA 2011, 2011, 7014 : 34 - +
  • [40] Defected Ground Structure in the perspective of Microstrip Antennas: A Review
    Arya, Ashwini K.
    Kartikeyan, M. V.
    Patnaik, A.
    [J]. FREQUENZ, 2010, 64 (5-6) : 79 - 84