HierarchicalRank: Webpage Rank Improvement Using HTML TagLevel Similarity

Authors
Sharma, Dilip [1]
Ganeshiya, Deepak [1]
Affiliations
[1] GLA Univ Mathura, Dept Comp Engn & Applicat, Mathura, Uttar Pradesh, India
Keywords
Web mining; web graph; hyperlink analysis; connectivity; PageRank; HTML tags
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Prior research has introduced two types of ranking algorithms: query-dependent algorithms, which work online, and query-independent algorithms, which work offline. The PageRank algorithm works offline, independent of the query, while the Hyperlink-Induced Topic Search (HITS) algorithm works online and depends on the query. One problem with these algorithms is that rank is divided according to the number of inlinks and outlinks and other hyperlink-analysis parameters, which may or may not depend on webpage content, and this leads to topic drift. Previous research focused on solving this problem using the popularity of the outlink webpages. This paper proposes a novel popularity measure based on the similarity between the query and hierarchical text extracted from the source and target webpages, using a Hyper Text Markup Language (HTML) tag importance parameter. The results of the proposed method are compared with those of the PageRank algorithm and of Topic Distillation with Query Dependent Link Connections and Page Characteristics.
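The abstract does not give the formula or the tag weights, so the following is only a minimal sketch of the general idea of tag-weighted query similarity, not the authors' method. The weights in `TAG_WEIGHTS`, the class `WeightedTextExtractor`, and the function `tag_weighted_similarity` are all hypothetical names and values chosen for illustration; cosine similarity is an assumed stand-in for whatever measure the paper uses.

```python
import math
import re
from collections import Counter
from html.parser import HTMLParser

# Hypothetical tag importance weights; the abstract does not state the real ones.
TAG_WEIGHTS = {"title": 3.0, "h1": 2.5, "h2": 2.0, "a": 1.5, "b": 1.2, "p": 1.0}


class WeightedTextExtractor(HTMLParser):
    """Accumulates a weight per term, based on the tags enclosing the term."""

    def __init__(self):
        super().__init__()
        self.stack = []           # currently open tags that carry a weight
        self.weights = Counter()  # term -> accumulated weight

    def handle_starttag(self, tag, attrs):
        if tag in TAG_WEIGHTS:
            self.stack.append(tag)

    def handle_endtag(self, tag):
        if tag in self.stack:
            # Pop up to and including the matching open tag.
            while self.stack and self.stack.pop() != tag:
                pass

    def handle_data(self, data):
        if not self.stack:
            return  # text outside any weighted tag is ignored in this sketch
        weight = max(TAG_WEIGHTS[t] for t in self.stack)
        for term in re.findall(r"[a-z0-9]+", data.lower()):
            self.weights[term] += weight


def tag_weighted_similarity(query, html):
    """Cosine similarity between query terms and tag-weighted page terms."""
    parser = WeightedTextExtractor()
    parser.feed(html)
    parser.close()
    doc = parser.weights
    q = Counter(re.findall(r"[a-z0-9]+", query.lower()))
    dot = sum(q[t] * doc[t] for t in q)
    norm = math.sqrt(sum(v * v for v in q.values())) * math.sqrt(
        sum(v * v for v in doc.values())
    )
    return dot / norm if norm else 0.0


page_a = "<html><head><title>web mining</title></head><body><p>cooking recipes</p></body></html>"
page_b = "<html><head><title>cooking recipes</title></head><body><p>web mining</p></body></html>"
score_a = tag_weighted_similarity("web mining", page_a)
score_b = tag_weighted_similarity("web mining", page_b)
```

Under these assumed weights, a page that carries the query terms in its title scores higher than one that carries them only in body text, which is the kind of tag-level distinction the abstract describes. In a link-based ranker, such a score could be computed for both the source and the target page of a hyperlink and used to bias how rank is divided among outlinks; that coupling is not shown here.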
Pages: 485-492
Page count: 8