A path-based approach for web page retrieval

被引:6
|
作者
Li, Jian-Qiang [1 ]
Zhao, Yu [1 ]
Garcia-Molina, Hector [2 ]
机构
[1] NEC Labs China, Beijing 100084, Peoples R China
[2] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
关键词
navigation path; web search; web information retrieval; POPULARITY; RANKING;
D O I
10.1007/s11280-011-0133-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Use of links to enhance page ranking has been widely studied. The underlying assumption is that links convey recommendations. Although this technique has been used successfully in global web search, it produces poor results for website search, because the majority of the links in a website are used to organize information and convey no recommendations. By distinguishing these two kinds of links, respectively for recommendation and information organization, this paper describes a path-based method for web page ranking. We define the Hierarchical Navigation Path (HNP) as a new resource for improving web search. HNP is composed of multi-step navigation information in visitors' website browsing. It provides indications of the content of the destination page. We first classify the links inside a website. Then, the links for web page organization are exploited to construct the HNPs for each page. Finally, the PathRank algorithm is described for web page retrieval. The experiments show that our approach results in significant improvements over existing solutions.
引用
收藏
页码:257 / 283
页数:27
相关论文
共 50 条
  • [1] A path-based approach for web page retrieval
    Jian-Qiang Li
    Yu Zhao
    Hector Garcia-Molina
    [J]. World Wide Web, 2012, 15 : 257 - 283
  • [2] PathRank: Web Page Retrieval with Navigation Path
    Li, Jianqiang
    Zhao, Yu
    [J]. ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5478 : 350 - +
  • [3] Path-based protocol verification approach
    Liu, WC
    Chung, CG
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2000, 42 (04) : 229 - 244
  • [4] Evaluating interconnection relationship for path-based XML retrieval
    Li, Xiaoguang
    Yu, Ge
    Wang, Daling
    Song, Baoyan
    [J]. WEB INFORMATION SYSTEMS - WISE 2006, PROCEEDINGS, 2006, 4255 : 506 - 511
  • [5] Path-based Approach to Integration Testing
    Hu, Jueliang
    Ding, Zuohua
    Pu, Geguang
    [J]. 2009 THIRD IEEE INTERNATIONAL CONFERENCE ON SECURE SOFTWARE INTEGRATION AND RELIABILITY IMPROVEMENT, PROCEEDINGS, 2009, : 445 - +
  • [6] Path-Based Verification for Composition of Semantic Web Services
    Shi, Yuxiang
    Yan, Jun
    Li, Zhongjie
    Zhu, Jun
    [J]. APPLIED COMPUTING 2008, VOLS 1-3, 2008, : 2392 - +
  • [7] Path-based XML Relational Storage Approach
    Wang, Qi
    Ren, Zhongwei
    Dong, Liang
    Sheng, Zhongqi
    [J]. 2012 INTERNATIONAL CONFERENCE ON MEDICAL PHYSICS AND BIOMEDICAL ENGINEERING (ICMPBE2012), 2012, 33 : 1621 - 1625
  • [8] A path-based approach to information technology in manufacturing
    Upton, DM
    McAfee, AP
    [J]. INTERNATIONAL JOURNAL OF TECHNOLOGY MANAGEMENT, 2000, 20 (3-4) : 354 - 372
  • [9] A path-based approach to the detection of infinite looping
    Zhang, J
    [J]. SECOND ASIA-PACIFIC CONFERENCE ON QUALITY SOFTWARE, PROCEEDINGS, 2001, : 88 - 94
  • [10] A PATH-BASED APPROACH TO CONSTRAINED SPARSE OPTIMIZATION
    Hallak, Nadav
    [J]. SIAM JOURNAL ON OPTIMIZATION, 2024, 34 (01) : 790 - 816