Effectively finding relevant Web pages from linkage information

被引:36
|
作者
Hou, JY [1 ]
Zhang, YC
机构
[1] Deakin Univ, Sch Informat Technol, Melbourne, Vic 3125, Australia
[2] Victoria Univ Technol, Sch Comp Sci & Math, Melbourne, Vic 8001, Australia
关键词
World Wide Web; Web search; information retrieval; hyperlink analysis; singular value decomposition (SVD);
D O I
10.1109/TKDE.2003.1209010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents two hyperlink analysis-based algorithms to find relevant pages for a given Web page (URL). The first algorithm comes from the extended cocitation analysis of the Web pages. It is intuitive and easy to implement. The second one takes advantage of linear algebra theories to reveal deeper relationships among the Web pages and to identify relevant pages more precisely and effectively. The experimental results show the feasibility and effectiveness of the algorithms. These algorithms could be used for various Web applications, such as enhancing Web search. The ideas and techniques in this work would be helpful to other Web-related researches.
引用
收藏
页码:940 / 951
页数:12
相关论文
共 50 条
  • [1] KernelRank: Exploiting Semantic Linkage Kernels for Relevant Pages Finding
    Wang Yaowei
    Su Limin
    Tian Yonghong
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2009, 18 (03) : 405 - 410
  • [2] Exploring content and linkage structures for searching relevant web pages
    Davis, Darren
    Jiang, Eric
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2007, 4632 : 15 - +
  • [3] Improving linkage of web pages
    Gupta, Rakesh
    Bagchi, Amitava
    Sarkar, Sumit
    [J]. INFORMS JOURNAL ON COMPUTING, 2007, 19 (01) : 127 - 136
  • [4] Information Extraction from Web pages
    Novotny, Robert
    Vojtas, Peter
    Maruscak, Dusan
    [J]. 2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 3, 2009, : 121 - +
  • [5] Finding Pages on the Unarchived Web
    Huurdeman, Hugo C.
    Ben-David, Anat
    Kamps, Jaap
    Samar, Thaer
    de Vries, Arjen P.
    [J]. 2014 IEEE/ACM JOINT CONFERENCE ON DIGITAL LIBRARIES (JCDL), 2014, : 331 - 340
  • [6] Automatically Discovering Relevant Images From Web Pages
    Uzun, Erdinc
    Ozhan, Erkan
    Agun, Hayri Volkan
    Yerlikaya, Tarik
    Bulus, Halil Nusret
    [J]. IEEE ACCESS, 2020, 8 : 208910 - 208921
  • [7] Finding and Extracting Data Records from Web Pages
    Manuel Álvarez
    Alberto Pan
    Juan Raposo
    Fernando Bellas
    Fidel Cacheda
    [J]. Journal of Signal Processing Systems, 2010, 59 : 123 - 137
  • [8] Finding and Extracting Data Records from Web Pages
    Alvarez, Manuel
    Pan, Alberto
    Raposo, Juan
    Bellas, Fernando
    Cacheda, Fidel
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2010, 59 (01): : 123 - 137
  • [9] Finding and extracting data records from web pages
    Alvarez, Manuel
    Pan, Alberto
    Raposo, Juan
    Bellas, Fernando
    Cacheda, Fidel
    [J]. EMBEDDED AND UBIQUITOUS COMPUTING, PROCEEDINGS, 2007, 4808 : 466 - 478
  • [10] Fast Information Retrieval from Web Pages
    El-Bakry, Hazem M.
    Mastorakis, Nikos
    [J]. PROCEEDINGS OF THE 7TH WSEAS INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE, MAN-MACHINE SYSTEMS AND CYBERNETICS (CIMMACS '08): RECENT ADVANCES IN COMPUTATIONAL INTELLIGENCE, MAN-MACHINE SYSTEMS AND CYBERNETICS, 2008, : 229 - +