Web page clustering: A hyperlink-based similarity and matrix-based hierarchical algorithms

被引:0
|
作者
Hou, JY [1 ]
Zhang, YC
Cao, JL
机构
[1] Deakin Univ, Sch Informat Technol, Melbourne, Vic 3125, Australia
[2] Univ So Queensland, Dept Math & Comp, Toowoomba, Qld 4350, Australia
[3] La Trobe Univ, Dept Comp Sci & Comp Engn, Melbourne, Vic 3086, Australia
来源
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a hyperlink-based web page similarity measurement and two matrix-based hierarchical web page clustering algorithms. The web page similarity measurement incorporates hyperlink transitivity and page importance within the concerned web page space. One clustering algorithm takes cluster overlapping into account, another one does not. These algorithms do not require predefined similarity thresholds for clustering, and are independent of the page order. The primary evaluations show the effectiveness of the proposed algorithms in clustering improvement.
引用
收藏
页码:201 / 212
页数:12
相关论文
共 50 条
  • [1] Deriving and verifying statistical distribution of a hyperlink-based Web page quality metric
    Dhyani, D
    Bhowmick, SS
    Ng, WK
    [J]. DATA & KNOWLEDGE ENGINEERING, 2003, 46 (03) : 291 - 315
  • [2] A matrix approach for hierarchical web page clustering based on hyperlinks
    Hou, JY
    Zhang, YC
    [J]. WISE 2002: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING (WORKSHOPS), 2002, : 207 - 216
  • [3] Web enabled expert systems using hyperlink-based inference
    Kim, WJ
    Song, YU
    Hong, JS
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2005, 28 (01) : 79 - 91
  • [4] A hyperlink-based algorithm incorporating hops between web pages
    Lai, J
    Soh, B
    [J]. International Symposium on Communications and Information Technologies 2005, Vols 1 and 2, Proceedings, 2005, : 779 - 782
  • [5] Web enabled expert systems using hyperlink-based inference
    Song, YU
    Kim, W
    Hong, JS
    [J]. IKE'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE ENGINEERING, VOLS 1 AND 2, 2003, : 598 - 605
  • [6] Matrix-based hierarchical clustering for developing product architecture
    Daie, Pooya
    Li, Simon
    [J]. CONCURRENT ENGINEERING-RESEARCH AND APPLICATIONS, 2016, 24 (02): : 139 - 152
  • [7] A matrix-based model for web page community construction and more
    Hou, Jingyu
    [J]. INFORMATICA, 2007, 18 (02) : 217 - 238
  • [8] An improved Aitken extrapolation in hyperlink-based ranking
    Liu, Huiyi
    Dong, Zhiyong
    [J]. DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13E : 2633 - 2636
  • [9] Adjustment of web page hyperlink based on greedy algorithm
    Chen, QZ
    Zhang, WY
    Chu, YQ
    Chen, XY
    Han, JG
    [J]. PROCEEDINGS OF THE 11TH JOINT INTERNATIONAL COMPUTER CONFERENCE, 2005, : 214 - 217
  • [10] Adjustment of web page hyperlink based on greedy algorithm
    Chen, Qingzhang
    Chen, Xiaoyin
    Cao, Che
    Gu, Yujie
    [J]. 2005 INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND TECHNOLOGY, PROCEEDINGS, 2005, : 424 - 428