Page clustering using a distance based algorithm

Cited by: 0
Authors
Mojica, JA
Rojas, DA
Gómez, J
González, F
DOI
10.1109/LAWEB.2005.27
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper applies a clustering algorithm based on gravitational forces to the problem of Web page clustering in a dynamic environment. The proposed algorithm modifies the gravitational algorithm of Gomez et al. so that it uses only distance measures; no notion of space is required. This approach is useful when similarities (and hence distances) between pages can be defined and computed quickly, but defining a space is computationally expensive. Experiments are performed on data representing real URLs and sessions, and the algorithm is compared with the incremental connected components algorithm, which has previously been used to solve this problem.
Pages: 223-229
Page count: 7
Related papers
50 in total
  • [21] SmallSteps: An adaptive distance-based clustering algorithm
    Koch, Gy.
    Dombi, J.
    Acta Cybernetica, 2001, 15 (02): : 241 - 256
  • [22] SmallSteps: An adaptive distance-based clustering algorithm
    2001, University of Szeged, Arpad ter 2., Szeged, H-6720, Hungary (15):
  • [23] An incremental learning clustering algorithm based on Mahalanobis distance
    Zhang, Yong
    Sun, Xiaopeng
    Zheng, Hongliang
    Wang, Jianying
    Journal of Computational Information Systems, 2010, 6 (03): : 973 - 980
  • [24] Improving of cache memory performance based on a fuzzy clustering based page replacement algorithm by using four features
    Akbari-Bengar, Davood
    Ebrahimnejad, Ali
    Motameni, Homayun
    Golsorkhtabaramiri, Mehdi
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (05) : 7899 - 7908
  • [25] A Weighting Fuzzy Clustering Algorithm Based on Euclidean Distance
    Xue, Zhan-ao
    Cen, Feng
    Wei, Li-ping
    FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 1, PROCEEDINGS, 2008, : 172 - 175
  • [26] Direct clustering algorithm based on generalized information distance
    College of Computer Science and Technology, China University of Mining and Technology, Xuzhou 221008, China
    Unknown
    Unknown
    Jisuanji Yanjiu yu Fazhan, 2007, 4 (674-679):
  • [27] Clustering Algorithm Based on Semantic Distance for XML Documents
    Yang, Lingxian
    Gu, Jinguang
    Chen, Heping
    FIRST INTERNATIONAL WORKSHOP ON DATABASE TECHNOLOGY AND APPLICATIONS, PROCEEDINGS, 2009, : 549 - +
  • [28] Iterative optimization clustering algorithm based on manifold distance
    Wang, Na
    Du, Haifeng
    Wang, Sun'an
    Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2009, 43 (05): : 76 - 79
  • [29] Web page sorting algorithm based on query keyword distance relation
    Yang, Han
    Cui, HongGang
    Tang, Hao
    GREEN ENERGY AND SUSTAINABLE DEVELOPMENT I, 2017, 1864
  • [30] Web Page Prediction by Clustering and Integrated Distance Measure
    Poornalatha, G.
    Raghavendra, Prakash S.
    2012 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2012, : 1349 - 1354