A web document clustering algorithm based on concept of neighbor

被引:0
|
作者
Song, JC [1 ]
Shen, JY [1 ]
机构
[1] Xian Jiaotong Univ, Dept Comp Sci & Technol, Xian 710049, Peoples R China
关键词
web mining; document mining; document clustering; nearest neigbor technique;
D O I
10.1109/ICMLC.2003.1264440
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the WWW devloped rapidly, it becomes the most important resource gradually that transfers and shares the global information as well as being full of the latent capacity. Recent years, the researches of the Web mining have been concerned broadly and gotten a great deal of achievements simultaneously. The nearest neighbor technique, which is hierarchical clustering method based on distance, has been applied to many cases widely for the efficiency and validity. In this paper, based on the Vector Space Model (VSM) of the Web documents, We improved the nearest neighbor method, put forward a new Web document clustering algorithm, and researched the validity and scalability of the algorithm, the time and space complexity of the algorithm.
引用
收藏
页码:46 / 50
页数:5
相关论文
共 50 条
  • [1] A fuzzy-based algorithm for Web document clustering
    Friedman, M
    Kandel, A
    Schneider, M
    Last, M
    Shapira, B
    Elovici, Y
    Zaafrany, O
    [J]. NAFIPS 2004: ANNUAL MEETING OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY, VOLS 1AND 2: FUZZY SETS IN THE HEART OF THE CANADIAN ROCKIES, 2004, : 524 - 527
  • [2] An improved clustering algorithm for web document
    Wang, Jing
    Liu, Zhijing
    [J]. Journal of Information and Computational Science, 2009, 6 (02): : 959 - 966
  • [3] Concept based document clustering using K prototype Algorithm
    Pasarate, Sneha
    Shedge, Rajashree
    [J]. 2018 INTERNATIONAL CONFERENCE ON CONTROL, POWER, COMMUNICATION AND COMPUTING TECHNOLOGIES (ICCPCCT), 2018, : 579 - 583
  • [4] An effective web document clustering algorithm based on bisection and merge
    Ingyu Lee
    Byung-Won On
    [J]. Artificial Intelligence Review, 2011, 36 : 69 - 85
  • [5] An effective web document clustering algorithm based on bisection and merge
    Lee, Ingyu
    On, Byung-Won
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2011, 36 (01) : 69 - 85
  • [6] Fuzzy concept graph and application in web document clustering
    An, C
    Ning, C
    Jia, WJ
    Luo, SD
    [J]. 2001 INTERNATIONAL CONFERENCES ON INFO-TECH AND INFO-NET PROCEEDINGS, CONFERENCE A-G: INFO-TECH & INFO-NET: A KEY TO BETTER LIFE, 2001, : C101 - C106
  • [7] Formal Concept Analysis Support for Web Document Clustering Based on Social Tagging
    Ouyang, Chunping
    Yang, Xiaohua
    Li, Xiaoyun
    Liu, Zhiming
    [J]. 2012 2ND INTERNATIONAL CONFERENCE ON UNCERTAINTY REASONING AND KNOWLEDGE ENGINEERING (URKE), 2012, : 304 - 307
  • [8] A clustering algorithm based on natural nearest neighbor
    Zhu, Qingsheng
    Huang, Jinlong
    Feng, Ji
    Zhou, Xianlin
    [J]. Journal of Computational Information Systems, 2014, 10 (13): : 5473 - 5480
  • [9] Documental clustering algorithm based on fuzzy concept graph and its application in Web
    Chen, Ning
    Chen, An
    Zhou, Long-Xiang
    Jia, Wei-Jia
    Luo, San-Ding
    [J]. 2002, Chinese Academy of Sciences (13):
  • [10] K-means algorithm based on particle swarm optimization for web document clustering
    Xiao, L. Z.
    Shao, Z. Q.
    Gu, X. M.
    [J]. DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13E : 980 - 984