Web Document Clustering Research Based on Granular Computing

被引:2
|
作者
Zheng Shangzhi [1 ]
Zhao Xiaolong [1 ]
Zhang Buqun [1 ]
Bu Hualong [1 ]
机构
[1] Chaohu Univ, Dept Comp Sci & Technol, Chaohu, Peoples R China
关键词
Granularcomputing; Clustering; Association rules; Web documents;
D O I
10.1109/ISECS.2009.16
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, a method of web document clustering based on granular computing (WDCGrc) is presented. The method computes the weight value of the words in documents by adopting the TF-IDF principle. Meanwhile, combinative ways defining documents threshold and average weight value are adopted to reduce dimensions and extract the keywords in each document. The paper establishes the transformation between the keywords in documents and the binary granules, and adopts the algorithm of association rules based on granular computing to obtain frequent itemsets between documents. Bring in the set theory thought, numbers of the same word between documents as the document similarity and the clustering result is obtained. The experiment shows that the method is practical and feasible, with good quality of clustering.
引用
收藏
页码:446 / 450
页数:5
相关论文
共 50 条
  • [1] Method of clustering web pages based on granular computing
    Hu, Jun
    Guan, Chun
    Liu, Bocheng
    [J]. Hu, J., 2013, Asian Network for Scientific Information (13) : 2107 - 2110
  • [2] Research of Text Clustering based on Fuzzy Granular Computing
    Zhang Xia
    Yin Yixin
    Xu Mingzhu
    Zhao Hailong
    [J]. 2009 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, VOL 2, 2009, : 288 - +
  • [3] Research of hierarchical clustering based on dynamic granular computing
    Li, Xue-yong
    Sun, Jia-Xia
    Gao, Guo-Hong
    Fu, Jun-Hui
    [J]. Journal of Computers, 2011, 6 (12) : 2526 - 2533
  • [4] Clustering research using dynamic modeling based on granular computing
    Liu, Q
    Jin, WB
    Wu, SY
    Zhou, YH
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2005, : 539 - 543
  • [5] Granular Computing based Comparison of Agglomerative Clustering
    Tsumoto, Shusaku
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 5553 - 5560
  • [6] Text Clustering Based on Granular Computing and Wikipedia
    Jing, Liping
    Yu, Jian
    [J]. ROUGH SETS AND KNOWLEDGE TECHNOLOGY, 2011, 6954 : 679 - 688
  • [7] Web Structure Model Based on Granular Computing
    Jun, Hu
    Qiang, Wu
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING ( GRC 2009), 2009, : 245 - +
  • [8] A novel clustering ensemble model based on granular computing
    Xu, Li
    Ding, Shifei
    [J]. APPLIED INTELLIGENCE, 2021, 51 (08) : 5474 - 5488
  • [9] A novel clustering ensemble model based on granular computing
    Li Xu
    Shifei Ding
    [J]. Applied Intelligence, 2021, 51 : 5474 - 5488
  • [10] Efficient phrase-based document indexing for web document clustering
    Hammouda, KM
    Kamel, MS
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2004, 16 (10) : 1279 - 1296