A Robust Algorithm for Fuzzy Document Clustering

Cited by: 0
Authors
Chen, Lifei [1]
Wang, Shengrui [2]
Jiang, Qingshan [3]
Affiliations
[1] Fujian Normal Univ, Sch Math & Comp Sci, Fuzhou 360108, Peoples R China
[2] Univ Sherbrooke, Dept Comp Sci, Sherbrooke, PQ J1K 2R1, Canada
[3] Xiamen Univ, Software Sch, Xiamen 361005, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
In many document clustering applications, a document may cover multiple topics and thus belong to several categories at once. Most existing subspace clustering algorithms, however, can only perform hard clustering on document collections. This paper introduces a fuzzy algorithm named R-FPC for document clustering. The algorithm discovers soft partitions of a data set in the soft subspaces of the data space. With the proposed R-Greedy initialization method, R-FPC can always generate stable clustering results with competitive accuracy. Experiments on several widely used corpora demonstrate the effectiveness and robustness of the proposed methods.
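To illustrate what a "soft partition" means here, the following minimal sketch computes membership degrees of one document to several cluster centers using the standard fuzzy c-means membership formula. This is not the R-FPC algorithm from the paper: it ignores soft subspaces (per-cluster feature weights) and the R-Greedy initialization, and the function name `fuzzy_memberships`, the fuzzifier `m`, and the toy vectors are assumptions made for illustration only.

```python
import math


def fuzzy_memberships(doc, centers, m=2.0):
    """Membership degrees of one document vector to each cluster center.

    Uses the standard fuzzy c-means rule (an assumption, not R-FPC):
    u_i is proportional to d_i ** (-2 / (m - 1)), normalized to sum to 1.
    """
    dists = [math.dist(doc, c) for c in centers]

    # A document lying exactly on a center belongs fully to it
    # (split evenly if it coincides with several centers).
    zeros = [i for i, d in enumerate(dists) if d == 0.0]
    if zeros:
        return [1.0 / len(zeros) if i in zeros else 0.0
                for i in range(len(centers))]

    exponent = 2.0 / (m - 1.0)
    inverse = [d ** -exponent for d in dists]
    total = sum(inverse)
    return [v / total for v in inverse]


# Toy term-weight vectors: the document mentions both topics equally.
doc = [1.0, 1.0, 0.0]
centers = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
print(fuzzy_memberships(doc, centers))  # [0.5, 0.5]
```

A hard clustering method would force this document into a single cluster, whereas the membership vector lets it relate to both categories at the same time, which is the situation the abstract describes.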
Pages: 679 / +
Page count: 2
Related papers
50 in total
  • [1] Application of fuzzy clustering algorithm in Chinese document clustering
    Li, Jiafu
    Zhang, Yafei
    Lu, Jianjiang
    [J]. Jisuanji Gongcheng/Computer Engineering, 2002, 28 (04):
  • [2] A Robust Fuzzy Kernel Clustering Algorithm
    Zhang Chen
    Xia Shixiong
    Liu Bing
    [J]. APPLIED MATHEMATICS & INFORMATION SCIENCES, 2013, 7 (03): 1005 - 1012
  • [3] Sentence Clustering in Text Document Using Fuzzy Clustering Algorithm
    Sruthi, S.
    Shalini, L.
    [J]. 2014 INTERNATIONAL CONFERENCE ON CONTROL, INSTRUMENTATION, COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICCICCT), 2014: 1473 - 1476
  • [4] Improved Fuzzy Clustering Algorithm and Its Application in Document Clustering
    Liu Yiming
    Yao Min
    Zheng Xiaoliang
    [J]. PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT, VOLS A-C, 2008: 2366 - 2370
  • [5] A fuzzy-based algorithm for Web document clustering
    Friedman, M
    Kandel, A
    Schneider, M
    Last, M
    Shapira, B
    Elovici, Y
    Zaafrany, O
    [J]. NAFIPS 2004: ANNUAL MEETING OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY, VOLS 1 AND 2: FUZZY SETS IN THE HEART OF THE CANADIAN ROCKIES, 2004: 524 - 527
  • [6] Fuzzy Document Clustering Based on Ant Colony Algorithm
    Wang, Fei
    Zhang, Dexian
    Bao, Na
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2009, PT 2, PROCEEDINGS, 2009, 5552 : 709 - 716
  • [7] Robust fuzzy co-clustering algorithm
    Tjhi, William-Chandra
    Chen, Lihui
    [J]. 2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007: 1591 - 1595
  • [8] Document Clustering by Fuzzy C-Mean Algorithm
    Win, Thaung Thaung
    Mon, Lin
    [J]. 2ND IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL (ICACC 2010), VOL. 1, 2010: 239 - 242
  • [9] Fuzzy clusterers combination by positional voting for robust document clustering
    Sevillano, Xavier
    Claudi Socoro, Joan
    Alias, Francesc
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2009, (43): 245 - 253
  • [10] Fuzzy Ontology for Distributed Document Clustering based on Genetic Algorithm
    Thangamani, M.
    Thangaraj, P.
    [J]. APPLIED MATHEMATICS & INFORMATION SCIENCES, 2013, 7 (04): 1563 - 1574