Incremental fuzzy clustering for document categorization

被引:0
|
作者
Mei, Jian-Ping [1 ]
Wang, Yangtao [2 ]
Chen, Lihui [2 ]
Miao, Chunyan [3 ]
机构
[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou 310023, Zhejiang, Peoples R China
[2] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[3] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Incremental clustering has been proposed to handle large datasets which can not fit into memory entirely. Single pass fuzzy c-means (SpFCM) and Online fuzzy c-means (OFCM) are two representative incremental fuzzy clustering methods. Both of them extend the scalability of fuzzy c-means (FCM) by processing the dataset chunk by chunk. However, due to the data sparsity and high-dimensionality, SpFCM and OFCM fail to produce reasonable results for document data. In this study, we work on clustering approaches that take care of both the large-scale and high-dimensionality issues. Specifically, we propose two methods for incrementally clustering of document data. The first method is a modification of the existing FCM-based incremental clustering with a step to normalize the centroids in each iteration, while the other method is incremental clustering, i.e., Single-Pass or Online, with weighted fuzzy co-clustering. We use several benchmark document datasets for experimental study. The experimental results show that the proposed approaches achieved significant improvements over existing SpFCM and OFCM in document clustering.
引用
收藏
页码:1518 / 1525
页数:8
相关论文
共 50 条
  • [1] Hyperspherical fuzzy clustering for online document categorization
    Mei, Jian-Ping
    Wang, Yangtao
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2016, : 1487 - 1493
  • [2] Large Scale Document Categorization With Fuzzy Clustering
    Mei, Jian-Ping
    Wang, Yangtao
    Chen, Lihui
    Miao, Chunyan
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2017, 25 (05) : 1239 - 1251
  • [3] A modified fuzzy clustering for documents retrieval: application to document categorization
    Nefti, S.
    Oussalah, M.
    Rezgui, Y.
    [J]. JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2009, 60 (03) : 384 - 394
  • [4] Incremental document clustering using fuzzy-based optimization strategy
    Madhulika Yarlagadda
    Gangadhara Rao Kancherla
    Srikrishna Atluri
    [J]. Evolutionary Intelligence, 2020, 13 : 497 - 510
  • [5] Incremental document clustering using fuzzy-based optimization strategy
    Yarlagadda, Madhulika
    Kancherla, Gangadhara Rao
    Atluri, Srikrishna
    [J]. EVOLUTIONARY INTELLIGENCE, 2020, 13 (03) : 497 - 510
  • [6] Semi-supervised fuzzy co-clustering algorithm for document categorization
    Yan, Yang
    Chen, Lihui
    Tjhi, William-Chandra
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2013, 34 (01) : 55 - 74
  • [7] Semi-supervised fuzzy co-clustering algorithm for document categorization
    Yang Yan
    Lihui Chen
    William-Chandra Tjhi
    [J]. Knowledge and Information Systems, 2013, 34 : 55 - 74
  • [8] An incremental document clustering for the large document database
    Joo, KH
    Lee, WS
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2005, 3689 : 374 - 387
  • [9] Web Document Categorization by Support Vector Clustering
    Shi, Daming
    Tsui, Ming Hei
    Liu, Jigang
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, : 1482 - 1487
  • [10] Fuzzy clustering and categorization of text documents
    Ayeldeen, Heba
    Rassanien, Aboul Ella
    Fahmy, Aly Aly
    [J]. 2013 13TH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS (HIS), 2013, : 262 - 266