Hierarchical Star Clustering Algorithm for Dynamic Document Collections

被引:0
|
作者
Gil-Garcia, Reynaldo [1 ]
Pons-Porrata, Aurora [1 ]
机构
[1] Univ Oriente, Ctr Pattern Recognit & Data Min, Santiago De Cuba, Cuba
关键词
hierarchial clustering; dynamic clustering; overlapped clusters;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a new clustering algorithm called Dynamic Hierarchical Star is introduced. Our approach aims to construct a hierarchy of overlapped clusters, dealing with dynamic data sets. The experimental results on several benchmark text collections show that this method obtains smaller hierarchies than traditional algorithms while achieving a similar clustering quality. Therefore, we advocate its use for tasks that require dynamic overlapped clustering, such as information organization, creation of document taxonomies and hierarchical topic detection.
引用
收藏
页码:187 / 194
页数:8
相关论文
共 50 条
  • [1] A Speed-Up Hierarchical Compact Clustering Algorithm for Dynamic Document Collections
    Gil-Garcia, Reynaldo
    Pons-Porrata, Aurora
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, PROCEEDINGS, 2009, 5856 : 379 - 386
  • [2] Document clustering with hierarchical algorithm
    Wang, Y
    Hodges, J
    Proceedings of the 8th Joint Conference on Information Sciences, Vols 1-3, 2005, : 1614 - 1617
  • [3] Dynamic hierarchical algorithms for document clustering
    Gil-Garcia, Reynaldo
    Pons-Porrata, Aurora
    PATTERN RECOGNITION LETTERS, 2010, 31 (06) : 469 - 477
  • [4] A Document Clustering Method based on Hierarchical Algorithm with Model Clustering
    Sun, Haojun
    Liu, Zhihui
    Kong, Lingjun
    2008 22ND INTERNATIONAL WORKSHOPS ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS, VOLS 1-3, 2008, : 1229 - +
  • [5] Hierarchical clustering in medical document collections: The BIC-means method
    Hourdakis, Nikos
    Argyriou, Michalis
    Petrakis, Euripides G. M
    Milios, Evangelos E.
    Journal of Digital Information Management, 2010, 8 (02): : 71 - 77
  • [6] Dynamic hierarchical compact clustering algorithm
    Gil-García, R
    Badía-Contelles, JM
    Pons-Porrata, A
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2005, 3773 : 302 - 310
  • [7] A single-link method algorithm for clustering large document collections
    Kishida, K
    LIBRARY AND INFORMATION SCIENCE, 2002, (47): : 27 - 38
  • [8] Clustering Dynamic Textures with the Hierarchical EM Algorithm
    Chan, Antoni B.
    Coviello, Emanuele
    Lanckriet, Gert. R. G.
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 2022 - 2029
  • [9] An incremental document clustering algorithm based on a hierarchical agglomerative approach
    Joo, KH
    Lee, SJ
    DISTRIBUTED COMPUTING AND INTERNET TECHNOLOGY, PROCEEDINGS, 2005, 3816 : 321 - 332
  • [10] Measurement of clustering effectiveness for document collections
    Yuan, Meng
    Zobel, Justin
    Lin, Pauline
    INFORMATION RETRIEVAL JOURNAL, 2022, 25 (03): : 239 - 268