Hierarchical Star Clustering Algorithm for Dynamic Document Collections

被引:0
|
作者
Gil-Garcia, Reynaldo [1 ]
Pons-Porrata, Aurora [1 ]
机构
[1] Univ Oriente, Ctr Pattern Recognit & Data Min, Santiago De Cuba, Cuba
来源
PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS | 2008年 / 5197卷
关键词
hierarchial clustering; dynamic clustering; overlapped clusters;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a new clustering algorithm called Dynamic Hierarchical Star is introduced. Our approach aims to construct a hierarchy of overlapped clusters, dealing with dynamic data sets. The experimental results on several benchmark text collections show that this method obtains smaller hierarchies than traditional algorithms while achieving a similar clustering quality. Therefore, we advocate its use for tasks that require dynamic overlapped clustering, such as information organization, creation of document taxonomies and hierarchical topic detection.
引用
收藏
页码:187 / 194
页数:8
相关论文
共 50 条
  • [21] OHDOCLUS - Online and Hierarchical Document Clustering
    Encarnacao, Rui
    Oliveira, Hugo Goncalo
    PROCEEDINGS OF THE EIGHTH EUROPEAN STARTING AI RESEARCHER SYMPOSIUM (STAIRS 2016), 2016, 284 : 51 - 62
  • [22] Hierarchical clustering algorithms for document datasets
    Zhao, Y
    Karypis, G
    DATA MINING AND KNOWLEDGE DISCOVERY, 2005, 10 (02) : 141 - 168
  • [23] A dynamic SOM algorithm for clustering large-scale document collection
    Luo, Kegang
    Liu, Yuanchao
    Wang, Xiaolong
    ALPIT 2007: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON ADVANCED LANGUAGE PROCESSING AND WEB INFORMATION TECHNOLOGY, 2007, : 15 - +
  • [24] Fuzzy clustering for topic analysis and summarization of document collections
    Witte, Rene
    Bergler, Sabine
    ADVANCES IN ARTIFICIAL INTELLIGENCE, 2007, 4509 : 476 - +
  • [25] A NEURAL ALGORITHM FOR DOCUMENT CLUSTERING
    MACLEOD, KJ
    ROBERTSON, W
    INFORMATION PROCESSING & MANAGEMENT, 1991, 27 (04) : 337 - 346
  • [26] Clustering of document collections to support interactive text exploration
    Nürnberger, A
    Klose, A
    Kruse, R
    Hartmann, G
    Richards, M
    EXPLORATORY DATA ANALYSIS IN EMPIRICAL RESEARCH, PROCEEDINGS, 2003, : 257 - 265
  • [27] Design and evaluation of a parallel document clustering algorithm based on hierarchical latent semantic analysis
    Seshadri, Karthick
    Iyer, K. Viswanathan
    Shalinie, Mercy S.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (13):
  • [28] A Document Clustering Algorithm Based on Semi-constrained Hierarchical Latent Dirichlet Allocation
    Xu, Jungang
    Zhou, Shilong
    Qiu, Lin
    Liu, Shengyuan
    Li, Pengfei
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2014, 2014, 8793 : 49 - 60
  • [29] Exploration of textual document archives using a fuzzy hierarchical clustering algorithm in the GAMBAL system
    Torra, V
    Miyamoto, S
    Lanau, S
    INFORMATION PROCESSING & MANAGEMENT, 2005, 41 (03) : 587 - 598
  • [30] Generating hierarchical document indices from common denominators in large document collections
    OKane, KC
    INFORMATION PROCESSING & MANAGEMENT, 1996, 32 (01) : 105 - 115