Document clustering with hierarchical algorithm

被引:0
|
作者
Wang, Y [1 ]
Hodges, J [1 ]
机构
[1] Mississippi State Univ, Dept Comp Sci & Engn, Mississippi State, MS 39762 USA
关键词
document clustering; information retrieval;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Document clustering is a widely used strategy for information retrieval and text data mining. Partitioning and hierarchical clustering methods are most widely used algorithms. Other investigators proposed to use bisecting K-means method for document clustering and their experimental results have indicated that the bisecting K-means method is the preferred method for document clustering [16]. However, in our research we have found that, whereas the bisecting K-means method has advantages when working with large datasets, a traditional hierarchical clustering algorithm still achieves the best performance for small datasets.
引用
收藏
页码:1614 / 1617
页数:4
相关论文
共 50 条
  • [31] An extended chameleon algorithm for document clustering
    AmritaVishwaVidyapeetham, Dept. of Computer Science and Application, India
    [J]. Adv. Intell. Sys. Comput., (335-348):
  • [32] A Robust Algorithm for Fuzzy Document Clustering
    Chen, Lifei
    Wang, Shengrui
    Jiang, Qingshan
    [J]. 2009 INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS: WAINA, VOLS 1 AND 2, 2009, : 679 - +
  • [33] Frequent Document Mining Algorithm with Clustering
    Soni, Rakesh Kumar
    Gupta, Neetesh
    Sinhal, Amit
    Sahu, Shiv K.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2015, 15 (09): : 38 - 43
  • [34] A Novel Algorithm for Automatic Document Clustering
    Agrawal, Ranjana
    Phatak, Madhura
    [J]. PROCEEDINGS OF THE 2013 3RD IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2013, : 877 - 882
  • [35] Application of Genetic Algorithm in Document Clustering
    Wei Jian-Xiang
    Liu Huai
    Sun Yue-hong
    Su Xin-Ning
    [J]. 2009 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND COMPUTER SCIENCE, VOL 1, PROCEEDINGS, 2009, : 145 - +
  • [36] An Improved AntTree Algorithm for Document Clustering
    Perez-Delgado, M. L.
    Escuadra, J.
    Anton, N.
    [J]. DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE, 2010, 79 : 481 - 488
  • [37] Basic Firefly Algorithm for Document Clustering
    Mohammed, Athraa Jasim
    Yusof, Yuhanis
    Husni, Husniza
    [J]. INNOVATION AND ANALYTICS CONFERENCE AND EXHIBITION (IACE 2015), 2015, 1691
  • [38] Topic-Constrained Hierarchical Clustering for Document Datasets
    Zhao, Ying
    [J]. ADVANCED DATA MINING AND APPLICATIONS, ADMA 2010, PT I, 2010, 6440 : 181 - 192
  • [39] Hierarchical Document Clustering based on Cosine Similarity measure
    Popat, Shraddha K.
    Deshmukh, Pramod B.
    Metre, Vishakha A.
    [J]. 2017 1ST INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND INFORMATION MANAGEMENT (ICISIM), 2017, : 153 - 159
  • [40] Hierarchical document clustering using frequent closed sets
    Kryszkiewicz, Marzena
    Skonieczny, Lukasz
    [J]. INTELLIGENT INFORMATION PROCESSING AND WEB MINING, PROCEEDINGS, 2006, : 489 - +