Basic Firefly Algorithm for Document Clustering

被引:0
|
作者
Mohammed, Athraa Jasim [1 ,2 ]
Yusof, Yuhanis [1 ]
Husni, Husniza [1 ]
机构
[1] Univ Utara Malaysia, Sch Comp, Sintok 06010, Kedah Darul Ama, Malaysia
[2] Univ Technol Baghdad, Informat & Commun Technol Ctr, Baghdad, Iraq
关键词
D O I
10.1063/1.4937068
中图分类号
O59 [应用物理学];
学科分类号
摘要
The Document clustering plays significant role in Information Retrieval (IR) where it organizes documents prior to the retrieval process. To date, various clustering algorithms have been proposed and this includes the K-means and Particle Swarm Optimization. Even though these algorithms have been widely applied in many disciplines due to its simplicity, such an approach tends to be trapped in a local minimum during its search for an optimal solution. To address the shortcoming, this paper proposes a Basic Firefly (Basic FA) algorithm to cluster text documents. The algorithm employs the Average Distance to Document Centroid (ADDC) as the objective function of the search. Experiments utilizing the proposed algorithm were conducted on the 20Newsgroups benchmark dataset. Results demonstrate that the Basic FA generates a more robust and compact clusters than the ones produced by K-means and Particle Swarm Optimization (PSO).
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Clustering using firefly algorithm: Performance study
    Senthilnath, J.
    Omkar, S. N.
    Mani, V.
    [J]. SWARM AND EVOLUTIONARY COMPUTATION, 2011, 1 (03) : 164 - 171
  • [2] Document clustering with hierarchical algorithm
    Wang, Y
    Hodges, J
    [J]. Proceedings of the 8th Joint Conference on Information Sciences, Vols 1-3, 2005, : 1614 - 1617
  • [3] A NEURAL ALGORITHM FOR DOCUMENT CLUSTERING
    MACLEOD, KJ
    ROBERTSON, W
    [J]. INFORMATION PROCESSING & MANAGEMENT, 1991, 27 (04) : 337 - 346
  • [4] Improved density peaks clustering based on firefly algorithm
    Zhao J.
    Tang J.
    Shi A.
    Fan T.
    Xu L.
    [J]. Xu, Lizhong (lxu0530@126.com), 1600, Inderscience Enterprises Ltd. (15): : 24 - 42
  • [5] Improved density peaks clustering based on firefly algorithm
    Zhao, Jia
    Tang, Jingjing
    Shi, Aiye
    Fan, Tanghuai
    Xu, Lizhong
    [J]. INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2020, 15 (01) : 24 - 42
  • [6] Application of fuzzy clustering algorithm in Chinese document clustering
    Li, Jiafu
    Zhang, Yafei
    Lu, Jianjiang
    [J]. Jisuanji Gongcheng/Computer Engineering, 2002, 28 (04):
  • [7] Application of Algorithm CARDBK in Document Clustering
    ZHU Yehang
    ZHANG Mingjie
    SHI Feng
    [J]. Wuhan University Journal of Natural Sciences, 2018, 23 (06) : 514 - 524
  • [8] An improved clustering algorithm for web document
    Wang, Jing
    Liu, Zhijing
    [J]. Journal of Information and Computational Science, 2009, 6 (02): : 959 - 966
  • [9] An extended chameleon algorithm for document clustering
    AmritaVishwaVidyapeetham, Dept. of Computer Science and Application, India
    [J]. Adv. Intell. Sys. Comput., (335-348):
  • [10] A Robust Algorithm for Fuzzy Document Clustering
    Chen, Lifei
    Wang, Shengrui
    Jiang, Qingshan
    [J]. 2009 INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS: WAINA, VOLS 1 AND 2, 2009, : 679 - +