Improved Cuckoo Search Algorithm for Document Clustering

Citations: 7
Authors
Boushaki, Saida Ishak [1, 2]
Kamel, Nadjet [3, 4]
Bendjeghaba, Omar [2, 5]
Affiliations
[1] USTHB, LRIA, Boumerdes, Algeria
[2] Univ Boumerdes, Boumerdes, Algeria
[3] USTHB, LRIA, Setif, Algeria
[4] Univ Ferhat Abas Setif, Setif, Algeria
[5] UMBB, LREEI, Boumerdes, Algeria
Keywords
Document clustering; Vector space model; Cuckoo search; Cosine similarity; F-measure; Purity; Metaheuristic; Optimization
DOI
10.1007/978-3-319-19578-0_18
Chinese Library Classification (CLC)
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Efficient document clustering plays an important role in organizing and browsing information on the World Wide Web. K-means is one of the most popular clustering algorithms, due to its simplicity and efficiency. However, it may become trapped in a local minimum, which leads to poor results. Recently, cuckoo search based clustering has been shown to achieve interesting results. On the other hand, the number of iterations can increase dramatically because of its slow convergence. In this paper, we propose an improved cuckoo search clustering algorithm to overcome this weakness of conventional cuckoo search clustering. In this algorithm, the global search procedure is enhanced by a local search method. Experiments on four text document datasets and one standard dataset, extracted from well-known collections, show the effectiveness and robustness of the proposed algorithm, which significantly improves clustering quality in terms of fitness function, F-measure and purity.
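The abstract names cosine similarity, a fitness function and purity without defining them. The following is a minimal sketch of how such quantities are commonly computed for centroid-based document clustering; it is not taken from the paper. The function names, the choice of mean cosine similarity to the nearest centroid as the fitness, and the toy vectors are all assumptions made for illustration.

```python
import math
from collections import Counter


def cosine_similarity(a, b):
    """Cosine of the angle between two term-weight vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # convention chosen here for empty documents
    return dot / (norm_a * norm_b)


def assign(docs, centroids):
    """Label each document with the index of its most similar centroid."""
    return [
        max(range(len(centroids)),
            key=lambda k: cosine_similarity(doc, centroids[k]))
        for doc in docs
    ]


def fitness(docs, centroids):
    """Assumed fitness: mean similarity of each document to its centroid.

    Higher is better; the paper's exact definition may differ.
    """
    labels = assign(docs, centroids)
    total = sum(cosine_similarity(doc, centroids[label])
                for doc, label in zip(docs, labels))
    return total / len(docs)


def purity(labels, classes):
    """Fraction of documents that belong to the majority class of their cluster."""
    per_cluster = {}
    for label, cls in zip(labels, classes):
        per_cluster.setdefault(label, Counter())[cls] += 1
    majority = sum(max(counts.values()) for counts in per_cluster.values())
    return majority / len(labels)


# Toy data (hypothetical): four 2-term documents and two candidate centroids.
docs = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
centroids = [[1.0, 0.0], [0.0, 1.0]]
labels = assign(docs, centroids)
```

In a metaheuristic such as cuckoo search, each candidate solution would typically encode a set of centroids, and a function like `fitness` would score it; purity and F-measure are then computed against known class labels to evaluate the final clustering.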
Pages: 217-228
Page count: 12