Solving Document Clustering Problem through Meta Heuristic Algorithm- Black Hole

被引:0
|
作者
Rafi, Muhammad [1 ]
Aamer, Bilal [1 ]
Naseem, Mubashir [1 ]
Osama, Muhammad [1 ]
机构
[1] Natl Univ Comp & Emerging Sci, Karachi Campus, Peshawar, Pakistan
关键词
D O I
10.1145/3184066.3184085
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The paper proposed a soft computing approach to solve document clustering problem. Document clustering is a specialized clustering problem in which textual documents autonomously segregated to a number of identifiable, subject homogenous and smaller sub-collections (also called clusters). Identifying implicit textual patterns within the documents is a challenging aspect as there can be thousands of such textual features. Partition clustering algorithm like k-means is mainly used for this problem. There are several drawbacks in k-means algorithm such as (i) initial seeds dependency, and (ii) it traps into local optimal solution. Although every k-means solution may contain some good partial arrangements for clustering. Meta-heuristic algorithm like Black Hole (BH) uses certain trade-off of randomization and local search for finding the optimal and near optimal solution. Our motivation comes from the fact that meta-heuristic optimization can quickly produce a global optimal solution using random k-means initial solution. The contributions from this research are (i) an implementation of black hole algorithm using k-mean as embedding (ii) The phenomena of global search and local search optimization are used as parameters adjustments. A series of experiments are performed with our proposed method on standard text mining datasetslike: (i) NEWS20, (ii) Reuters and (iii) WebKB and results are evaluated on Purity and Silhouette Index. In comparison the proposed method outperforms the basic k-means, GA with k-means embedding and quickly converges to global or near global optimal solution.
引用
收藏
页码:77 / 81
页数:5
相关论文
共 50 条
  • [1] A Novel Hybrid Clustering Approach Based on Black Hole Algorithm for Document Clustering
    Malik, Fazila
    Khan, Salabat
    Rizwan, Atif
    Atteia, Ghada
    Samee, Nagwan Abdel
    IEEE ACCESS, 2022, 10 : 97310 - 97326
  • [2] A Novel Hybrid Clustering Approach Based on Black Hole Algorithm for Document Clustering
    Malik, Fazila
    Khan, Salabat
    Rizwan, Atif
    Atteia, Ghada
    Samee, Nagwan Abdel
    IEEE Access, 2022, 10 : 97310 - 97326
  • [3] A Black Hole Algorithm for Solving the Set Covering Problem
    Soto, Ricardo
    Crawford, Broderick
    Figueroa, Ignacio
    Niklander, Stefanie
    Olguin, Eduardo
    TRENDS IN APPLIED KNOWLEDGE-BASED SYSTEMS AND DATA SCIENCE, 2016, 9799 : 855 - 861
  • [4] A black hole algorithm for solving the set covering problem
    Soto, Ricardo (ricardo.soto@ucv.cl), 1600, Springer Verlag (9799):
  • [5] Adaptive Black Hole Algorithm for Solving the Set Covering Problem
    Soto, Ricardo
    Crawford, Broderick
    Olivares, Rodrigo
    Taramasco, Carla
    Figueroa, Ignacio
    Gomez, Alvaro
    Castro, Carlos
    Paredes, Fernando
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2018, 2018
  • [6] Solving travelling salesman problem using black hole algorithm
    Hatamlou, Abdolreza
    SOFT COMPUTING, 2018, 22 (24) : 8167 - 8175
  • [7] Solving travelling salesman problem using black hole algorithm
    Abdolreza Hatamlou
    Soft Computing, 2018, 22 : 8167 - 8175
  • [8] Black Hole Mechanics Optimization: a novel meta-heuristic algorithm
    Kaveh A.
    Seddighian M.R.
    Ghanadpour E.
    Asian Journal of Civil Engineering, 2020, 21 (7) : 1129 - 1149
  • [9] Black hole algorithm as a heuristic approach for rare event classification problem
    Yildirim, Elif
    PAKISTAN JOURNAL OF STATISTICS AND OPERATION RESEARCH, 2023, 19 (04) : 623 - 635
  • [10] A heuristic approach to solving the software clustering problem
    Mitchell, BS
    INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE, PROCEEDINGS, 2003, : 285 - 288