Solving Document Clustering Problem through Meta Heuristic Algorithm: Black Hole

Cited by: 0
Authors
Rafi, Muhammad [1]
Aamer, Bilal [1]
Naseem, Mubashir [1]
Osama, Muhammad [1]
Affiliations
[1] Natl Univ Comp & Emerging Sci, Karachi Campus, Peshawar, Pakistan
Keywords
DOI
10.1145/3184066.3184085
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper proposes a soft computing approach to the document clustering problem. Document clustering is a specialized clustering problem in which textual documents are automatically partitioned into a number of smaller, identifiable, subject-homogeneous sub-collections (also called clusters). Identifying implicit textual patterns within the documents is challenging because there can be thousands of such textual features. Partitional clustering algorithms such as k-means are the main approach used for this problem. The k-means algorithm has several drawbacks: (i) it depends on the initial seeds, and (ii) it can become trapped in a locally optimal solution. Nevertheless, every k-means solution may contain some good partial arrangements for clustering. Meta-heuristic algorithms such as Black Hole (BH) use a trade-off between randomization and local search to find optimal or near-optimal solutions. Our motivation comes from the observation that meta-heuristic optimization, started from random k-means initial solutions, can quickly produce a globally optimal solution. The contributions of this research are (i) an implementation of the black hole algorithm with k-means embedding, and (ii) the use of global search and local search optimization for parameter adjustment. A series of experiments is performed with the proposed method on standard text mining datasets, namely (i) NEWS20, (ii) Reuters, and (iii) WebKB, and the results are evaluated using Purity and the Silhouette Index. In comparison, the proposed method outperforms basic k-means and GA with k-means embedding, and quickly converges to a global or near-global optimal solution.
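The abstract does not give implementation details, so the following is only a minimal sketch of how a black hole search seeded with k-means solutions might look, not the authors' method. It assumes the generic black hole heuristic (stars drift toward the best solution, and stars that cross an event horizon are re-created), reads "k-means embedding" as seeding each star with a few k-means steps, and uses the sum of squared distances as fitness. All function names and parameters (`black_hole_cluster`, `kmeans_star`, `n_stars`, and so on) are hypothetical, and centroids are not matched across stars before they are moved. A `purity` helper illustrates one of the two evaluation measures named above.

```python
import numpy as np

def nearest(X, centroids):
    """Return (labels, distances) of each row of X to its nearest centroid."""
    d = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
    return d.argmin(axis=1), d.min(axis=1)

def sse(X, centroids):
    """Fitness (assumed): sum of squared distances to the nearest centroid."""
    return float((nearest(X, centroids)[1] ** 2).sum())

def kmeans_star(X, k, rng, iters=3):
    """One candidate solution ('star'): random seeds refined by a few k-means steps."""
    centroids = X[rng.choice(len(X), size=k, replace=False)].copy()
    for _ in range(iters):
        labels, _ = nearest(X, centroids)
        for j in range(k):
            members = X[labels == j]
            if len(members):  # keep the old centroid if a cluster is empty
                centroids[j] = members.mean(axis=0)
    return centroids

def black_hole_cluster(X, k, n_stars=10, n_iter=30, seed=0):
    """Sketch of a black hole search over centroid sets (hypothetical API)."""
    X = np.asarray(X, dtype=float)
    rng = np.random.default_rng(seed)
    stars = [kmeans_star(X, k, rng) for _ in range(n_stars)]
    fit = np.array([sse(X, s) for s in stars])
    bh = int(fit.argmin())  # the best star acts as the black hole
    history = []
    for _ in range(n_iter):
        # Global search: every other star moves a random fraction toward the black hole.
        for i in range(n_stars):
            if i != bh:
                stars[i] = stars[i] + rng.random() * (stars[bh] - stars[i])
                fit[i] = sse(X, stars[i])
        bh = int(fit.argmin())  # a moved star may become the new black hole
        # Event horizon: stars that come too close are replaced by fresh k-means stars.
        radius = fit[bh] / max(fit.sum(), 1e-12)
        for i in range(n_stars):
            if i != bh and np.linalg.norm(stars[i] - stars[bh]) < radius:
                stars[i] = kmeans_star(X, k, rng)
                fit[i] = sse(X, stars[i])
        bh = int(fit.argmin())
        history.append(float(fit[bh]))
    return stars[bh], nearest(X, stars[bh])[0], history

def purity(labels_true, labels_pred):
    """Fraction of documents that belong to the majority class of their cluster."""
    labels_true = np.asarray(labels_true)
    labels_pred = np.asarray(labels_pred)
    total = 0
    for c in np.unique(labels_pred):
        _, counts = np.unique(labels_true[labels_pred == c], return_counts=True)
        total += counts.max()
    return total / len(labels_true)
```

In this sketch `X` would be a dense document-feature matrix (for example TF-IDF vectors), and `black_hole_cluster(X, k=3)` returns the best centroid set, the cluster label of each document, and the best fitness per iteration. Because the black hole itself is never moved, that fitness trace cannot get worse from one iteration to the next.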
Pages: 77-81
Page count: 5