Unsupervised text feature selection technique based on hybrid particle swarm optimization algorithm with genetic operators for the text clustering

被引:1
|
作者
Laith Mohammad Abualigah
Ahamad Tajudin Khader
机构
[1] Universiti Sains Malaysia (USM),School of Computer Sciences
来源
关键词
Unsupervised text feature selection; Particle swarm optimization; Genetic operators; K-mean text clustering; Hybridization;
D O I
暂无
中图分类号
学科分类号
摘要
The text clustering technique is an appropriate method used to partition a huge amount of text documents into groups. The documents size affects the text clustering by decreasing its performance. Subsequently, text documents contain sparse and uninformative features, which reduce the performance of the underlying text clustering algorithm and increase the computational time. Feature selection is a fundamental unsupervised learning technique used to select a new subset of informative text features to improve the performance of the text clustering and reduce the computational time. This paper proposes a hybrid of particle swarm optimization algorithm with genetic operators for the feature selection problem. The k-means clustering is used to evaluate the effectiveness of the obtained features subsets. The experiments were conducted using eight common text datasets with variant characteristics. The results show that the proposed algorithm hybrid algorithm (H-FSPSOTC) improved the performance of the clustering algorithm by generating a new subset of more informative features. The proposed algorithm is compared with the other comparative algorithms published in the literature. Finally, the feature selection technique encourages the clustering algorithm to obtain accurate clusters.
引用
收藏
页码:4773 / 4795
页数:22
相关论文
共 50 条
  • [1] Unsupervised text feature selection technique based on hybrid particle swarm optimization algorithm with genetic operators for the text clustering
    Abualigah, Laith Mohammad
    Khader, Ahamad Tajudin
    [J]. JOURNAL OF SUPERCOMPUTING, 2017, 73 (11): : 4773 - 4795
  • [2] Unsupervised Feature Selection Technique Based on Genetic Algorithm for Improving the Text Clustering
    Abualigah, Laith Mohammad
    Khader, Ahamad Tajudin
    Al-Betar, Mohammed Azmi
    [J]. 2016 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (CSIT), 2016,
  • [3] Hybrid particle swarm optimization algorithm for text feature selection problems
    Mourad Nachaoui
    Issam Lakouam
    Imad Hafidi
    [J]. Neural Computing and Applications, 2024, 36 : 7471 - 7489
  • [4] Hybrid particle swarm optimization algorithm for text feature selection problems
    Nachaoui, Mourad
    Lakouam, Issam
    Hafidi, Imad
    [J]. NEURAL COMPUTING & APPLICATIONS, 2024, 36 (13): : 7471 - 7489
  • [5] Unsupervised Feature Selection Technique Based on Harmony Search Algorithm for Improving the Text Clustering
    Abualigah, Laith Mohammad
    Khader, Ahamad Tajudin
    Al-Betar, Mohammed Azmi
    [J]. 2016 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (CSIT), 2016,
  • [6] Hybrid Grasshopper and Chameleon Swarm Optimization Algorithm for Text Feature Selection with Density Peaks Clustering
    Purushothaman, R.
    Selvakumar, S.
    Rajagopalan, S. P.
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2022, 21 (03)
  • [7] Improved particle swarm optimization algorithm and its application in text feature selection
    Lu, Yonghe
    Liang, Minghui
    Ye, Zeyuan
    Cao, Lichao
    [J]. APPLIED SOFT COMPUTING, 2015, 35 : 629 - 636
  • [8] A new unsupervised feature selection method for text clustering based on genetic algorithms
    Pirooz Shamsinejadbabki
    Mohammad Saraee
    [J]. Journal of Intelligent Information Systems, 2012, 38 : 669 - 684
  • [9] A new unsupervised feature selection method for text clustering based on genetic algorithms
    Shamsinejadbabki, Pirooz
    Saraee, Mohammad
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2012, 38 (03) : 669 - 684
  • [10] FEATURE SELECTION USING PARTICLE SWARM OPTIMIZATION IN TEXT CATEGORIZATION
    Aghdam, Mehdi Hosseinzadeh
    Heidari, Setareh
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2015, 5 (04) : 231 - 238