An Evolutionary Algorithm for Feature Selective Double Clustering of Text Documents

Cited by: 0
Authors
Nourashrafeddin, S. N. [1]
Milios, Evangelos [1]
Arnold, Dirk V. [1]
Affiliations
[1] Dalhousie Univ, Fac Comp Sci, Halifax, NS B3H 4R2, Canada
Keywords
Genetic algorithm; co-clustering; multiobjective optimization; text clustering; information
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
We propose FSDC, an evolutionary algorithm for Feature Selective Double Clustering of text documents. We first cluster the terms that occur in the document corpus. The term clusters are then fed into multiobjective genetic algorithms, which prune non-informative terms and form sets of keyterms representing topics. Based on these topic keyterms, representative documents are extracted for each topic and then used as seeds to cluster all documents in the dataset. We compare FSDC with several well-known co-clustering algorithms on real text datasets. The experimental results show that our algorithm can outperform the competitors.
Pages: 446-453
Page count: 8
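
The last two steps described in the abstract (extracting representative documents from topic keyterms, then using them as seeds to cluster all documents) can be illustrated with a minimal sketch. This is not the authors' implementation: the term clustering and the multiobjective genetic algorithms are omitted, the keyterm sets are supplied by hand, and the function names (`tf_vector`, `pick_seeds`, `cluster_from_seeds`), the keyterm-count scoring, and the cosine-similarity assignment are all assumptions made for illustration.

```python
import math
from collections import Counter


def tf_vector(text):
    """Term-frequency vector of a whitespace-tokenised, lower-cased text."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def pick_seeds(docs, topic_keyterms, per_topic=1):
    """For each topic, pick the documents containing the most keyterm occurrences.

    Assumption: keyterm counts stand in for whatever representativeness
    criterion the paper actually uses.
    """
    seeds = {}
    for topic, terms in topic_keyterms.items():
        ranked = sorted(range(len(docs)),
                        key=lambda i: -sum(docs[i][t] for t in terms))
        seeds[topic] = ranked[:per_topic]
    return seeds


def cluster_from_seeds(docs, seeds):
    """Assign every document to the topic whose seed centroid is most similar."""
    centroids = {}
    for topic, indices in seeds.items():
        total = Counter()
        for i in indices:
            total.update(docs[i])  # summed term counts act as the centroid
        centroids[topic] = total
    return [max(centroids, key=lambda t: cosine(d, centroids[t])) for d in docs]


# Toy corpus and hand-written keyterm sets (hypothetical data).
texts = [
    "genetic algorithm mutation crossover population",
    "crossover and mutation in a genetic population",
    "hockey team wins the game in overtime",
    "the team lost the hockey game",
]
topic_keyterms = {
    "evolution": {"genetic", "mutation", "crossover"},
    "sports": {"hockey", "team", "game"},
}

docs = [tf_vector(t) for t in texts]
seeds = pick_seeds(docs, topic_keyterms)
labels = cluster_from_seeds(docs, seeds)
print(seeds)
print(labels)
```

In this sketch the seeds only initialise the clusters; an iterative refinement step (for example, recomputing centroids after each assignment pass) could follow, but whether FSDC does so is not stated in the abstract.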