CCGA: Co-similarity based Co-clustering using genetic algorithm

被引:27
|
作者
Hussain, Syed Fawad [1 ,2 ]
Iqbal, Shahid [2 ]
机构
[1] GIK Inst, Machine Learning & Data Sci MDS Lab, Topi, Khyber Pakhtunk, Pakistan
[2] GIK Inst, Fac Comp Sci & Engn, Topi, Khyber Pakhtunk, Pakistan
关键词
Clustering; Co-clustering; Co-similarity; Genetic algorithms; X-Sim; MULTIOBJECTIVE EVOLUTIONARY ALGORITHMS; PARTICLE SWARM OPTIMIZATION;
D O I
10.1016/j.asoc.2018.07.045
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Co-clustering refers to the simultaneous clustering of objects and their features. It is used as a clustering technique when the data exhibit similarities only in a subset of features instead of the whole feature set. Clustering (and co-clustering) has been proven to be an optimization problem which makes evolutionary algorithms a suitable candidate for optimizing the cluster labels. Genetic algorithms have been used in the literature for data clustering by optimizing cluster labels to reduce mean distance from cluster centers. Using only genetic operators and Euclidean distances, however, have resulted in limited success. In this paper, we propose to use a Genetic Algorithm framework for co-clustering data. What makes this contribution significant and distinctly unique is that we propose the use of a co-similarity objective function that uses multiple objective functions to seamlessly integrate the co-clustering framework into the optimization problem. Co-similarity matrices are intertwined row and column similarity matrices that are computed on the basis of each other. To the best of our knowledge, we are the first to propose the use of Genetic Algorithm to optimize co-similarity matrices for the co-clustering task. We conduct several experiments to analyse the performance of our proposed approach and compare them with numerous state-of-the-art clustering and co-clustering algorithms, on a variety of real world datasets. Our results show that the proposed approach significantly outperforms other clustering and co-clustering algorithms on all the datasets tested. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:30 / 42
页数:13
相关论文
共 50 条
  • [1] Biclustering of human cancer microarray data using co-similarity based co-clustering
    Hussain, Syed Fawad
    Ramazan, Muhammad
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2016, 55 : 520 - 531
  • [2] Co-clustering by similarity refinement
    Zhang, Jian
    [J]. ICMLA 2007: SIXTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2007, : 381 - 386
  • [3] Bi-clustering Gene Expression Data Using Co-similarity
    Hussain, Syed Fawad
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PT I, 2011, 7120 : 190 - 200
  • [4] Text sentiment classification based on a genetic algorithm and word and document co-clustering
    E. V. Kotelnikov
    M. V. Pletneva
    [J]. Journal of Computer and Systems Sciences International, 2016, 55 : 106 - 114
  • [5] Text sentiment classification based on a genetic algorithm and word and document co-clustering
    Kotelnikov, E. V.
    Pletneva, M. V.
    [J]. JOURNAL OF COMPUTER AND SYSTEMS SCIENCES INTERNATIONAL, 2016, 55 (01) : 106 - 114
  • [6] HCC: A Hierarchical Co-Clustering Algorithm
    Li, Jingxuan
    Li, Tao
    [J]. SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010, : 861 - 862
  • [7] Robust fuzzy co-clustering algorithm
    Tjhi, William-Chandra
    Chen, Lihui
    [J]. 2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007, : 1591 - 1595
  • [8] A Spectral algorithm for Topographical Co-clustering
    Nicoleta, Rogovschi
    Labiod, Lazhar
    Nadif, Mohamed
    [J]. 2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2012,
  • [9] χ-Sim: A New Similarity Measure for the Co-clustering Task
    Bisson, Gilles
    Hussain, Fawad
    [J]. SEVENTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2008, : 211 - 217
  • [10] An Improved Collaborative Filtering Recommendation Algorithm Based on Co-clustering
    He, H. Q.
    Fan, Z. L.
    [J]. INTERNATIONAL CONFERENCE ON ADVANCED EDUCATIONAL TECHNOLOGY AND INFORMATION ENGINEERING (AETIE 2015), 2015, : 508 - 515