Order preserving clustering by finding frequent orders in gene expression data

Cited by: 0
Authors
Teng, Li [1]
Chan, Laiwan [1]
Affiliation
[1] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Hong Kong, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
This paper concerns the discovery of Order Preserving Clusters (OP-Clusters) in gene expression data, in each of which a subset of genes induces a similar linear ordering along a subset of conditions. After each gene vector is converted into an ordered label sequence, the problem is transformed into finding frequent orders appearing in the sequence set. We propose an algorithm for finding the frequent orders by iteratively Combining the most Frequent Prefixes and Suffixes (CFPS) in a statistical way. We also define the significance of an OP-Cluster. Our method scales well with the dimension of the dataset and the size of the cluster. Experimental studies on both synthetic datasets and a real gene expression dataset show that our approach is effective and efficient.
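The two notions the abstract relies on, the ordered label sequence of a gene and a set of genes that preserve one order over a subset of conditions, can be illustrated with a minimal Python sketch. This is not the CFPS algorithm, which the abstract does not specify in detail; it only checks the OP-Cluster property for a given candidate order. The function names, the toy expression values, and the choice of strict inequality for ties are all assumptions made for illustration.

```python
def to_label_sequence(expr):
    """Return condition indices ordered by increasing expression value.

    Hypothetical helper: the abstract only says each gene vector is
    converted into an ordered label sequence.
    """
    return sorted(range(len(expr)), key=lambda c: expr[c])


def supports_order(expr, order):
    """True if expression strictly increases along the conditions in `order`.

    Strict inequality is an assumption; the source does not say how ties
    are handled.
    """
    return all(expr[a] < expr[b] for a, b in zip(order, order[1:]))


# Toy data (invented for illustration): three genes, four conditions.
genes = {
    "g1": [0.2, 1.5, 0.9, 3.0],
    "g2": [1.0, 4.0, 2.5, 9.0],
    "g3": [5.0, 0.1, 2.0, 1.0],
}

# Candidate order over a subset of conditions: 0 < 2 < 3.
order = [0, 2, 3]

# Genes whose expression follows the candidate order form the cluster.
cluster = [name for name, expr in genes.items() if supports_order(expr, order)]

print(to_label_sequence(genes["g1"]))
print(cluster)
```

In this toy example, g1 and g2 differ in magnitude but rank conditions 0, 2, and 3 identically, so both support the order, while g3 does not. A frequent-order search would look for orders supported by many genes rather than testing a single given one.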
Pages: 218-229
Number of pages: 12