Order preserving clustering by finding frequent orders in gene expression data

Cited by: 0
Authors
Teng, Li [1]
Chan, Laiwan [1]
Affiliation
[1] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Hong Kong, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
This paper concerns the discovery of Order Preserving Clusters (OP-Clusters) in gene expression data, in each of which a subset of genes induces a similar linear ordering along a subset of conditions. After converting each gene vector into an ordered label sequence, the problem is transformed into finding frequent orders appearing in the sequence set. We propose an algorithm for finding the frequent orders by iteratively Combining the most Frequent Prefixes and Suffixes (CFPS) in a statistical way. We also define the significance of an OP-Cluster. Our method scales well with the dimension of the dataset and the size of the cluster. Experimental studies on both synthetic datasets and a real gene expression dataset show that our approach is effective and efficient.
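
The conversion step described in the abstract can be illustrated with a small sketch. This is not the CFPS algorithm, whose details are not given here; it is a brute-force enumeration that only shows what "ordered label sequence" and "frequent order" might mean in practice. The function names `to_label_sequence` and `frequent_orders`, the `min_support` threshold, and the toy data are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def to_label_sequence(expression):
    """Return condition indices sorted by ascending expression value.

    Assumption: a gene's "ordered label sequence" is the list of
    condition labels ranked by that gene's expression level.
    """
    return tuple(sorted(range(len(expression)), key=lambda c: expression[c]))


def frequent_orders(genes, length, min_support):
    """Count orders of `length` conditions shared by at least `min_support` genes.

    Brute force: enumerates every subsequence of each label sequence,
    so the cost grows combinatorially with the number of conditions.
    """
    counts = Counter()
    for expression in genes:
        sequence = to_label_sequence(expression)
        # combinations() keeps elements in sequence order, so every
        # tuple it yields is an order-preserving subsequence.
        for order in combinations(sequence, length):
            counts[order] += 1
    return {order: n for order, n in counts.items() if n >= min_support}


# Toy data: three genes measured under four conditions (hypothetical values).
genes = [
    [0.1, 0.9, 0.5, 0.3],
    [1.0, 4.0, 3.0, 2.0],
    [2.0, 1.0, 5.0, 0.5],
]

result = frequent_orders(genes, length=3, min_support=2)
```

The first two genes have different magnitudes but the same ranking of conditions, so they support the same orders, while the third gene supports none of them. The enumeration above is exponential in the number of conditions, which is the kind of cost that a prefix and suffix combination strategy would aim to avoid.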
Pages: 218-229
Number of pages: 12