Order preserving clustering by finding frequent orders in gene expression data

被引:0
|
作者
Teng, Li [1 ]
Chan, Laiwan [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Hong Kong, Peoples R China
关键词
D O I
暂无
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
This paper concerns the discovery of Order Preserving Clusters (OP-Clusters) in gene expression data, in each of which a subset of genes induce a similar linear ordering along a subset of conditions. After converting each gene vector into an ordered label sequence. The problem is transferred into finding frequent orders appearing in the sequence set. We propose an algorithm of finding the frequent orders by iteratively Combining the most Frequent Prefixes and Suffixes (CFPS) in a statistical way. We also define the significance of an OP-Cluster. Our method has good scale-up property with dimension of the dataset and size of the cluster. Experimental study on both synthetic datasets and real gene expression dataset shows our approach is very effective and efficient.
引用
收藏
页码:218 / 229
页数:12
相关论文
共 50 条
  • [41] Clustering of high throughput gene expression data
    Pirim, Harun
    Eksioglu, Burak
    Perkins, Andy D.
    Yuceer, Cetin
    COMPUTERS & OPERATIONS RESEARCH, 2012, 39 (12) : 3046 - 3061
  • [42] Clustering gene expression data:: an experimental analysis
    Ortiz-Gama, S
    Sucar, LE
    Rodríguez, AF
    PROCEEDINGS OF THE FIFTH MEXICAN INTERNATIONAL CONFERENCE IN COMPUTER SCIENCE (ENC 2004), 2004, : 168 - 175
  • [43] Order preserving hierarchical agglomerative clustering
    Bakkelund, Daniel
    MACHINE LEARNING, 2022, 111 (05) : 1851 - 1901
  • [44] Order preserving hierarchical agglomerative clustering
    Daniel Bakkelund
    Machine Learning, 2022, 111 : 1851 - 1901
  • [45] Preserving similarity order for unsupervised clustering
    Wang, Jinghua
    Wang, Li
    Jiang, Jianmin
    PATTERN RECOGNITION, 2022, 128
  • [46] Finding Frequent Entities in Continuous Data
    Alet, Ferran
    Chitnis, Rohan
    Kaelbling, Leslie P.
    Lozano-Perez, Tomas
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 1992 - 1999
  • [47] On finding frequent elements in a data stream
    Charikar, Moses
    Chen, Kevin
    Farach-Colton, Martin
    APPROXIMATION, RANDOMIZATION, AND COMBINATORIAL OPTIMIZATION: ALGORITHMS AND TECHNIQUES, 2007, 4627 : 584 - +
  • [48] Finding frequent items in data streams
    Charikar, M
    Chen, K
    Farach-Colton, M
    THEORETICAL COMPUTER SCIENCE, 2004, 312 (01) : 3 - 15
  • [49] Finding the Frequent Items in Streams of Data
    Cormode, Graham
    Hadjieleftheriou, Marios
    COMMUNICATIONS OF THE ACM, 2009, 52 (10) : 97 - 105
  • [50] Finding frequent items in data streams
    Charikar, M
    Chen, K
    Farach-Colton, M
    AUTOMATA, LANGUAGES AND PROGRAMMING, 2002, 2380 : 693 - 703