Order preserving clustering by finding frequent orders in gene expression data

Cited by: 0
Authors
Teng, Li [1]
Chan, Laiwan [1]
Affiliations
[1] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Hong Kong, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
This paper concerns the discovery of Order Preserving Clusters (OP-Clusters) in gene expression data, in each of which a subset of genes induces a similar linear ordering along a subset of conditions. After each gene vector is converted into an ordered label sequence, the problem is transformed into finding frequent orders appearing in the sequence set. We propose an algorithm for finding the frequent orders by iteratively Combining the most Frequent Prefixes and Suffixes (CFPS) in a statistical way. We also define the significance of an OP-Cluster. Our method scales well with the dimension of the dataset and the size of the cluster. Experimental studies on both synthetic datasets and a real gene expression dataset show that our approach is effective and efficient.
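The conversion step described in the abstract can be illustrated with a minimal Python sketch. It is not the CFPS algorithm itself, whose details are not given in this record. It only shows how a gene vector might be turned into an ordered label sequence and how support for a candidate order could be counted. The function names, the toy data, and the choice of ascending order with ties broken by condition position are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def to_label_sequence(expression, labels):
    """Return the condition labels sorted by ascending expression value.

    Assumption: ties keep their original condition order (sorted() is stable).
    """
    order = sorted(range(len(expression)), key=lambda i: expression[i])
    return [labels[i] for i in order]


def supports(sequence, order):
    """True if `order` occurs in `sequence` as a (possibly non-contiguous) subsequence."""
    it = iter(sequence)
    # `label in it` advances the iterator, so labels must appear in this order.
    return all(label in it for label in order)


def count_pairwise_orders(sequences):
    """Count, for each ordered pair (a, b), how many sequences place a before b."""
    counts = Counter()
    for seq in sequences:
        for a, b in combinations(seq, 2):
            counts[(a, b)] += 1
    return counts


# Hypothetical toy data: three genes measured under four conditions.
conditions = ["a", "b", "c", "d"]
genes = {
    "g1": [0.1, 0.9, 0.5, 0.3],
    "g2": [1.0, 4.0, 3.0, 2.0],
    "g3": [0.8, 0.2, 0.6, 0.4],
}

sequences = {g: to_label_sequence(v, conditions) for g, v in genes.items()}
pair_counts = count_pairwise_orders(sequences.values())

# Genes that share the candidate order a < d < c form a small OP-Cluster candidate.
candidate = ["a", "d", "c"]
cluster = sorted(g for g, seq in sequences.items() if supports(seq, candidate))
```

In this toy example, `g1` and `g2` differ in magnitude but induce the same ordering of conditions, which is the kind of pattern an OP-Cluster is meant to capture. A practical method would also need a way to search the space of candidate orders and to assess their significance, which this sketch does not attempt.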
Pages: 218-229
Number of pages: 12
Related papers
50 in total
  • [1] Mining order preserving patterns in microarray data by finding frequent orders
    Teng, Li
    Chan, Laiwan
    PROCEEDINGS OF THE 7TH IEEE INTERNATIONAL SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, VOLS I AND II, 2007, : 1019 - 1026
  • [2] Order-preserving clustering and its application to gene expression data
    Syeda-Mahmood, T
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, 2004, : 637 - 640
  • [3] Constrained query of order-preserving submatrix in gene expression data
    Tao Jiang
    Zhanhuai Li
    Xuequn Shang
    Bolin Chen
    Weibang Li
    Zhilei Yin
    Frontiers of Computer Science, 2016, 10 : 1052 - 1066
  • [4] An Algorithm for Discovering Deep Order Preserving Submatrix in Gene Expression Data
    Kuang, Qiuhua
    Zhang, Meizhen
    Ma, Zhihao
    Ma, Bo
    Liu, Zhiwen
    Xue, Yun
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 1678 - 1683
  • [5] Mining Bucket Order-Preserving SubMatrices in Gene Expression Data
    Fang, Qiong
    Ng, Wilfred
    Feng, Jianlin
    Li, Yuliang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (12) : 2218 - 2231
  • [6] Indexing and Search of Order-Preserving Submatrix for Gene Expression Data
    Jiang, Tao
    Chen, Bolin
    Li, Juntao
    Xu, Guoyu
    IEEE ACCESS, 2019, 7 : 184769 - 184785
  • [7] Constrained query of order-preserving submatrix in gene expression data
    Tao JIANG
    Zhanhuai LI
    Xuequn SHANG
    Bolin CHEN
    Weibang LI
    Zhilei YIN
    Frontiers of Computer Science, 2016, 10 (06) : 1052 - 1066
  • [8] Constrained query of order-preserving submatrix in gene expression data
    Jiang, Tao
    Li, Zhanhuai
    Shang, Xuequn
    Chen, Bolin
    Li, Weibang
    Yin, Zhilei
    FRONTIERS OF COMPUTER SCIENCE, 2016, 10 (06) : 1052 - 1066
  • [9] A Novel Evolutionary Algorithm for Bi-clustering of Gene Expression Data based on the Order Preserving Sub-Matrix (OPSM) Constraint
    Roh, Hongchan
    Park, Sanghyun
    8TH IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING, VOLS 1 AND 2, 2008, : 212 - 225
  • [10] Finding groups in gene expression data
    Hand, DJ
    Heard, NA
    JOURNAL OF BIOMEDICINE AND BIOTECHNOLOGY, 2005, (02): : 215 - 225