A novel pattern based clustering methodology for time-series microarray data

被引:11
|
作者
Phan, Sieu
Famili, Fazel [1 ]
Tang, Zoujian
Pan, Youlian
Liu, Ziying
Ouyang, Junjun
Lenferink, Anne
O'Connor, Maureen Mc-Court
机构
[1] Natl Res Council Canada, Inst Informat Technol, Ottawa, ON K1A 0R6, Canada
[2] Natl Res Council Canada, Biotechnol Res Inst, Montreal, PQ H4P 2R2, Canada
关键词
biological pattern recognition; time-series microarray data; data mining; clustering; co-expressed genes;
D O I
10.1080/00207160701203419
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Identification of co-expressed genes sharing similar biological behaviours is an essential step in functional genomics. Traditional clustering techniques are generally based on overall similarity of expression levels and often generate clusters with mixed profile patterns. A novel pattern recognition method for selecting co-expressed genes based on rate of change and modulation status of gene expression at each time interval is proposed in this paper. This method is capable of identifying gene clusters consisting of highly similar shapes of expression profiles and modulation patterns. Furthermore, we develop a quality index based on the semantic similarity in gene annotations to assess the likelihood of a cluster being a co-regulated group. The effectiveness of the proposed methodology is demonstrated by applying it to the well-known yeast sporulation dataset and an in-house cancer genomics dataset.
引用
收藏
页码:585 / 597
页数:13
相关论文
共 50 条
  • [1] Clustering short time-series microarray
    Ping, Loh Wei
    Abu Hasan, Yahya
    [J]. INTERNATIONAL CONFERENCE ON MATHEMATICAL BIOLOGY 2007, 2008, 971 : 39 - 46
  • [2] A methodology for index tracking based on time-series clustering
    Focardi, SM
    Fabozzi, FJ
    [J]. QUANTITATIVE FINANCE, 2004, 4 (04) : 417 - 425
  • [3] Alignment versus Variation Methods for Clustering Microarray Time-Series Data
    Subhani, Numanul
    Li, Yifeng
    Ngom, Alioune
    Rueda, Luis
    [J]. 2010 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2010,
  • [4] A time-series clustering methodology for knowledge extraction in energy consumption data
    Ruiz, L. G. B.
    Pegalajar, M. C.
    Arcucci, R.
    Molina-Solana, M.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 160
  • [5] A time-series clustering methodology for knowledge extraction in energy consumption data
    Ruiz, L.G.B.
    Pegalajar, M.C.
    Arcucci, R.
    Molina-Solana, M.
    [J]. Expert Systems with Applications, 2020, 160
  • [6] Multiple gene expression profile alignment for microarray time-series data clustering
    Subhani, Numanul
    Rueda, Luis
    Ngom, Alioune
    Burden, Conrad J.
    [J]. BIOINFORMATICS, 2010, 26 (18) : 2281 - 2288
  • [7] Gene clustering for time-series microarray with production outputs
    Chira, Camelia
    Sedano, Javier
    Villar, Jose R.
    Camara, Monica
    Prieto, Carlos
    [J]. SOFT COMPUTING, 2016, 20 (11) : 4301 - 4312
  • [8] Gene clustering for time-series microarray with production outputs
    Camelia Chira
    Javier Sedano
    José R. Villar
    Monica Camara
    Carlos Prieto
    [J]. Soft Computing, 2016, 20 : 4301 - 4312
  • [9] Clustering of multivariate time-series data
    Singhal, A
    Seborg, DE
    [J]. PROCEEDINGS OF THE 2002 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2002, 1-6 : 3931 - 3936
  • [10] Clustering multivariate time-series data
    Singhal, A
    Seborg, DE
    [J]. JOURNAL OF CHEMOMETRICS, 2005, 19 (08) : 427 - 438