Clustering of short time-course gene expression data with dissimilar replicates

被引:1
|
作者
Cinar, Ozan [1 ]
Ilk, Ozlem [2 ]
Iyigun, Cem [3 ]
机构
[1] Maastricht Univ, Dept Psychiat & Neuropsychol, Maastricht, Netherlands
[2] Middle East Tech Univ, Dept Stat, Ankara, Turkey
[3] Middle East Tech Univ, Dept Ind Engn, Ankara, Turkey
关键词
Microarray gene expression; Short time-series; Replication; Distance; Clustering; Cluster validation; SERIES DATA; MICROARRAY EXPERIMENTS; FORECAST DENSITIES; DNA MICROARRAY; CELL-CYCLE; PROFILES; PATTERNS; MODEL; CLASSIFICATION; IDENTIFICATION;
D O I
10.1007/s10479-017-2583-3
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Microarrays are used in genetics and medicine to examine large numbers of genes simultaneously through their expression levels under any condition such as a disease of interest. The information from these experiments can be enriched by following the expression levels through time and biological replicates. The purpose of this study is to propose an algorithm which clusters the genes with respect to the similarities between their behaviors through time. The algorithm is also aimed at highlighting the genes which show different behaviors between the replicates and separating the constant genes that keep their baseline expression levels throughout the study. Finally, we aim to feature cluster validation techniques to suggest a sensible number of clusters when it is not known a priori. The illustrations show that the proposed algorithm in this study offers a fast approach to clustering the genes with respect to their behavior similarities, and also separates the constant genes and the genes with dissimilar replicates without any need for pre-processing. Moreover, it is also successful at suggesting the correct number of clusters when that is not known.
引用
收藏
页码:405 / 428
页数:24
相关论文
共 50 条
  • [11] Autoregressive-model based dynamic fuzzy clustering for time-course gene expression data
    School of Information, Southern Yantze University, Wuxi, 214122, China
    Biotechnology, 2008, 1 (59-65)
  • [12] Partial mixture model for tight clustering of gene expression time-course
    Yuan, Yinyin
    Li, Chang-Tsun
    Wilson, Roland
    BMC BIOINFORMATICS, 2008, 9 (1)
  • [13] Partial mixture model for tight clustering of gene expression time-course
    Yinyin Yuan
    Chang-Tsun Li
    Roland Wilson
    BMC Bioinformatics, 9
  • [14] Time-Course Gene Set Analysis for Longitudinal Gene Expression Data
    Hejblum, Boris P.
    Skinner, Jason
    Thiebaut, Rodolphe
    PLOS COMPUTATIONAL BIOLOGY, 2015, 11 (06)
  • [15] Time-course data prediction for repeatedly measured gene expression
    Bhattacharjee, Atanu
    Vishwakarma, Gajendra K.
    INTERNATIONAL JOURNAL OF BIOMATHEMATICS, 2019, 12 (04)
  • [16] Screening and Clustering for Time-course Yeast Microarray Gene Expression Data using Gaussian Process Regression
    Kim, Jaehee
    Kim, Taehoun
    KOREAN JOURNAL OF APPLIED STATISTICS, 2013, 26 (03) : 389 - 399
  • [17] Clustering short time series gene expression data
    Ernst, J
    Nau, GJ
    Bar-Joseph, Z
    BIOINFORMATICS, 2005, 21 : I159 - I168
  • [18] Clustering of time-course gene expression data using a mixed-effects model with B-splines
    Luan, YH
    Li, HZ
    BIOINFORMATICS, 2003, 19 (04) : 474 - 482
  • [19] Distance functions for clustering time course gene expression data
    Chalasani, V
    Sundaram, S
    METMBS'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MATHEMATICS AND ENGINEERING TECHNIQUES IN MEDICINE AND BIOLOGICAL SCIENCES, 2003, : 515 - 518
  • [20] Finding explained groups of time-course gene expression profiles with predictive clustering trees
    Slavkov, Ivica
    Gjorgjioski, Valentin
    Struyf, Jan
    Dzeroski, Saso
    MOLECULAR BIOSYSTEMS, 2010, 6 (04) : 729 - 740