Clustering longitudinal profiles using P-splines and mixed effects models applied to time-course gene expression data

被引:32
|
作者
Coffey, N. [1 ]
Hinde, J. [2 ]
Holian, E. [2 ]
机构
[1] Univ Coll Dublin, Sch Math Sci Syst Biol Ireland, Dublin, Ireland
[2] Natl Univ Ireland, Sch Math Stat & Appl Math, Galway, Ireland
基金
爱尔兰科学基金会;
关键词
Longitudinal profiles; Time-course gene expression; Clustering; Mixed effects model; Finite mixture model; FUNCTIONAL DATA; MIXTURE MODEL; PATTERNS; COMPONENTS;
D O I
10.1016/j.csda.2013.04.001
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Longitudinal data is becoming increasingly common and various methods have been developed to analyze this type of data. Profiles from time-course gene expression studies, where cluster analysis plays an important role to identify groups of co-expressed genes overtime, are investigated. A number of procedures have been used to cluster time-course gene expression data, however there are many limitations to the techniques previously described. An alternative approach is proposed, which aims to alleviate some of these limitations. The method exploits the connection between the linear mixed effects model and P-spline smoothing to simultaneously smooth the gene expression data to remove any measurement error/noise and cluster the expression profiles using finite mixtures of mixed effects models. This approach has a number of advantages, including decreased computation time and ease of implementation in standard software packages. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:14 / 29
页数:16
相关论文
共 50 条
  • [1] Clustering of time-course gene expression data using a mixed-effects model with B-splines
    Luan, YH
    Li, HZ
    [J]. BIOINFORMATICS, 2003, 19 (04) : 474 - 482
  • [2] Clustering of time-course gene expression profiles using normal mixture models with autoregressive random effects
    Kui Wang
    Shu Kay Ng
    Geoffrey J McLachlan
    [J]. BMC Bioinformatics, 13
  • [3] Clustering of time-course gene expression profiles using normal mixture models with autoregressive random effects
    Wang, Kui
    Ng, Shu Kay
    McLachlan, Geoffrey J.
    [J]. BMC BIOINFORMATICS, 2012, 13
  • [4] Classification of longitudinal profiles using semi-parametric nonlinear mixed models with P-Splines and the SAEM algorithm
    Marquez, Maritza
    Meza, Cristian
    Lee, Dae-Jin
    De la Cruz, Rolando
    [J]. STATISTICS IN MEDICINE, 2023, 42 (27) : 4952 - 4971
  • [5] Parsimonious time series clustering using P-splines
    Iorio, Carmela
    Frasso, Gianluca
    D'Ambrosio, Antonio
    Siciliano, Roberta
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2016, 52 : 26 - 38
  • [6] Clustering of time-course gene expression data using functional data analysis
    Song, Joon Jin
    Lee, Ho-Jin
    Morris, Jeffrey S.
    Kang, Sanghoon
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2007, 31 (04) : 265 - 274
  • [7] Improved dynamic predictions from joint models of longitudinal and survival data with time-varying effects using P-splines
    Andrinopoulou, Eleni-Rosalina
    Eilers, Paul H. C.
    Takkenberg, Johanna J. M.
    Rizopoulos, Dimitris
    [J]. BIOMETRICS, 2018, 74 (02) : 685 - 693
  • [8] Time-Course Gene Set Analysis for Longitudinal Gene Expression Data
    Hejblum, Boris P.
    Skinner, Jason
    Thiebaut, Rodolphe
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2015, 11 (06)
  • [9] Bayesian Functional Mixed-effects Models with Grouped Smoothness for Analyzing Time-course Gene Expression Data
    Ye, Shangyuan
    Liang, Ye
    Zhang, Bo
    [J]. CURRENT BIOINFORMATICS, 2021, 16 (01) : 2 - 12
  • [10] A new distance measurement for clustering time-course gene expression data
    Chen, GR
    Dai, Y
    [J]. PROCEEDINGS OF THE 26TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2004, 26 : 2929 - 2932