A Convolutional Deep Clustering Framework for Gene Expression Time Series

被引:7
|
作者
Ozgul, Ozan Frat [1 ]
Bardak, Batuhan [1 ]
Tan, Mehmet [1 ]
机构
[1] TOBB Univ Econ & Technol, Dept Comp Engn, TR-06510 Ankara, Turkey
关键词
Time series analysis; Gene expression; Machine learning; Clustering algorithms; Biological system modeling; Trajectory; Biological information theory; clustering; recurrence plots; deep learning; NF-KAPPA-B; HELICOBACTER-PYLORI; RECURRENCE PLOT;
D O I
10.1109/TCBB.2020.2988985
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The functional or regulatory processes within the cell are explicitly governed by the expression levels of a subset of its genes. Gene expression time series captures activities of individual genes over time and aids revealing underlying cellular dynamics. An important step in high-throughput gene expression time series experiment is clustering genes based on their temporal expression patterns and is conventionally achieved by unsupervised machine learning techniques. However, most of the clustering techniques either suffer from the short length of gene expression time series or ignore temporal structure of the data. In this work, we propose DeepTrust, a novel deep learning-based framework for gene expression time series clustering which can overcome these issues. DeepTrust initially transforms time series data into images to obtain richer data representations. Afterwards, a deep convolutional clustering algorithm is applied on the constructed images. Analyses on both simulated and biological data sets exhibit the efficiency of this new framework, compared to widely used clustering techniques. We also utilize enrichment analyses to illustrate the biological plausibility of the clusters detected by DeepTrust. Our code and data are available from http://github.com/tanlab/DeepTrust.
引用
收藏
页码:2198 / 2207
页数:10
相关论文
共 50 条
  • [21] An unsupervised conditional random fields approach for clustering gene expression time series
    Li, Chang-Tsun
    Yuan, Yinyin
    Wilson, Roland
    BIOINFORMATICS, 2008, 24 (21) : 2467 - 2473
  • [22] E-MFDBSCAN: an evolutionary clustering algorithm for gene expression time series
    Erdem, Atakan
    Gundem, Taflan Imre
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2017, 25 (04) : 3443 - 3454
  • [23] Hybrid Attentive Graph Neural Networks for time series gene expression clustering
    Giovanoudi, Eleni
    Rafailidis, Dimitrios
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 277
  • [24] Clustering gene expression series with prior knowledge
    Bréhélin, L
    ALGORITHMS IN BIOINFORMATICS, PROCEEDINGS, 2005, 3692 : 27 - 38
  • [25] A New Computational Framework for Gene Expression Clustering
    Kasim, Shahreen
    Deris, Safaai
    Othman, Razib M.
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2010, PT I, 2010, 6440 : 603 - 610
  • [26] Topology Identification of Low-Voltage Distribution Network Based on Deep Convolutional Time-Series Clustering
    Ni, Qingzhong
    Jiang, Hui
    ENERGIES, 2023, 16 (11)
  • [27] A deep learning framework for time series classification using Relative Position Matrix and Convolutional Neural Network
    Chen, Wei
    Shi, Ke
    NEUROCOMPUTING, 2019, 359 : 384 - 394
  • [28] Clustering time series gene expression data based on sum-of-exponentials fitting
    Giurcaneanu, CD
    Tabus, L
    Astola, J
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2005, 2005 (08) : 1159 - 1173
  • [29] Clustering for time-series gene expression data using mixture of constrained PCAS
    Yoshioka, T
    Ishii, S
    ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING: COMPUTATIONAL INTELLIGENCE FOR THE E-AGE, 2002, : 2239 - 2243
  • [30] Multiple gene expression profile alignment for microarray time-series data clustering
    Subhani, Numanul
    Rueda, Luis
    Ngom, Alioune
    Burden, Conrad J.
    BIOINFORMATICS, 2010, 26 (18) : 2281 - 2288