A Convolutional Deep Clustering Framework for Gene Expression Time Series

被引:7
|
作者
Ozgul, Ozan Frat [1 ]
Bardak, Batuhan [1 ]
Tan, Mehmet [1 ]
机构
[1] TOBB Univ Econ & Technol, Dept Comp Engn, TR-06510 Ankara, Turkey
关键词
Time series analysis; Gene expression; Machine learning; Clustering algorithms; Biological system modeling; Trajectory; Biological information theory; clustering; recurrence plots; deep learning; NF-KAPPA-B; HELICOBACTER-PYLORI; RECURRENCE PLOT;
D O I
10.1109/TCBB.2020.2988985
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The functional or regulatory processes within the cell are explicitly governed by the expression levels of a subset of its genes. Gene expression time series captures activities of individual genes over time and aids revealing underlying cellular dynamics. An important step in high-throughput gene expression time series experiment is clustering genes based on their temporal expression patterns and is conventionally achieved by unsupervised machine learning techniques. However, most of the clustering techniques either suffer from the short length of gene expression time series or ignore temporal structure of the data. In this work, we propose DeepTrust, a novel deep learning-based framework for gene expression time series clustering which can overcome these issues. DeepTrust initially transforms time series data into images to obtain richer data representations. Afterwards, a deep convolutional clustering algorithm is applied on the constructed images. Analyses on both simulated and biological data sets exhibit the efficiency of this new framework, compared to widely used clustering techniques. We also utilize enrichment analyses to illustrate the biological plausibility of the clusters detected by DeepTrust. Our code and data are available from http://github.com/tanlab/DeepTrust.
引用
收藏
页码:2198 / 2207
页数:10
相关论文
共 50 条
  • [1] Deep Convolutional Clustering-Based Time Series Anomaly Detection
    Chadha, Gavneet Singh
    Islam, Intekhab
    Schwung, Andreas
    Ding, Steven X.
    SENSORS, 2021, 21 (16)
  • [2] Clustering short time series gene expression data
    Ernst, J
    Nau, GJ
    Bar-Joseph, Z
    BIOINFORMATICS, 2005, 21 : I159 - I168
  • [3] TimeClust: a clustering tool for gene expression time series
    Magni, Paolo
    Ferrazzi, Fulvia
    Sacchi, Lucia
    Bellazzi, Riccardo
    BIOINFORMATICS, 2008, 24 (03) : 430 - 432
  • [4] Interpolation based consensus clustering for gene expression time series
    Chiu, Tai-Yu
    Hsu, Ting-Chieh
    Yen, Chia-Cheng
    Wang, Jia-Shung
    BMC BIOINFORMATICS, 2015, 16
  • [5] Constrained Subspace Clustering for Time Series Gene Expression Data
    Qu, Jibin
    Ng, Michael
    Chen, Luonan
    COMPUTATIONAL SYSTEMS BIOLOGY, 2010, 13 : 323 - +
  • [6] Interpolation based consensus clustering for gene expression time series
    Tai-Yu Chiu
    Ting-Chieh Hsu
    Chia-Cheng Yen
    Jia-Shung Wang
    BMC Bioinformatics, 16
  • [7] Time series clustering with random convolutional kernels
    Jorge, Marco-Blanco
    Ruben, Cuevas
    DATA MINING AND KNOWLEDGE DISCOVERY, 2024, 38 (04) : 1862 - 1888
  • [8] Clustering Time-Series Gene Expression Data with Unequal Time Intervals
    Rueda, Luis
    Bari, Ataul
    Ngom, Alioune
    TRANSACTIONS ON COMPUTATIONAL SYSTEMS BIOLOGY X, 2008, 5410 : 100 - 123
  • [9] Integration of heterogeneous time series gene expression data by clustering on time dimension
    Ahn, Hongryul
    Chae, Heejoon
    Jung, Woosuk
    Kim, Sun
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2017, : 332 - 335
  • [10] Phase-wise Clustering of Time Series Gene Expression Data
    Goyal, Poonam
    Karwa, Rohan Sunil
    Goyal, Navneet
    John, Matthew
    TRUSTCOM 2011: 2011 INTERNATIONAL JOINT CONFERENCE OF IEEE TRUSTCOM-11/IEEE ICESS-11/FCST-11, 2011, : 1668 - 1674