Evaluation of classification and forecasting methods on time series gene expression data

Cited by: 7
Authors
Tripto, Nafis Irtiza [1]
Kabir, Mohimenul [1]
Bayzid, Md. Shamsuzzoha [1]
Rahman, Atif [1]
Affiliations
[1] Bangladesh Univ Engn & Technol, Dept Comp Sci & Engn, Dhaka, Bangladesh
Source
PLOS ONE | 2020 / Vol. 15 / Issue 11
Keywords
MICROARRAY DATA-ANALYSIS; NETWORKS; SUPPORT; SELECTION; CYCLE;
DOI
10.1371/journal.pone.0241686
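Chinese Library Classification
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Subject classification codes
07 ; 0710 ; 09 ;
Abstract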
Time series gene expression data is widely used to study different dynamic biological processes. Although gene expression datasets share many of the characteristics of time series data from other domains, most of the analyses in this field do not fully leverage the time-ordered nature of the data and focus on clustering the genes based on their expression values. Other domains, such as financial stock and weather prediction, utilize time series data for forecasting purposes. Moreover, many studies have been conducted to classify generic time series data based on trend, seasonality, and other patterns. Therefore, an assessment of these approaches on gene expression data would be of great interest to evaluate their adequacy in this domain. Here, we perform a comprehensive evaluation of different traditional unsupervised and supervised machine learning approaches as well as deep learning-based techniques for time series gene expression classification and forecasting on five real datasets. In addition, we propose deep learning-based methods for both classification and forecasting, and compare their performance with the state-of-the-art methods. We find that deep learning-based methods generally outperform traditional approaches for time series classification. Experiments also suggest that supervised classification on gene expression is more effective than clustering when labels are available. In time series gene expression forecasting, we observe that an autoregressive statistical approach has the best performance for short-term forecasting, whereas deep learning-based methods are better suited for long-term forecasting.
Pages: 17