Integrative Gene Selection on Gene Expression Data: Providing Biological Context to Traditional Approaches

被引:14
|
作者
Perscheid, Cindy [1 ]
Grasnick, Bastien [1 ]
Uflacker, Matthias [1 ]
机构
[1] Univ Potsdam, Hasso Plattner Inst, Digital Engn Fac, Potsdam, Germany
关键词
Gene Expression Data Analysis; Integrative Gene Selection; Pattern Recognition; Prior Knowledge; Knowledge Bases; BREAST-CANCER; CLASSIFICATION; ALGORITHM; ONTOLOGY; FILTER;
D O I
10.1515/jib-2018-0064
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The advance of high-throughput RNA-Sequencing techniques enables researchers to analyze the complete gene activity in particular cells. From the insights of such analyses, researchers can identify disease-specific expression profiles, thus understand complex diseases like cancer, and eventually develop effective measures for diagnosis and treatment. The high dimensionality of gene expression data poses challenges to its computational analysis, which is addressed with measures of gene selection. Traditional gene selection approaches base their findings on statistical analyses of the actual expression levels, which implies several drawbacks when it comes to accurately identifying the underlying biological processes. In turn, integrative approaches include curated information on biological processes from external knowledge bases during gene selection, which promises to lead to better interpretability and improved predictive performance. Our work compares the performance of traditional and integrative gene selection approaches. Moreover, we propose a straightforward approach to integrate external knowledge with traditional gene selection approaches. We introduce a framework enabling the automatic external knowledge integration, gene selection, and evaluation. Evaluation results prove our framework to be a useful tool for evaluation and show that integration of external knowledge improves overall analysis results.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] CONTEXT-SPECIFIC GENE REGULATIONS IN CANCER GENE EXPRESSION DATA
    Sen, Ina
    Verdicchio, Michael P.
    Jung, Sungwon
    Trevino, Robert
    Bittner, Michael
    Kim, Seungchan
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2009, 2009, : 75 - +
  • [22] Gene expression database scales up, providing baseline data
    Susan Matthews
    Nature Medicine, 2013, 19 : 799 - 799
  • [23] Gene expression database scales up, providing baseline data
    Matthews, Susan
    NATURE MEDICINE, 2013, 19 (07) : 799 - 799
  • [24] Exploiting the Accumulated Evidence for Gene Selection in Microarray Gene Expression Data
    Prat-Masramon, Gabriel
    Belanche-Munoz, Lluis A.
    ECAI 2010 - 19TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2010, 215 : 989 - +
  • [25] A semiparametric approach for marker gene selection based on gene expression data
    Guan, Z
    Zhao, HY
    BIOINFORMATICS, 2005, 21 (04) : 529 - 536
  • [26] Evolutionary Tolerance-Based Gene Selection in Gene Expression Data
    Jiao, Na
    TRANSACTIONS ON ROUGH SETS XIV, 2011, 6600 : 100 - 118
  • [27] Efficient gene selection with rough sets from gene expression data
    Sun, Lijun
    Miao, Duoqian
    Zhang, Hongyun
    ROUGH SETS AND KNOWLEDGE TECHNOLOGY, 2008, 5009 : 164 - +
  • [28] DATA MINING METHODS FOR GENE SELECTION ON THE BASIS OF GENE EXPRESSION ARRAYS
    Muszynski, Michal
    Osowski, Stanislaw
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2014, 24 (03) : 657 - 668
  • [29] A blocking strategy to improve gene selection for classification of gene expression data
    Bontempi, Gianluca
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2007, 4 (02) : 293 - 300
  • [30] Gene Selection Using High Dimensional Gene Expression Data: An Appraisal
    Bhola, Abhishek
    Singh, Shailendra
    CURRENT BIOINFORMATICS, 2018, 13 (03) : 225 - 233