Integrative Gene Selection on Gene Expression Data: Providing Biological Context to Traditional Approaches

被引:14
|
作者
Perscheid, Cindy [1 ]
Grasnick, Bastien [1 ]
Uflacker, Matthias [1 ]
机构
[1] Univ Potsdam, Hasso Plattner Inst, Digital Engn Fac, Potsdam, Germany
关键词
Gene Expression Data Analysis; Integrative Gene Selection; Pattern Recognition; Prior Knowledge; Knowledge Bases; BREAST-CANCER; CLASSIFICATION; ALGORITHM; ONTOLOGY; FILTER;
D O I
10.1515/jib-2018-0064
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The advance of high-throughput RNA-Sequencing techniques enables researchers to analyze the complete gene activity in particular cells. From the insights of such analyses, researchers can identify disease-specific expression profiles, thus understand complex diseases like cancer, and eventually develop effective measures for diagnosis and treatment. The high dimensionality of gene expression data poses challenges to its computational analysis, which is addressed with measures of gene selection. Traditional gene selection approaches base their findings on statistical analyses of the actual expression levels, which implies several drawbacks when it comes to accurately identifying the underlying biological processes. In turn, integrative approaches include curated information on biological processes from external knowledge bases during gene selection, which promises to lead to better interpretability and improved predictive performance. Our work compares the performance of traditional and integrative gene selection approaches. Moreover, we propose a straightforward approach to integrate external knowledge with traditional gene selection approaches. We introduce a framework enabling the automatic external knowledge integration, gene selection, and evaluation. Evaluation results prove our framework to be a useful tool for evaluation and show that integration of external knowledge improves overall analysis results.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Gene and context: Integrative approaches to genome analysis
    Huynen, MA
    Snel, B
    ADVANCES IN PROTEIN CHEMISTRY, VOL 54: ANALYSIS OF AMINO ACID SEQUENCES, 2000, 54 : 345 - 379
  • [2] Integrating Biological Context into the Analysis of Gene Expression Data
    Perscheid, Cindy
    Uflacker, Matthias
    DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE, 2019, 801 : 339 - 343
  • [3] An Ensemble Approach for Gene Selection in Gene Expression Data
    Castellanos-Garzon, Jose A.
    Ramos, Juan
    Lopez-Sanchez, Daniel
    de Paz, Juan F.
    11TH INTERNATIONAL CONFERENCE ON PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY & BIOINFORMATICS, 2017, 616 : 237 - 247
  • [4] A model for gene selection and classification of gene expression data
    Mohamad M.S.
    Omatu S.
    Deris S.
    Hashim S.Z.M.
    Artificial Life and Robotics, 2007, 11 (2) : 219 - 222
  • [5] Application of Biological Domain Knowledge Based Feature Selection on Gene Expression Data
    Yousef, Malik
    Kumar, Abhishek
    Bakir-Gungor, Burcu
    ENTROPY, 2021, 23 (01) : 1 - 15
  • [6] Gene Selection in Time-Series Gene Expression Data
    Adhikari, Prem Raj
    Upadhyaya, Bimal Babu
    Meng, Chen
    Hollmen, Jaakko
    PATTERN RECOGNITION IN BIOINFORMATICS, 2011, 7036 : 145 - +
  • [7] Gene expression data modeling and validation of gene selection methods
    Ruffino, Francesca
    BIOLOGICAL AND ARTIFICIAL INTELLIGENCE ENVIRONMENTS, 2005, : 73 - 79
  • [8] Quantitative approaches for investigating the spatial context of gene expression
    Lee, Je H.
    WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE, 2017, 9 (02)
  • [9] Feature selection and gene clustering from gene expression data
    Mitra, P
    Majumder, DD
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, 2004, : 343 - 346
  • [10] Integrative analysis of methylation and gene expression data in TCGA
    Liu, Yihua
    Qiu, Peng
    2012 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS (GENSIPS), 2012, : 1 - 4