A Topological Data Analysis Approach on Predicting Phenotypes from Gene Expression Data

Cited by: 10
Authors
Mandal, Sayan [1]
Guzman-Saenz, Aldo [2]
Haiminen, Niina [2]
Basu, Saugata [3]
Parida, Laxmi [2]
Affiliations
[1] Ohio State Univ, Columbus, OH 43210 USA
[2] TJ Watson Res Ctr, IBM Res, Yorktown Hts, NY 10598 USA
[3] Purdue Univ, W Lafayette, IN 47907 USA
Keywords
Topological data analysis; Gene expression; Phenotype prediction; Parkinson's disease; Persistent homology
DOI
10.1007/978-3-030-42266-0_14
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
The goal of this study was to investigate if gene expression measured from RNA sequencing contains enough signal to separate healthy and afflicted individuals in the context of phenotype prediction. We observed that standard machine learning methods alone performed somewhat poorly on the disease phenotype prediction task; therefore we devised an approach augmenting machine learning with topological data analysis. We describe a framework for predicting phenotype values by utilizing gene expression data transformed into sample-specific topological signatures by employing feature subsampling and persistent homology. The topological data analysis approach developed in this work yielded improved results on Parkinson's disease phenotype prediction when measured against standard machine learning methods. This study confirms that gene expression can be a useful indicator of the presence or absence of a condition, and the subtle signal contained in this high dimensional data reveals itself when considering the intricate topological connections between expressed genes.
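The abstract names two ingredients, feature subsampling and persistent homology, without giving details. The sketch below is one possible reading, not the authors' pipeline: it assumes each sample's expression vector is turned into a point cloud by drawing random gene subsets, and it computes only 0-dimensional persistence (whose death times equal the minimum spanning tree edge lengths) using the standard library. The names `h0_deaths` and `topological_signature`, and the parameters `n_points`, `subset_size`, and `seed`, are hypothetical.

```python
import math
import random


def h0_deaths(points):
    """Death times of 0-dimensional Vietoris-Rips classes (all born at 0).

    These equal the edge lengths of a minimum spanning tree, found here
    with Kruskal's algorithm and a union-find structure.
    """
    n = len(points)
    edges = sorted(
        (math.dist(points[i], points[j]), i, j)
        for i in range(n)
        for j in range(i + 1, n)
    )
    parent = list(range(n))

    def find(a):
        while parent[a] != a:
            parent[a] = parent[parent[a]]  # path halving
            a = parent[a]
        return a

    deaths = []
    for length, i, j in edges:
        ri, rj = find(i), find(j)
        if ri != rj:  # this edge merges two components, so one class dies
            parent[ri] = rj
            deaths.append(length)
    return deaths  # n - 1 finite values in ascending order


def topological_signature(expression, n_points=20, subset_size=5, seed=0):
    """Assumed construction: one point per random gene subset of a sample."""
    rng = random.Random(seed)  # fixed seed: same gene subsets for every sample
    genes = range(len(expression))
    cloud = [
        [expression[g] for g in rng.sample(genes, subset_size)]
        for _ in range(n_points)
    ]
    return h0_deaths(cloud)
```

Under these assumptions, calling `topological_signature` on each sample with the same seed yields fixed-length vectors (here `n_points - 1` values) that could be passed to any standard classifier. Higher-dimensional persistence would need a dedicated library and is not shown.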
Pages: 178-187
Page count: 10