Machine Learning-Based Analysis of Glioma Grades Reveals Co-Enrichment

Cited by: 3
Authors
Garbulowski, Mateusz [1, 2]
Smolinska, Karolina [1]
Cabuk, Ugur [1, 3, 4]
Yones, Sara A. [1]
Celli, Ludovica [1, 5, 6]
Yaz, Esma Nur [1, 7]
Barrenas, Fredrik [1, 8]
Diamanti, Klev [1, 9]
Wadelius, Claes [9]
Komorowski, Jan [1, 8, 10, 11]
Affiliations
[1] Uppsala Univ, Dept Cell & Mol Biol, S-75237 Uppsala, Sweden
[2] Stockholm Univ, Dept Biochem & Biophys, Sci Life Lab, S-10691 Solna, Sweden
[3] Helmholtz Ctr Polar & Marine Res, Alfred Wegener Inst, Polar Terr Environm Syst, D-14473 Potsdam, Germany
[4] Univ Potsdam, Inst Biochem & Biol, D-14469 Potsdam, Germany
[5] CNR, Inst Mol Genet Luigi Luca Cavalli Sforza, I-27100 Pavia, Italy
[6] Univ Pavia, Dept Biol & Biotechnol, I-27100 Pavia, Italy
[7] Istanbul Medipol Univ, Grad Sch Engn & Nat Sci, Dept Biomed Engn & Bioinformat, TR-34810 Istanbul, Turkey
[8] Washington Natl Primate Res Ctr, Seattle, WA 98195 USA
[9] Uppsala Univ, Dept Immunol Genet & Pathol, S-75185 Uppsala, Sweden
[10] Swedish Coll Adv Study, S-75238 Uppsala, Sweden
[11] Polish Acad Sci, Inst Comp Sci, PL-01248 Warsaw, Poland
Funding
US National Institutes of Health
Keywords
glioma; machine learning; batch effect; TCGA; co-enrichment; rough sets; EXPRESSION; GLIOBLASTOMA; BIOLOGY; AURORA; TUMORS
DOI
10.3390/cancers14041014
Chinese Library Classification
R73 [Oncology]
Subject classification code
100214
Abstract
Simple Summary: Gliomas are heterogeneous types of cancer; therefore, therapy should be personalized and targeted toward specific pathways. We developed a methodology that corrects strong batch effects in The Cancer Genome Atlas datasets and estimates glioma grade-specific co-enrichment mechanisms using machine learning. Our findings generate hypotheses for annotations, e.g., pathways, that should be considered as therapeutic targets.
Gliomas develop and grow in the brain and central nervous system. Examining glioma grading processes is valuable for addressing therapeutic challenges. One of the most extensive repositories storing transcriptomics data for gliomas is The Cancer Genome Atlas (TCGA). However, such large cohorts should be processed with caution and evaluated thoroughly, as they can contain batch and other effects. Furthermore, the biological mechanisms of cancer involve interactions among biomarkers. Thus, we applied an interpretable machine learning approach to discover such relationships. This type of transparent learning not only provides good predictability but also reveals co-predictive mechanisms among features. In this study, we corrected the strong and confounded batch effect in the TCGA glioma data. We then used the corrected datasets to perform a comprehensive machine learning analysis of single-sample gene set enrichment scores computed with collections from the Molecular Signatures Database. Using rule-based classifiers, we displayed networks of co-enrichment related to glioma grades, and we validated our results using external glioma cohorts. We believe that using the corrected TCGA glioma cohorts may improve the application and validation of future studies. Finally, the co-enrichment and survival analyses provided detailed explanations for glioma progression and, consequently, should support targeted treatment.
Pages: 19