Machine Learning-Based Analysis of Glioma Grades Reveals Co-Enrichment

被引:3
|
作者
Garbulowski, Mateusz [1 ,2 ]
Smolinska, Karolina [1 ]
Cabuk, Ugur [1 ,3 ,4 ]
Yones, Sara A. [1 ]
Celli, Ludovica [1 ,5 ,6 ]
Yaz, Esma Nur [1 ,7 ]
Barrenas, Fredrik [1 ,8 ]
Diamanti, Klev [1 ,9 ]
Wadelius, Claes [9 ]
Komorowski, Jan [1 ,8 ,10 ,11 ]
机构
[1] Uppsala Univ, Dept Cell & Mol Biol, S-75237 Uppsala, Sweden
[2] Stockholm Univ, Dept Biochem & Biophys, Sci Life Lab, S-10691 Solna, Sweden
[3] Helmholtz Ctr Polar & Marine Res, Alfred Wegener Inst, Polar Terr Environm Syst, D-14473 Potsdam, Germany
[4] Univ Potsdam, Inst Biochem & Biol, D-14469 Potsdam, Germany
[5] CNR, Inst Mol Genet Luigi Luca Cavalli Sforza, I-27100 Pavia, Italy
[6] Univ Pavia, Dept Biol & Biotechnol, I-27100 Pavia, Italy
[7] Istanbul Medipol Univ, Grad Sch Engn & Nat Sci, Dept Biomed Engn & Bioinformat, TR-34810 Istanbul, Turkey
[8] Washington Natl Primate Res Ctr, Seattle, WA 98195 USA
[9] Uppsala Univ, Dept Immunol Genet & Pathol, S-75185 Uppsala, Sweden
[10] Swedish Coll Adv Study, S-75238 Uppsala, Sweden
[11] Polish Acad Sci, Inst Comp Sci, PL-01248 Warsaw, Poland
基金
美国国家卫生研究院;
关键词
glioma; machine learning; batch effect; TCGA; co-enrichment; rough sets; EXPRESSION; GLIOBLASTOMA; BIOLOGY; AURORA; TUMORS;
D O I
10.3390/cancers14041014
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Simple Summary Gliomas are heterogenous types of cancer, therefore the therapy should be personalized and targeted toward specific pathways. We developed a methodology that corrected strong batch effects from The Cancer Genome Atlas datasets and estimated glioma grade-specific co-enrichment mechanisms using machine learning. Our findings created hypotheses for annotations, e.g., pathways, that should be considered as therapeutic targets. Gliomas develop and grow in the brain and central nervous system. Examining glioma grading processes is valuable for improving therapeutic challenges. One of the most extensive repositories storing transcriptomics data for gliomas is The Cancer Genome Atlas (TCGA). However, such big cohorts should be processed with caution and evaluated thoroughly as they can contain batch and other effects. Furthermore, biological mechanisms of cancer contain interactions among biomarkers. Thus, we applied an interpretable machine learning approach to discover such relationships. This type of transparent learning provides not only good predictability, but also reveals co-predictive mechanisms among features. In this study, we corrected the strong and confounded batch effect in the TCGA glioma data. We further used the corrected datasets to perform comprehensive machine learning analysis applied on single-sample gene set enrichment scores using collections from the Molecular Signature Database. Furthermore, using rule-based classifiers, we displayed networks of co-enrichment related to glioma grades. Moreover, we validated our results using the external glioma cohorts. We believe that utilizing corrected glioma cohorts from TCGA may improve the application and validation of any future studies. Finally, the co-enrichment and survival analysis provided detailed explanations for glioma progression and consequently, it should support the targeted treatment.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Machine learning-based classification of quality grades for concrete vibration behaviour
    Fan, Shuai
    He, Tao
    Li, Weihao
    Zeng, Chuang
    Chen, Peng
    Chen, Lufeng
    Shu, Jiangpeng
    AUTOMATION IN CONSTRUCTION, 2024, 167
  • [2] Semantic Enrichment of BIM: The Role of Machine Learning-Based Image Recognition
    Mirarchi, Claudio
    Gholamzadehmir, Maryam
    Daniotti, Bruno
    Pavan, Alberto
    BUILDINGS, 2024, 14 (04)
  • [3] An Analysis of Machine Learning-Based Semantic Matchmaking
    Karabulut, Erkan
    Sofia, Rute C. C.
    IEEE ACCESS, 2023, 11 : 27829 - 27842
  • [4] Machine learning-based analysis of historical towers
    Dabiri, Hamed
    Clementi, Jessica
    Marini, Roberta
    Mugnozza, Gabriele Scarascia
    Bozzano, Francesca
    Mazzanti, Paolo
    ENGINEERING STRUCTURES, 2024, 304
  • [5] Multivariate machine learning-based language mapping in glioma patients based on lesion topography
    Zhang, Nan
    Yuan, Binke
    Yan, Jing
    Cheng, Jingliang
    Lu, Junfeng
    Wu, Jinsong
    BRAIN IMAGING AND BEHAVIOR, 2021, 15 (05) : 2552 - 2562
  • [6] Machine Learning-based Software Effort Estimation : An Analysis
    Polkowski, Zdzislaw
    Vora, Jayneel
    Tanwar, Sudeep
    Tyagi, Sudhanshu
    Singh, Pradeep Kumar
    Singh, Yashwant
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTERS AND ARTIFICIAL INTELLIGENCE (ECAI-2019), 2019,
  • [7] Machine learning-based analysis of adolescent gambling factors
    Seo, Wonju
    Kim, Namho
    Lee, Sang-Kyu
    Park, Sung-Min
    JOURNAL OF BEHAVIORAL ADDICTIONS, 2020, 9 (03) : 734 - 743
  • [8] MACHINE LEARNING-BASED ANALYSIS OF ENGLISH LATERAL ALLOPHONES
    Piotrowska, Magdalena
    Korvel, Grazina
    Kostek, Bozena
    Ciszewski, Tomasz
    Czyzewski, Andrzej
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2019, 29 (02) : 393 - 405
  • [9] Machine Learning-Based Framework for the Analysis of Project Viability
    Tshimula, Jean Marie
    Togashi, Atsushi
    PROCEEDINGS OF 2018 3RD INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS), 2018, : 80 - 84
  • [10] Machine Learning-Based Queueing Time Analysis in XGPON
    Ismail, N. A.
    Idrus, S. M.
    Iqbal, F.
    Zin, A. M.
    Atan, F.
    Ali, N.
    INTERNATIONAL JOURNAL OF NANOELECTRONICS AND MATERIALS, 2021, 14 : 157 - 163