Machine Learning-Based Analysis of Glioma Grades Reveals Co-Enrichment

被引:3
|
作者
Garbulowski, Mateusz [1 ,2 ]
Smolinska, Karolina [1 ]
Cabuk, Ugur [1 ,3 ,4 ]
Yones, Sara A. [1 ]
Celli, Ludovica [1 ,5 ,6 ]
Yaz, Esma Nur [1 ,7 ]
Barrenas, Fredrik [1 ,8 ]
Diamanti, Klev [1 ,9 ]
Wadelius, Claes [9 ]
Komorowski, Jan [1 ,8 ,10 ,11 ]
机构
[1] Uppsala Univ, Dept Cell & Mol Biol, S-75237 Uppsala, Sweden
[2] Stockholm Univ, Dept Biochem & Biophys, Sci Life Lab, S-10691 Solna, Sweden
[3] Helmholtz Ctr Polar & Marine Res, Alfred Wegener Inst, Polar Terr Environm Syst, D-14473 Potsdam, Germany
[4] Univ Potsdam, Inst Biochem & Biol, D-14469 Potsdam, Germany
[5] CNR, Inst Mol Genet Luigi Luca Cavalli Sforza, I-27100 Pavia, Italy
[6] Univ Pavia, Dept Biol & Biotechnol, I-27100 Pavia, Italy
[7] Istanbul Medipol Univ, Grad Sch Engn & Nat Sci, Dept Biomed Engn & Bioinformat, TR-34810 Istanbul, Turkey
[8] Washington Natl Primate Res Ctr, Seattle, WA 98195 USA
[9] Uppsala Univ, Dept Immunol Genet & Pathol, S-75185 Uppsala, Sweden
[10] Swedish Coll Adv Study, S-75238 Uppsala, Sweden
[11] Polish Acad Sci, Inst Comp Sci, PL-01248 Warsaw, Poland
基金
美国国家卫生研究院;
关键词
glioma; machine learning; batch effect; TCGA; co-enrichment; rough sets; EXPRESSION; GLIOBLASTOMA; BIOLOGY; AURORA; TUMORS;
D O I
10.3390/cancers14041014
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Simple Summary Gliomas are heterogenous types of cancer, therefore the therapy should be personalized and targeted toward specific pathways. We developed a methodology that corrected strong batch effects from The Cancer Genome Atlas datasets and estimated glioma grade-specific co-enrichment mechanisms using machine learning. Our findings created hypotheses for annotations, e.g., pathways, that should be considered as therapeutic targets. Gliomas develop and grow in the brain and central nervous system. Examining glioma grading processes is valuable for improving therapeutic challenges. One of the most extensive repositories storing transcriptomics data for gliomas is The Cancer Genome Atlas (TCGA). However, such big cohorts should be processed with caution and evaluated thoroughly as they can contain batch and other effects. Furthermore, biological mechanisms of cancer contain interactions among biomarkers. Thus, we applied an interpretable machine learning approach to discover such relationships. This type of transparent learning provides not only good predictability, but also reveals co-predictive mechanisms among features. In this study, we corrected the strong and confounded batch effect in the TCGA glioma data. We further used the corrected datasets to perform comprehensive machine learning analysis applied on single-sample gene set enrichment scores using collections from the Molecular Signature Database. Furthermore, using rule-based classifiers, we displayed networks of co-enrichment related to glioma grades. Moreover, we validated our results using the external glioma cohorts. We believe that utilizing corrected glioma cohorts from TCGA may improve the application and validation of any future studies. Finally, the co-enrichment and survival analysis provided detailed explanations for glioma progression and consequently, it should support the targeted treatment.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] Machine Learning-Based Multifunctional Optical Spectrum Analysis Technique
    Wang, Danshi
    Zhang, Min
    Zhang, Zhiguo
    Li, Jin
    Gao, Hui
    Zhang, Fan
    Chen, Xue
    IEEE ACCESS, 2019, 7 : 19726 - 19737
  • [32] Supervised Machine Learning-Based Cardiovascular Disease Analysis and Prediction
    Hossen, M. D. Amzad
    Tazin, Tahia
    Khan, Sumiaya
    Alam, Evan
    Sojib, Hossain Ahmed
    Khan, Mohammad Monirujjaman
    Alsufyani, Abdulmajeed
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [33] A Machine Learning-based Approach for Automated Vulnerability Remediation Analysis
    Zhang, Fengli
    Huff, Philip
    McClanahan, Kylie
    Li, Qinghua
    2020 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2020,
  • [34] Machine Learning-Based Solution for Thermomechanical Analysis of MMIC Packaging
    Kang, Sumin
    Lee, Jae Hak
    Kim, Seung Man
    Lim, Jaeseung
    Park, Ah-Young
    Han, Seongheum
    Song, Jun-Yeob
    Kim, Seong-Il
    ADVANCED MATERIALS TECHNOLOGIES, 2023, 8 (05)
  • [35] Machine Learning-Based Analysis of Sustainable Biochar Production Processes
    Cosgun, Ahmet
    Oral, Burcu
    Gunay, M. Erdem
    Yildirim, Ramazan
    BIOENERGY RESEARCH, 2024, 17 (04) : 2311 - 2327
  • [36] Machine Learning-Based Bibliometric Analysis of Systems Thinking Research
    Oosthuizen, Rudolph
    Grobbelaar, Schalk
    Proceedings of the 29th International Conference on Engineering, Technology, and Innovation: Shaping the Future, ICE 2023, 2023,
  • [37] Machine Learning-Based Analysis of Program Binaries: A Comprehensive Study
    Xue, Hongfa
    Sun, Shaowen
    Venkataramani, Guru
    Lan, Tian
    IEEE ACCESS, 2019, 7 : 65889 - 65912
  • [38] A machine learning-based approach to prognostic analysis of thoracic transplantations
    Delen, Dursun
    Oztekin, Asil
    Kong, Zhenyu
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2010, 49 (01) : 33 - 42
  • [39] Machine learning-based cytokine microarray digital immunoassay analysis
    Song, Yujing
    Zhao, Jingyang
    Cai, Tao
    Stephens, Andrew
    Su, Shiuan-Haur
    Sandford, Erin
    Flora, Christopher
    Singer, Benjamin H.
    Ghosh, Monalisa
    Choi, Sung Won
    Tewari, Muneesh
    Kurabayashi, Katsuo
    BIOSENSORS & BIOELECTRONICS, 2021, 180
  • [40] Machine learning-based methods in structural reliability analysis: A review
    Afshari, Sajad Saraygord
    Enayatollahi, Fatemeh
    Xu, Xiangyang
    Liang, Xihui
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2022, 219