Comprehensive Pan-cancer Gene Signature Assessment through the Implementation of a Cascade Machine Learning System

被引:1
|
作者
Castillo-Secilla, Daniel [1 ,2 ]
Galvez, Juan Manuel [2 ]
Carrillo-Perez, Francisco [2 ]
Prieto-Prieto, Juan Carlos [3 ]
Valenzuela, Olga [4 ]
Herrera, Luis Javier [2 ]
Rojas, Ignacio [2 ]
机构
[1] Fujitsu Technol Solut SA, CoE Data Intelligence, Camino Cerro Gamos, 1, Madrid 28224, Spain
[2] Univ Granada, Dept Comp Architecture & Technol, CITIC, Periodista Rafael Gomez Montero, 2, Granada 18014, Spain
[3] Univ Hosp Reina Sofia, Nucl Med Dept, IMIBIC, Menendez Pidal Ave, Cordoba 14004, Spain
[4] Univ Granada, Fac Ciencias, Dept Appl Math, Campus Fuentenueva, Granada 18071, Spain
关键词
Pan-cancer; RNA-seq; TCGA; gene expression; machine learning; feature selection; CDSS; THERAPEUTIC TARGET; CELL-PROLIFERATION; PROSTATE-CANCER; POOR-PROGNOSIS; GASTRIC-CANCER; EXPRESSION; PROGRESSION; DERMATOPONTIN; METHYLATION; CARCINOMA;
D O I
10.2174/1574893617666220421100512
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background Despite all the medical advances introduced for personalized patient treatment and the research supported in search of genetic patterns inherent to the occurrence of its different manifestations on the human being, the unequivocal and effective treatment of cancer, unfortunately, remains as an unresolved challenge within the scientific panorama. Until a universal solution for its control is achieved, early detection mechanisms for preventative diagnosis increasingly avoid treatments, resulting in unreliable effectiveness. The discovery of unequivocal gene patterns allowing us to discern between multiple pathological states could help shed light on patients suspected of an oncological disease but with uncertainty in the histological and immunohistochemical results.Methods This study presents an approach for pan-cancer diagnosis based on gene expression analysis that determines a reduced set of 12 genes, making it possible to distinguish between the main 14 cancer diseases.Results Our cascade machine learning process has been robustly designed, obtaining a mean F1 score of 92% and a mean AUC of 99.37% in the test set. Our study showed heterogeneous over-or underexpression of the analyzed genes, which can act as oncogenes or tumor suppressor genes. Upregulation of LPAR5 and PAX8 was demonstrated in thyroid cancer samples. KLF5 was highly expressed in the majority of cancer types.Conclusion Our model constituted a useful tool for pan-cancer gene expression evaluation. In addition to providing biological clues about a hypothetical common origin of cancer, the scalability of this study promises to be very useful for future studies to reinforce, confirm, and extend the biological observations presented here. Code availability and datasets are stored in the following GitHub repository to aim for the research reproducibility: .
引用
收藏
页码:40 / 54
页数:15
相关论文
共 50 条
  • [1] Towards a potential pan-cancer prognostic signature for gene expression based on probesets and ensemble machine learning
    Davide Chicco
    Abbas Alameer
    Sara Rahmati
    Giuseppe Jurman
    [J]. BioData Mining, 15
  • [2] Towards a potential pan-cancer prognostic signature for gene expression based on probesets and ensemble machine learning
    Chicco, Davide
    Alameer, Abbas
    Rahmati, Sara
    Jurman, Giuseppe
    [J]. BIODATA MINING, 2022, 15 (01)
  • [3] Constructing diagnostic signature of serum microRNAs using machine learning for early pan-cancer detection
    Xu, Yuyan
    Liao, Wei
    Chen, Huanwei
    Pan, Mingxin
    [J]. DISCOVER ONCOLOGY, 2024, 15 (01)
  • [4] Comprehensive study of gene expression outliers and their regulation mechanisms in pan-cancer
    Han, Jee Yun
    Eng, Stefan
    Arbet, Jaron
    Boutros, Paul
    [J]. CANCER RESEARCH, 2023, 83 (07)
  • [5] A comprehensive analysis of gasdermin family gene as therapeutic targets in pan-cancer
    Huo, Cheng-Long
    Deng, Yan
    Sun, Zhen-Gang
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [6] A comprehensive analysis of gasdermin family gene as therapeutic targets in pan-cancer
    Cheng-Long Huo
    Yan Deng
    Zhen-Gang Sun
    [J]. Scientific Reports, 12
  • [7] Machine learning for pan-cancer classification based on RNA sequencing data
    Stancl, Paula
    Karlic, Rosa
    [J]. FRONTIERS IN MOLECULAR BIOSCIENCES, 2023, 10
  • [8] A Comprehensive Pan-Cancer Analysis for Pituitary Tumor-Transforming Gene 1
    Gong, Siming
    Wu, Changwu
    Duan, Yingjuan
    Tang, Juyu
    Wu, Panfeng
    [J]. FRONTIERS IN GENETICS, 2022, 13
  • [9] Pan-cancer analysis of cuproptosis-promoting gene signature from multiple perspectives
    Jincheng Ma
    Baocheng Gong
    Qiang Zhao
    [J]. Clinical and Experimental Medicine, 2023, 23 : 4997 - 5014
  • [10] Exploiting convergent phenotypes to derive a pan-cancer cisplatin response gene expression signature
    Scarborough, Jessica A. A.
    Eschrich, Steven A. A.
    Torres-Roca, Javier
    Dhawan, Andrew
    Scott, Jacob G. G.
    [J]. NPJ PRECISION ONCOLOGY, 2023, 7 (01)