Identification of Breast Cancer Metastasis Markers from Gene Expression Profiles Using Machine Learning Approaches

Cited by: 4
Authors
Jung, Jinmyung [1]
Yoo, Sunyong [2]
Affiliations
[1] Univ Suwon, Coll Informat & Commun Technol, Div Data Sci, Hwaseong 18323, South Korea
[2] Chonnam Natl Univ, Dept ICT Convergence Syst Engn, Gwangju 61005, South Korea
Funding
National Research Foundation, Singapore;
Keywords
metastasis marker; gene expression; machine learning; XGBoost; breast cancer; feature importance; PROTEIN; REGULATOR; RESOURCE;
DOI
10.3390/genes14091820
Chinese Library Classification number
Q3 [Genetics];
Discipline classification code
071007; 090102;
Abstract
Cancer metastasis accounts for approximately 90% of cancer deaths, and elucidating markers of metastasis is the first step in its prevention. To characterize metastasis marker genes (MGs) of breast cancer, XGBoost models that classify metastasis status were trained on gene expression profiles from TCGA. A metastasis score (MS) was then assigned to each gene by calculating the inner product between the feature importance and the AUC performance of the models. As a result, the 54, 202, and 357 genes with the highest MS were characterized as MGs at empirical p-value cutoffs of 0.001, 0.005, and 0.01, respectively. The three sets of MGs were compared with those from existing metastasis marker databases, which provided significant results in most comparisons (p-value < 0.05). They were also significantly enriched in biological processes associated with breast cancer metastasis. Three MGs (SPPL2C, KRT23, and RGS7) showed highly significant results (p-value < 0.01) in the survival analysis. The MGs that could not be identified by statistical analysis (e.g., GOLM1, ELAVL1, UBP1, and AZGP1), as well as the MGs with the highest MS (e.g., ZNF676, FAM163B, LDOC2, IRF1, and STK40), were verified via the literature. Additionally, we checked how close the MGs were to each other in protein-protein interaction networks. We expect that the characterized markers will help in understanding and preventing breast cancer metastasis.
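The scoring step described in the abstract can be illustrated with a minimal sketch. It assumes that each trained model contributes one feature-importance vector and one AUC value, so that a gene's MS is the inner product of its importances across models with the models' AUCs. The function names, the toy numbers, and the way the null scores are obtained are all hypothetical; the abstract does not say how the empirical p-values were computed, so the null distribution is simply passed in by the caller here.

```python
import numpy as np


def metastasis_scores(importances, aucs):
    """Per-gene score: inner product of importances across models with model AUCs.

    importances: array of shape (n_models, n_genes), one row per trained model.
    aucs: array of shape (n_models,), the AUC of each model.
    """
    importances = np.asarray(importances, dtype=float)
    aucs = np.asarray(aucs, dtype=float)
    if importances.ndim != 2 or importances.shape[0] != aucs.shape[0]:
        raise ValueError("importances must be (n_models, n_genes) matching aucs")
    return aucs @ importances


def empirical_p_values(scores, null_scores):
    """Fraction of null scores >= each observed score (with +1 smoothing).

    ASSUMPTION: null_scores comes from some reference distribution chosen by
    the caller (for example, scores from label-shuffled models).
    """
    scores = np.asarray(scores, dtype=float)
    null_sorted = np.sort(np.asarray(null_scores, dtype=float))
    n_greater_equal = null_sorted.size - np.searchsorted(null_sorted, scores, side="left")
    return (n_greater_equal + 1) / (null_sorted.size + 1)


def select_markers(genes, scores, null_scores, cutoff):
    """Return genes whose empirical p-value is <= cutoff, highest score first."""
    scores = np.asarray(scores, dtype=float)
    p_values = empirical_p_values(scores, null_scores)
    order = np.argsort(-scores)
    return [genes[i] for i in order if p_values[i] <= cutoff]


if __name__ == "__main__":
    # Toy data: two models, three hypothetical genes.
    genes = ["GENE_A", "GENE_B", "GENE_C"]
    importances = [[0.5, 0.5, 0.0],
                   [0.0, 1.0, 0.0]]
    aucs = [0.8, 0.6]
    null_scores = [0.1, 0.2, 0.3, 0.5]

    ms = metastasis_scores(importances, aucs)
    print(dict(zip(genes, ms)))
    print(select_markers(genes, ms, null_scores, cutoff=0.25))
```

Weighting importances by AUC means that genes favoured by better-performing models receive a higher score than genes favoured only by weak models. In practice the importance rows would come from the trained classifiers rather than from hand-written lists.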
Pages: 11