Deep learning generates custom-made logistic regression models for explaining how breast cancer subtypes are classified

被引:3
|
作者
Shibahara, Takuma [1 ,4 ]
Wada, Chisa [2 ]
Yamashita, Yasuho [1 ]
Fujita, Kazuhiro [2 ]
Sato, Masamichi [2 ]
Kuwata, Junichi [1 ]
Okamoto, Atsushi [2 ]
Ono, Yoshimasa [3 ]
机构
[1] Hitachi Ltd, Res & Dev Grp, Tokyo, Japan
[2] Translat Res Dept Ltd, Bioinformat Grp, Daiichi Sankyo RD Novare Coporat, Tokyo, Japan
[3] Translat Res Dept Ltd, Daiichi Sankyo RD Novare Coporat, Tokyo, Japan
[4] Hitachi Ltd, Cent Res Lab, Tokyo, Japan
来源
PLOS ONE | 2023年 / 18卷 / 05期
关键词
MOLECULAR PORTRAITS; HETEROGENEITY;
D O I
10.1371/journal.pone.0286072
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Differentiating the intrinsic subtypes of breast cancer is crucial for deciding the best treatment strategy. Deep learning can predict the subtypes from genetic information more accurately than conventional statistical methods, but to date, deep learning has not been directly utilized to examine which genes are associated with which subtypes. To clarify the mechanisms embedded in the intrinsic subtypes, we developed an explainable deep learning model called a point-wise linear (PWL) model that generates a custom-made logistic regression for each patient. Logistic regression, which is familiar to both physicians and medical informatics researchers, allows us to analyze the importance of the feature variables, and the PWL model harnesses these practical abilities of logistic regression. In this study, we show that analyzing breast cancer subtypes is clinically beneficial for patients and one of the best ways to validate the capability of the PWL model. First, we trained the PWL model with RNA-seq data to predict PAM50 intrinsic subtypes and applied it to the 41/50 genes of PAM50 through the subtype prediction task. Second, we developed a deep enrichment analysis method to reveal the relationships between the PAM50 subtypes and the copy numbers of breast cancer. Our findings showed that the PWL model utilized genes relevant to the cell cycle-related pathways. These preliminary successes in breast cancer subtype analysis demonstrate the potential of our analysis strategy to clarify the mechanisms underlying breast cancer and improve overall clinical outcomes.
引用
收藏
页数:19
相关论文
共 3 条
  • [1] Time-related survival prediction in molecular subtypes of breast cancer using time-to-event deep-learning-based models
    Shahraki, Saba Zarean
    Looha, Mehdi Azizmohammad
    Kazaj, Pooya Mohammadi
    Aria, Mehrad
    Akbari, Atieh
    Emami, Hassan
    Asadi, Farkhondeh
    Akbari, Mohammad Esmaeil
    FRONTIERS IN ONCOLOGY, 2023, 13
  • [2] The features associated with mammography-occult MRI-detected newly diagnosed breast cancer analysed by comparing machine learning models with a logistic regression model
    Yang, Wei
    Yang, Yan
    Zhang, Ningmei
    Yin, Qingyun
    Zhang, Chaolin
    Han, Jinyu
    Zhou, Xiaoping
    Liu, Kaihui
    RADIOLOGIA MEDICA, 2024, 129 (05): : 751 - 766
  • [3] The features associated with mammography-occult MRI-detected newly diagnosed breast cancer analysed by comparing machine learning models with a logistic regression model
    Wei Yang
    Yan Yang
    Ningmei Zhang
    Qingyun Yin
    Chaolin Zhang
    Jinyu Han
    Xiaoping Zhou
    Kaihui Liu
    La radiologia medica, 2024, 129 : 751 - 766