Predicting breast cancer risk using personal health data and machine learning models

Cited by: 65
Authors
Stark, Gigi F. [1]
Hart, Gregory R. [1]
Nartowt, Bradley J. [1]
Deng, Jun [1]
Affiliations
[1] Yale Univ, Dept Therapeut Radiol, New Haven, CT 06520 USA
Source
PLOS ONE | 2019 / Vol. 14 / No. 12
Funding
National Institutes of Health (USA);
Keywords
VALIDATION;
DOI
10.1371/journal.pone.0226765
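Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code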
07 ; 0710 ; 09 ;
Abstract
Among women, breast cancer is a leading cause of death. Breast cancer risk predictions can inform screening and preventative actions. Previous works found that adding inputs to the widely used Gail model improved its ability to predict breast cancer risk. However, these models used simple statistical architectures and the additional inputs were derived from costly and/or invasive procedures. By contrast, we developed machine learning models that used highly accessible personal health data to predict five-year breast cancer risk. We created machine learning models using only the Gail model inputs and models using both Gail model inputs and additional personal health data relevant to breast cancer risk. For both sets of inputs, six machine learning models were trained and evaluated on the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial data set. The area under the receiver operating characteristic curve metric quantified each model's performance. Since this data set has a small percentage of positive breast cancer cases, we also reported sensitivity, specificity, and precision. We used DeLong tests (p < 0.05) to compare the testing data set performance of each machine learning model to that of the Breast Cancer Risk Prediction Tool (BCRAT), an implementation of the Gail model. None of the machine learning models with only BCRAT inputs were significantly stronger than the BCRAT. However, the logistic regression, linear discriminant analysis, and neural network models with the broader set of inputs were all significantly stronger than the BCRAT. These results suggest that relative to the BCRAT, additional easy-to-obtain personal health inputs can improve five-year breast cancer risk prediction.
Our models could be used as non-invasive and cost-effective risk stratification tools to increase early breast cancer detection and prevention, motivating both immediate actions like screening and long-term preventative measures such as hormone replacement therapy and chemoprevention.
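The evaluation metrics named in the abstract (area under the ROC curve, sensitivity, specificity, and precision) can be illustrated with a minimal Python sketch. This is not the authors' code: the function names `roc_auc` and `threshold_metrics`, the labels, the risk scores, and the 0.065 threshold are all hypothetical values chosen to mimic a data set with few positive cases.

```python
from typing import Dict, Sequence


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def threshold_metrics(
    y_true: Sequence[int], scores: Sequence[float], threshold: float
) -> Dict[str, float]:
    """Sensitivity, specificity, and precision at a fixed score threshold."""
    tp = fp = tn = fn = 0
    for y, s in zip(y_true, scores):
        predicted_positive = s >= threshold
        if y == 1 and predicted_positive:
            tp += 1
        elif y == 1:
            fn += 1
        elif predicted_positive:
            fp += 1
        else:
            tn += 1
    nan = float("nan")
    return {
        "sensitivity": tp / (tp + fn) if tp + fn else nan,
        "specificity": tn / (tn + fp) if tn + fp else nan,
        "precision": tp / (tp + fp) if tp + fp else nan,
    }


# Hypothetical labels (1 = breast cancer within five years) and risk scores.
y = [0, 0, 0, 0, 0, 0, 0, 0, 1, 1]
risk = [0.01, 0.02, 0.02, 0.03, 0.04, 0.05, 0.06, 0.08, 0.07, 0.09]

print(roc_auc(y, risk))
print(threshold_metrics(y, risk, threshold=0.065))
```

With so few positives, a single false positive moves precision sharply while specificity stays high, which shows why the threshold-based metrics add information beyond the AUC on an imbalanced data set.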
Pages: 17