How can machine-learning methods assist in virtual screening for hyperuricemia? A healthcare machine-learning approach

被引:40
|
作者
Ichikawa, Daisuke [1 ]
Saito, Toki [1 ]
Ujita, Waka [1 ]
Oyama, Hiroshi [1 ]
机构
[1] Univ Tokyo, Grad Sch Med, Dep Clin Informat Engn, Div Social Med, Tokyo, Japan
关键词
Hyperuricemia; Machine-learning; Prediction; URIC-ACID; CLASSIFICATION; PROGRAM; RISK;
D O I
10.1016/j.jbi.2016.09.012
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Object: Our purpose was to develop a new machine-learning approach (a virtual health check-up) toward identification of those at high risk of hyperuricemia. Applying the system to general health check-ups is expected to reduce medical costs compared with administering an additional test. Methods: Data were collected during annual health check-ups performed in Japan between 2011 and 2013 (inclusive). We prepared training and test datasets from the health check-up data to build prediction models; these were composed of 43,524 and 17,789 persons, respectively. Gradient-boosting decision tree (GBDT), random forest (RF), and logistic regression (LR) approaches were trained using the training dataset and were then used to predict hyperuricemia in the test dataset. Undersampling was applied to build the prediction models to deal with the imbalanced class dataset. Results: The results showed that the RF and GBDT approaches afforded the best performances in terms of sensitivity and specificity, respectively. The area under the curve (AUC) values of the models, which reflected the total discriminative ability of the classification, were 0.796 [95% confidence interval (CI): 0.766-0.825] for the GBDT, 0.784 [95% CI: 0.752-0.815] for the RF, and 0.785 [95% CI: 0.752-0.819] for the LR approaches. No significant differences were observed between pairs of each approach. Small changes occurred in the AUCs after applying undersampling to build the models. Conclusions: We developed a virtual health check-up that predicted the development of hyperuricemia using machine-learning methods. The GBDT, RF, and LR methods had similar predictive capability. Undersampling did not remarkably improve predictive power. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:20 / 24
页数:5
相关论文
共 50 条
  • [31] Examining the radius valley: a machine-learning approach
    MacDonald, Mariah G.
    [J]. MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2019, 487 (04) : 5062 - 5069
  • [32] A machine-learning approach to predict postprandial hypoglycemia
    Wonju Seo
    You-Bin Lee
    Seunghyun Lee
    Sang-Man Jin
    Sung-Min Park
    [J]. BMC Medical Informatics and Decision Making, 19
  • [33] Virtual Issue on Machine-Learning Discoveries in Materials Science
    Oliynyk, Anton O.
    Buriak, Jillian M.
    [J]. CHEMISTRY OF MATERIALS, 2019, 31 (20) : 8243 - 8247
  • [34] A practical guide to machine-learning scoring for structure-based virtual screening
    Viet-Khoa Tran-Nguyen
    Muhammad Junaid
    Saw Simeon
    Pedro J. Ballester
    [J]. Nature Protocols, 2023, 18 : 3460 - 3511
  • [35] Performance of machine-learning scoring functions in structure-based virtual screening
    Maciej Wójcikowski
    Pedro J. Ballester
    Pawel Siedlecki
    [J]. Scientific Reports, 7
  • [36] Performance of machine-learning scoring functions in structure-based virtual screening
    Wojcikowski, Maciej
    Ballester, Pedro J.
    Siedlecki, Pawel
    [J]. SCIENTIFIC REPORTS, 2017, 7
  • [37] A practical guide to machine-learning scoring for structure-based virtual screening
    Tran-Nguyen, Viet-Khoa
    Junaid, Muhammad
    Simeon, Saw
    Ballester, Pedro J.
    [J]. NATURE PROTOCOLS, 2023, 18 (11) : 3460 - 3511
  • [38] Machine-learning micropattern manufacturing
    Wang, Si
    Shen, Ziao
    Shen, Zhenyu
    Dong, Yuanjun
    Li, Yanran
    Cao, Yuxin
    Zhang, Yanmei
    Guo, Shengshi
    Shuai, Jianwei
    Yang, Yun
    Lin, Changjian
    Chen, Xun
    Zhang, Xingcai
    Huang, Qiaoling
    [J]. NANO TODAY, 2021, 38
  • [39] Certified Machine-Learning Models
    Damiani, Ernesto
    Ardagna, Claudio A.
    [J]. SOFSEM 2020: THEORY AND PRACTICE OF COMPUTER SCIENCE, 2020, 12011 : 3 - 15
  • [40] Machine-learning with Cellular Automata
    Povalej, P
    Kokol, P
    Druzovec, TW
    Stiglic, B
    [J]. ADVANCES IN INTELLIGENT DATA ANALYSIS VI, PROCEEDINGS, 2005, 3646 : 305 - 315