Investigation of expert rule bases, logistic regression, and non-linear machine learning techniques for predicting response to antiretroviral treatment

Cited by: 0
Authors
Prosperi, Mattia C. F. [1, 2]
Altmann, Andre [3]
Rosen-Zvi, Michal [4]
Aharoni, Ehud [4]
Borgulya, Gabor [5]
Bazso, Fulop [5]
Sonnerborg, Anders [6]
Schuelter, Eugen [7]
Struck, Daniel [8]
Ulivi, Giovanni [1]
Vandamme, Anne-Mieke [9]
Vercauteren, Jurgen [9]
Zazzi, Maurizio [10]
Affiliations
[1] Roma Tre Univ, Dept Comp Sci & Automat, Rome, Italy
[2] Informa, Rome, Italy
[3] Max Planck Inst Informat, Saarbrucken, Germany
[4] IBM Haifa Res Lab, Haifa, Israel
[5] Hungarian Acad Sci, KFKI Res Inst Particle & Nucl Phys, Budapest, Hungary
[6] Karolinska Inst, Stockholm, Sweden
[7] Univ Cologne, Cologne, Germany
[8] Ctr Rech Publ Sante, Luxembourg, Luxembourg
[9] Katholieke Univ Leuven, Rega Inst, Leuven, Belgium
[10] Univ Siena, I-53100 Siena, Italy
Keywords
DRUG-RESISTANCE; INTERPRETATION SYSTEMS; GENOTYPIC-RESISTANCE; HIV-1; THERAPY; DISCORDANCES; VALIDATION; ALGORITHMS; PATTERNS; PROTEASE;
DOI
Not available
Chinese Library Classification number
R51 [Infectious diseases]
Discipline classification code
100401
Abstract
Background: The extreme flexibility of the HIV type-1 (HIV-1) genome makes it challenging to build the ideal antiretroviral treatment regimen. Interpretation of HIV-1 genotypic drug resistance is evolving from rule-based systems guided by expert opinion to data-driven engines developed through machine learning methods.

Methods: The aim of the study was to investigate linear and non-linear statistical learning models for classifying the short-term virological outcome of antiretroviral treatment. To optimize the model, different feature selection methods were considered. Robust extra-sample error estimation and different loss functions were used to assess model performance. The results were compared with widely used rule-based genotypic interpretation systems (Stanford HIVdb, Rega and ANRS).

Results: A set of 3,143 treatment change episodes was extracted from the EuResist database. The dataset included patient demographics, treatment history and viral genotypes. A logistic regression model using high-order interaction variables performed better than rule-based genotypic interpretation systems (accuracy 75.63% versus 71.74-73.89%, area under the receiver operating characteristic curve [AUC] 0.76 versus 0.68-0.70) and was equivalent to a random forest model (accuracy 76.16%, AUC 0.77). However, when rule-based genotypic interpretation systems were coupled with additional patient attributes, and the combination was provided as input to the logistic regression model, the performance increased significantly, becoming comparable to the fully data-driven methods.

Conclusions: Patient-derived supplementary features significantly improved the accuracy of the prediction of response to treatment, both with rule-based and data-driven interpretation systems. Fully data-driven models derived from large-scale data sources show promise as antiretroviral treatment decision support tools.
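
The abstract compares models by accuracy and by area under the ROC curve (AUC). As a minimal sketch of how these two metrics can be computed for a binary outcome, the following pure-Python example may help; it is not the authors' code, the function names are hypothetical, the 0.5 decision threshold is an assumption, and the toy labels and scores are invented for illustration rather than taken from the EuResist data.

```python
from typing import Sequence


def accuracy(labels: Sequence[int], scores: Sequence[float],
             threshold: float = 0.5) -> float:
    """Fraction of episodes whose thresholded score matches the observed label."""
    predictions = [1 if s >= threshold else 0 for s in scores]
    correct = sum(int(p == y) for p, y in zip(predictions, labels))
    return correct / len(labels)


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random success outscores a random failure.

    A tie between a success and a failure counts as one half.
    """
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one example of each class")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Invented toy data (NOT from the study): 1 = virological success, 0 = failure.
toy_labels = [1, 1, 1, 0, 0, 1, 0, 0]
toy_scores = [0.9, 0.8, 0.4, 0.6, 0.3, 0.7, 0.2, 0.5]

toy_accuracy = accuracy(toy_labels, toy_scores)
toy_auc = roc_auc(toy_labels, toy_scores)
print(f"accuracy={toy_accuracy:.3f}  AUC={toy_auc:.3f}")
```

The pairwise formulation is quadratic in the number of episodes, which is acceptable for a sketch; a rank-based computation would be the usual choice for larger datasets. Accuracy depends on the chosen threshold, whereas AUC summarizes ranking quality across all thresholds, which is one reason a study may report both.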
Pages: 433 - 442
Page count: 10
Related papers
43 in total
  • [1] Comparison of Statistical Logistic Regression and RandomForest Machine Learning Techniques in Predicting Diabetes
    Daghistani, Tahani
    Alshammari, Riyad
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2020, 11 (02) : 78 - 83
  • [2] Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects
    Dumitrescu, Elena
    Hue, Sullivan
    Hurlin, Christophe
    Tokpavi, Sessi
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2022, 297 (03) : 1178 - 1192
  • [3] Predicting Response to Antiretroviral Treatment by Machine Learning: The EuResist Project
    Zazzi, Maurizio
    Incardona, Francesca
    Rosen-Zvi, Michal
    Prosperi, Mattia
    Lengauer, Thomas
    Altmann, Andre
    Sonnerborg, Anders
    Lavee, Tamar
    Schuelter, Eugen
    Kaiser, Rolf
    INTERVIROLOGY, 2012, 55 (02) : 123 - 127
  • [4] Comparison of machine learning and logistic regression models in predicting psoriasis treatment outcome: A scoping review
    Haw, W.
    Hussain, A.
    Reynolds, N. J.
    Griffiths, C.
    Peek, N.
    Warren, R. B.
    JOURNAL OF INVESTIGATIVE DERMATOLOGY, 2022, 142 (12) : S200 - S200
  • [5] A comparative study of logistic regression based machine learning techniques for prediction of early virological suppression in antiretroviral initiating HIV patients
    Bisaso, Kuteesa R.
    Karungi, Susan A.
    Kiragga, Agnes
    Mukonzo, Jackson K.
    Castelnuovo, Barbara
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2018, 18
  • [7] A Comparison of Linear and Non-Linear Machine Learning Techniques (PCA and SOM) for Characterizing Urban Nutrient Runoff
    Gorgoglione, Angela
    Castro, Alberto
    Iacobellis, Vito
    Gioia, Andrea
    SUSTAINABILITY, 2021, 13 (04) : 1 - 19
  • [9] Machine learning methods are comparable to logistic regression techniques in predicting severe walking limitation following total knee arthroplasty
    Pua, Yong-Hao
    Kang, Hakmook
    Thumboo, Julian
    Clark, Ross Allan
    Chew, Eleanor Shu-Xian
    Poon, Cheryl Lian-Li
    Chong, Hwei-Chi
    Yeo, Seng-Jin
    KNEE SURGERY SPORTS TRAUMATOLOGY ARTHROSCOPY, 2020, 28 (10) : 3207 - 3216
  • [10] Multi-task learning framework for predicting water quality using non-linear machine learning technique
    Senthilkumar, D.
    Washington, D. George
    Reshmy, A. K.
    Noornisha, M.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (06) : 5667 - 5679