Investigation of expert rule bases, logistic regression, and non-linear machine learning techniques for predicting response to antiretroviral treatment

被引:0
|
作者
Prosperi, Mattia C. F. [1 ,2 ]
Altmann, Andre [3 ]
Rosen-Zvi, Michal [4 ]
Aharoni, Ehud [4 ]
Gabor Borgulya [5 ]
Fulop Bazso [5 ]
Sonnerborg, Anders [6 ]
Schuelter, Eugen [7 ]
Struck, Daniel [8 ]
Ulivi, Giovanni [1 ]
Vandamme, Anne-Mieke [9 ]
Vercauteren, Jurgen [9 ]
Zazzi, Maurizio [10 ]
机构
[1] Roma Tre Univ, Dept Comp Sci & Automat, Rome, Italy
[2] Informa, Rome, Italy
[3] Max Planck Inst Informat, Saarbrucken, Germany
[4] IBM Haifa Res Lab, Haifa, Israel
[5] Hungarian Acad Sci, KFKI Res Inst Particle & Nucl Phys, Budapest, Hungary
[6] Karolinska Inst, Stockholm, Sweden
[7] Univ Cologne, Cologne, Germany
[8] Ctr Rech Publ Sante, Luxembourg, Luxembourg
[9] Katholieke Univ Leuven, Rega Inst, Leuven, Belgium
[10] Univ Siena, I-53100 Siena, Italy
关键词
DRUG-RESISTANCE; INTERPRETATION SYSTEMS; GENOTYPIC-RESISTANCE; HIV-1; THERAPY; DISCORDANCES; VALIDATION; ALGORITHMS; PATTERNS; PROTEASE;
D O I
暂无
中图分类号
R51 [传染病];
学科分类号
100401 ;
摘要
Background: The extreme flexibility of the HIV type-1 (HIV-1) genome makes it challenging to build the ideal antiretroviral treatment regimen. Interpretation of HIV-1 genotypic drug resistance is evolving from rule-based systems guided by expert opinion to data-driven engines developed through machine learning methods. Methods: The aim of the study was to investigate linear and non-linear statistical learning models for classifying short-term virological outcome of antiretroviral treatment. To optimize the model, different feature selection methods were considered. Robust extra-sample error estimation and different loss functions were used to assess model performance. The results were compared with widely used rule-based genotypic interpretation systems (Stanford HIVdb, Rega and ANRS). Results: A set of 3,143 treatment change episodes were extracted from the EuResist database. The dataset included patient demographics, treatment history and viral genotypes. A logistic regression model using high order interaction variables performed better than rule-based genotypic interpretation systems (accuracy 75.63% versus 71.74-73.89%, area under the receiver operating characteristic curve [AUC] 0.76 versus 0.68-0.70) and was equivalent to a random forest model (accuracy 76.16%, AUC 0.77). However, when rule-based genotypic interpretation systems were coupled with additional patient attributes, and the combination was provided as input to the logistic regression model, the performance increased significantly, becoming comparable to the fully data-driven methods. Conclusions: Patient-derived supplementary features significantly improved the accuracy of the prediction of response to treatment, both with rule-based and data-driven interpretation systems. Fully data-driven models derived from large-scale data sources show promise as antiretroviral treatment decision support tools.
引用
收藏
页码:433 / 442
页数:10
相关论文
共 43 条
  • [21] Machine learning models using non-linear techniques improve the prediction of resting energy expenditure in individuals receiving hemodialysis
    Bailey, Alainn
    Eltawil, Mohamed
    Gohel, Suril
    Byham-Gray, Laura
    ANNALS OF MEDICINE, 2023, 55 (02)
  • [22] Analyzing driving factors of land values in urban scale based on big data and non-linear machine learning techniques
    Ma, Jun
    Cheng, Jack C. P.
    Jiang, Feifeng
    Chen, Weiwei
    Zhang, Jingcheng
    LAND USE POLICY, 2020, 94
  • [23] Detection and classification of microcalcification from digital mammograms with firefly algorithm, extreme learning machine and non-linear regression models: A comparison
    Chakravarthy, S. R. Sannasi
    Rajaguru, Harikumar
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2020, 30 (01) : 126 - 146
  • [24] A cross-sectional study comparing machine learning and logistic regression techniques for predicting osteoporosis in a group at high risk of cardiovascular disease among old adults
    Yuyi Peng
    Chi Zhang
    Bo Zhou
    BMC Geriatrics, 25 (1)
  • [25] Examining non-linear relationship between streetscape features and propensity of walking to school in Hong Kong using machine learning techniques
    Wu, Fangning
    Li, Wenjing
    Qiu, Waishan
    JOURNAL OF TRANSPORT GEOGRAPHY, 2023, 113
  • [26] Deterministic modelling of implied volatility in cryptocurrency options with underlying multiple resolution momentum indicator and non-linear machine learning regression algorithm
    Leung, F.
    Law, M.
    Djeng, S. K.
    FINANCIAL INNOVATION, 2024, 10 (01)
  • [27] Data-Driven Fault Diagnosis Techniques: Non-Linear Directional Residual vs. Machine-Learning-Based Methods
    Cartocci, Nicholas
    Napolitano, Marcello R.
    Crocetti, Francesco
    Costante, Gabriele
    Valigi, Paolo
    Fravolini, Mario L.
    SENSORS, 2022, 22 (07)
  • [28] A comparative study of several machine learning based non-linear regression methods in estimating solar radiation: Case studies of the USA and Turkey regions
    Alizamir, Meysam
    Kim, Sungwon
    Kisi, Ozgur
    Zounemat-Kermani, Mohammad
    ENERGY, 2020, 197
  • [29] Predicting non-response to multimodal day clinic treatment in severely impaired depressed patients: a machine learning approach
    Vetter, Johannes Simon
    Schultebraucks, Katharina
    Galatzer-Levy, Isaac
    Boeker, Heinz
    Bruhl, Annette
    Seifritz, Erich
    Kleim, Birgit
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [30] Predicting non-response to multimodal day clinic treatment in severely impaired depressed patients: a machine learning approach
    Johannes Simon Vetter
    Katharina Schultebraucks
    Isaac Galatzer-Levy
    Heinz Boeker
    Annette Brühl
    Erich Seifritz
    Birgit Kleim
    Scientific Reports, 12