PREDICTION OF THE NASH THROUGH PENALIZED MIXTURE OF LOGISTIC REGRESSION MODELS

被引:0
|
作者
Morvan, Marie [1 ]
Devijver, Emilie [2 ]
Giacofci, Madison [1 ]
Monbet, Valerie [1 ]
机构
[1] Univ Rennes, CNRS, IRMAR UMR 6625, Rennes, France
[2] Univ Grenoble Alpes, Grenoble INP, CNRS, INRIA, Grenoble, France
来源
ANNALS OF APPLIED STATISTICS | 2021年 / 15卷 / 02期
关键词
Mixture regression model; prediction; variable selection; heterogeneous data; spectrometry data; FINITE MIXTURE; MAXIMUM-LIKELIHOOD; VARIABLE SELECTION; EM ALGORITHM;
D O I
10.1214/20-AOAS1409
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this paper an appropriate and interpretable diagnosis statistical model is proposed to predict Nonalcoholic Steatohepatitis (NASH) from near infrared spectrometry data. In this disease, unknown patients' profiles are expected to lead to a different diagnosis. The model has then to take into account the heterogeneity of the data and the dimension of the spectrometric data. To this end, we propose to fit a mixture model on the joint distribution of the diagnostic binary variable and the covariates selected in the spectra. The penalized maximum likelihood estimator is considered. In practice, a twofold penalty on both regression coefficients and covariance parameters is imposed. Automatic selection criteria, such as the AIC and BIC, are used to select the amount of shrinkage and the number of clusters. The performance of the overall procedure is evaluated by a simulation study, and its application on the NASH data set is analyzed. The model leads to better prediction performance than competitive methods and provides highly interpretable results.
引用
收藏
页码:952 / 970
页数:19
相关论文
共 50 条
  • [1] Penalized Estimation of a Finite Mixture of Linear Regression Models
    Rocci, Roberto
    Di Mari, Roberto
    Gattone, Stefano Antonio
    BUILDING BRIDGES BETWEEN SOFT AND STATISTICAL METHODOLOGIES FOR DATA SCIENCE, 2023, 1433 : 326 - 333
  • [2] Comparison of penalized logistic regression models for rare event case
    Olmus, Hulya
    Nazman, Ezgi
    Erbas, Semra
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2022, 51 (04) : 1578 - 1590
  • [3] Fitting Penalized Logistic Regression Models Using QR Factorization
    Klimaszewski, Jacek
    Korzen, Marcin
    COMPUTATIONAL SCIENCE - ICCS 2020, PT II, 2020, 12138 : 44 - 57
  • [4] PENALIZED LIKELIHOOD FOR LOGISTIC-NORMAL MIXTURE MODELS WITH UNEQUAL VARIANCES
    Shen, Juan
    Wang, Yingchuan
    He, Xuming
    STATISTICA SINICA, 2017, 27 (02) : 711 - 731
  • [5] Multiclass-penalized logistic regression
    Nibbering, Didier
    Hastie, Trevor J.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2022, 169
  • [6] Image retrieval system for citizen services using penalized logistic regression models
    de Ves, E.
    Benavent, X.
    Ayala, G.
    Cerveron, V
    PROCEEDINGS OF THE 10TH EURO-AMERICAN CONFERENCE ON TELEMATICS AND INFORMATION SYSTEMS (EATIS 2020), 2020,
  • [7] Prediction of software failures through logistic regression
    Salem, AM
    Rekab, K
    Whittaker, JA
    INFORMATION AND SOFTWARE TECHNOLOGY, 2004, 46 (12) : 781 - 789
  • [8] Classification of gene microarrays by penalized logistic regression
    Zhu, J
    Hastie, T
    BIOSTATISTICS, 2004, 5 (03) : 427 - 443
  • [9] Penalized robust estimators in sparse logistic regression
    Bianco, Ana M.
    Boente, Graciela
    Chebi, Gonzalo
    TEST, 2022, 31 (03) : 563 - 594
  • [10] Penalized logistic regression for detecting gene interactions
    Park, Mee Young
    Hastie, Trevor
    BIOSTATISTICS, 2008, 9 (01) : 30 - 50