On Propagated Scoring for Semisupervised Additive Models

被引:5
|
作者
Culp, Mark [1 ]
机构
[1] W Virginia Univ, Dept Stat, Morgantown, WV 26506 USA
关键词
Additive model; Fixed-point optimization; Semisupervised learning;
D O I
10.1198/jasa.2011.tm09316
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This article presents a semisupervised modeling framework that combines feature-based (x) data and graph-based (G) data for classification/regression of the response Y. In this semisupervised setting, Y is observed for a subset of the observations (labeled) and missing for the remainder (unlabeled). The Propagated Scoring algorithm proposed for fitting this model is a semisupervised fixed-point regularization approach that essentially extends the generalized additive model into the semisupervised setting. I first articulate when semisupervised degeneracies are expected within my framework, and then provide a general regularization strategy to address such circumstances. For statistical analysis, I establish that the approach uses shrinking smoothers, provide circumstances in which when the result is consistent, provide measures of inference and description, and establish clear connections to supervised models. Several semisupervised approaches have been considered for the classification problem posed, typically motivated from energy optimization perspective. In this work, I rigorously connect the statistically based propagated scoring framework to this class of approaches. This is particularly insightful, especially with regard to supervised comparisons, because this type of analysis is lacking for the previous work. Two applications are presented, one involving classification of protein location on a cell using a network of protein interaction data and the other involving classification of text documents with citation network information and text data. This article has supplementary material online.
引用
收藏
页码:248 / 259
页数:12
相关论文
共 50 条
  • [1] Generalized Partially Linear Additive Models for Credit Scoring
    Shim, Ju-Hyun
    Lee, Young K.
    KOREAN JOURNAL OF APPLIED STATISTICS, 2011, 24 (04) : 587 - 595
  • [2] SEMISUPERVISED HYPERSPECTRAL IMAGE CLASSIFICATION BASED ON AFFINITY SCORING
    Chen, Zhao
    Wang, Bin
    Niu, Yubin
    Xia, Wei
    Zhang, Jian Qiu
    Hu, Bo
    2015 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2015, : 4967 - 4970
  • [3] Monotonic Neural Additive Models: Pursuing Regulated Machine Learning Models for Credit Scoring
    Chen, Dangxing
    Ye, Weicheng
    3RD ACM INTERNATIONAL CONFERENCE ON AI IN FINANCE, ICAIF 2022, 2022, : 70 - 78
  • [4] The total cost of misclassification in credit scoring: A comparison of generalized linear models and generalized additive models
    Lohmann, Christian
    Ohliger, Thorsten
    JOURNAL OF FORECASTING, 2019, 38 (05) : 375 - 389
  • [5] Nonlinear, flexible, semisupervised learning scheme for face beauty scoring
    Dornaika, Fadi
    Elorza, Anne
    Wang, Kunwei
    Arganda-Carreras, Ignacio
    JOURNAL OF ELECTRONIC IMAGING, 2019, 28 (04)
  • [6] Learning and scoring Gaussian latent variable causal models with unknown additive interventions
    Taeb, Armeen
    Gamella, Juan L.
    Heinze-Deml, Christina
    Buhlmann, Peter
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25
  • [7] Semisupervised Spectral-Spatial Classification of Hyperspectral Imagery With Affinity Scoring
    Chen, Zhao
    Wang, Bin
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2015, 12 (08) : 1710 - 1714
  • [8] Semisupervised learning of mixture models with class constraints
    Zhao, Q
    Miller, DJ
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 185 - 188
  • [9] A New Method for Scoring Additive Multi- attributeValue Models Using Pairwise Rankings of Alternatives
    Hansen, Paul
    Ombler, Franz
    JOURNAL OF MULTI-CRITERIA DECISION ANALYSIS, 2008, 15 (3-4) : 87 - 107
  • [10] A REASSESSMENT OF THE ADDITIVE SCORING OF HEALTH PRACTICES
    SLATER, CH
    LINDER, SH
    MEDICAL CARE, 1988, 26 (12) : 1216 - 1227