Semi-supervised learning via constraints

被引:0
|
作者
Pan, Wei [1 ]
Shen, Xiaotong [1 ]
机构
[1] Univ Minnesota, Dept Biostat, Minneapolis, MN 55455 USA
来源
PREDICTION AND DISCOVERY | 2007年 / 443卷
关键词
empirical likelihood; logistic regression; penalization; regularization;
D O I
暂无
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Given partially labeled data consisting of labeled data (y(i), x(i))'s and unlabeled data x(j)'s, semi-supervised learning aims to use unlabeled data, in addition to labeled data, to improve over supervised learning that uses only labeled data. Because most classifications are inherently based on the conditional distribution f(Y vertical bar X) while unlabeled data only contain information on marginal distribution f(X), some modeling assumptions are necessary to connect the two distributions. Here we assume that some characteristics of the marginal distribution of Y, f(Y), are known, and hence are used as extra constraints on the original objective function of supervised learning. As a prototype implementation, for a binary problem with Y = 1 or -1, we assume that the marginal class probability po = Pr(Y = 1) is known, and incorporate it as an additional penalty term into penalized logistic regression. We point out that our proposal is closely related to the approach of combining parametric and empirical likelihoods. We use simulated data and real data to demonstrate that the proposed method can have improved performance over penalized logistic regression without using unlabeled data. The idea can be extended to kernel logistic regression, SVM and even other semi-supervised learning algorithms.
引用
收藏
页码:193 / 204
页数:12
相关论文
共 50 条
  • [1] Semi-supervised document clustering via active learning with pairwise constraints
    Huang, Ruizhang
    Lam, Wai
    [J]. ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 517 - 522
  • [2] Active Learning of Constraints for Semi-Supervised Clustering
    Xiong, Sicheng
    Azimi, Javad
    Fern, Xiaoli Z.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (01) : 43 - 54
  • [3] Semi-supervised dictionary learning via local sparse constraints for violence detection
    Zhang, Tao
    Jia, Wenjing
    Gong, Chen
    Sun, Jun
    Song, Xiaoning
    [J]. PATTERN RECOGNITION LETTERS, 2018, 107 : 98 - 104
  • [4] Semi-Supervised Learning via Regularized Boosting Working on Multiple Semi-Supervised Assumptions
    Chen, Ke
    Wang, Shihai
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (01) : 129 - 143
  • [5] Semi-supervised metric learning via topology preserving multiple semi-supervised assumptions
    Wang, Qianying
    Yuen, Pong C.
    Feng, Guocan
    [J]. PATTERN RECOGNITION, 2013, 46 (09) : 2576 - 2587
  • [6] Semi-Supervised Metric Learning Using Pairwise Constraints
    Baghshah, Mahdieh Soleymani
    Shouraki, Saeed Bagheri
    [J]. 21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS, 2009, : 1217 - 1222
  • [7] Active Learning of Constraints for Semi-supervised Text Clustering
    Huang, Ruizhang
    Lam, Wai
    Zhang, Zhigang
    [J]. PROCEEDINGS OF THE SEVENTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 113 - 124
  • [8] Semi-Supervised Learning with Declaratively Specified Entropy Constraints
    Sun, Haitian
    Bing, Lidong
    Cohen, William W.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [9] Multitask Semi-supervised Learning with Constraints and Constraint Exceptions
    Maggini, Marco
    Papini, Tiziano
    [J]. ARTIFICIAL NEURAL NETWORKS (ICANN 2010), PT III, 2010, 6354 : 218 - 227
  • [10] Teeth Segmentation via Semi-Supervised Learning
    Gao, Yonghui
    Li, Xiaoxiao
    [J]. PROCEEDINGS OF THE 2013 6TH INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS (BMEI 2013), VOLS 1 AND 2, 2013, : 558 - 563