Maximum margin semi-supervised learning with irrelevant data

被引:10
|
作者
Yang, Haiqin [1 ,2 ]
Huang, Kaizhu [3 ]
King, Irwin [1 ,2 ]
Lyu, Michael R. [1 ,2 ]
机构
[1] Chinese Univ Hong Kong, Shenzhen Res Inst, Shenzhen Key Lab Rich Media Big Data Analyt & App, Hong Kong, Hong Kong, Peoples R China
[2] Chinese Univ Hong Kong, Comp Sci & Engn, Hong Kong, Hong Kong, Peoples R China
[3] Xian Jiaotong Liverpool Univ, Suzhou, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Maximum margin classifier; Irrelevant data; Semi-supervised learning; Concave convex procedure;
D O I
10.1016/j.neunet.2015.06.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semi-supervised learning (SSL) is a typical learning paradigms training a model from both labeled and unlabeled data. The traditional SSL models usually assume unlabeled data are relevant to the labeled data, i.e., following the same distributions of the targeted labeled data. In this paper, we address a different, yet formidable scenario in semi-supervised classification, where the unlabeled data may contain irrelevant data to the labeled data. To tackle this problem, we develop a maximum margin model, named tri-class support vector machine (3C-SVM), to utilize the available training data, while seeking a hyperplane for separating the targeted data well. Our 3C-SVM exhibits several characteristics and advantages. First, it does not need any prior knowledge and explicit assumption on the data relatedness. On the contrary, it can relieve the effect of irrelevant unlabeled data based on the logistic principle and maximum entropy principle. That is, 3C-SVM approaches an ideal classifier. This classifier relies heavily on labeled data and is confident on the relevant data lying far away from the decision hyperplane, while maximally ignoring the irrelevant data, which are hardly distinguished. Second, theoretical analysis is provided to prove that in what condition, the irrelevant data can help to seek the hyperplane. Third, 3C-SVM is a generalized model that unifies several popular maximum margin models, including standard SVMs, Semi-supervised SVMs ((SVMS)-V-3), and SVMs learned from the universum (u-SVM5) as its special cases. More importantly, we deploy a concave convex produce to solve the proposed 3C-SVM, transforming the original mixed integer programming, to a semi-definite programming relaxation, and finally to a sequence of quadratic programming subproblems, which yields the same worst case time complexity as that of (SVMs)-V-3. Finally, we demonstrate the effectiveness and efficiency of our proposed 3C-SVM through systematical experimental comparisons. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:90 / 102
页数:13
相关论文
共 50 条
  • [1] Maximum margin based semi-supervised spectral kernel learning
    Xu, Zenglin
    Zhu, Jianke
    Lyu, Michael R.
    King, Irwin
    [J]. 2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 418 - 423
  • [2] Large margin semi-supervised learning
    Wang, Junhui
    Shen, Xiaotong
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2007, 8 : 1867 - 1891
  • [3] Semi-Supervised Maximum Margin Clustering with Pairwise Constraints
    Zeng, Hong
    Cheung, Yiu-Ming
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (05) : 926 - 939
  • [4] Semi-Supervised Maximum Discriminative Local Margin for Gene Selection
    Zejun Li
    Bo Liao
    Lijun Cai
    Min Chen
    Wenhua Liu
    [J]. Scientific Reports, 8
  • [5] Semi-Supervised Maximum Discriminative Local Margin for Gene Selection
    Li, Zejun
    Liao, Bo
    Cai, Lijun
    Chen, Min
    Liu, Wenhua
    [J]. SCIENTIFIC REPORTS, 2018, 8
  • [6] Large Margin Graph Construction for Semi-Supervised Learning
    Guo, Lan-Zhe
    Wang, Shao-Bo
    Li, Yu-Feng
    [J]. 2018 18TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2018, : 1030 - 1033
  • [7] A sparse large margin semi-supervised learning method
    Choi, Hosik
    Kim, Jinseog
    Kim, Yongdai
    [J]. JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2010, 39 (04) : 479 - 487
  • [8] A sparse large margin semi-supervised learning method
    Hosik Choi
    Jinseog Kim
    Yongdai Kim
    [J]. Journal of the Korean Statistical Society, 2010, 39 : 479 - 487
  • [9] Data driven semi-supervised learning
    Balcan, Maria-Florina
    Sharma, Dravyansh
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [10] Maximum Entropy Semi-Supervised Inverse Reinforcement Learning
    Audiffren, Julien
    Valko, Michal
    Lazaric, Alessandro
    Ghavamzadeh, Mohammad
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 3315 - 3321