Semi-supervised trees for multi-target regression

被引:36
|
作者
Levatic, Jurica [1 ,2 ]
Kocev, Dragi [1 ,2 ,3 ]
Ceci, Michelangelo [3 ,4 ]
Dzeroski, Saso [1 ,2 ]
机构
[1] Josef Stefan Inst, Dept Knowledge Technol, Ljubljana, Slovenia
[2] Jotef Stefan Int Postgrad Sch, Ljubljana, Slovenia
[3] Univ Bari Aldo Moro, Dept Comp Sci, Bari, Italy
[4] CINI, Rome, Italy
基金
欧盟地平线“2020”;
关键词
Semi-supervised learning; Multi-target regression; Structured outputs; Predictive clustering trees; Random forests; CLASSIFICATION; MODEL; PREDICTION; INDUCTION; ENSEMBLES; INDEX;
D O I
10.1016/j.ins.2018.03.033
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The predictive performance of traditional supervised methods heavily depends on the amount of labeled data. However, obtaining labels is a difficult process in many real-life tasks, and only a small amount of labeled data is typically available for model learning. As an answer to this problem, the concept of semi-supervised learning has emerged. Semi supervised methods use unlabeled data in addition to labeled data to improve the performance of supervised methods. It is even more difficult to get labeled data for data mining problems with structured outputs since several labels need to be determined for each example. Multi-target regression (MTR) is one type of a structured output prediction problem, where we need to simultaneously predict multiple continuous variables. Despite the apparent need for semi supervised methods able to deal with MTR, only a few such methods are available and even those are difficult to use in practice and/or their advantages over supervised methods for MTR are not clear. This paper presents an extension of predictive clustering trees for MTR and ensembles thereof towards semi-supervised learning. The proposed method preserves the appealing characteristic of decision trees while enabling the use of unlabeled examples. In particular, the proposed semi-supervised trees for MTR are interpretable, easy to understand, fast to learn, and can handle both numeric and nominal descriptive features. We perform an extensive empirical evaluation in both an inductive and a transductive semi-supervised setting. The results show that the proposed method improves the performance of supervised predictive clustering trees and enhances their interpretability (due to reduced tree size), whereas, in the ensemble learning scenario, it outperforms its supervised counterpart in the transductive setting. The proposed methods have a mechanism for controlling the influence of unlabeled examples, which makes them highly useful in practice: This mechanism can protect them against a degradation of performance of their supervised counterparts-an inherent risk of semi-supervised learning. The proposed methods also outperform two existing semi-supervised methods for MTR. (C) 2018 Elsevier Inc. All rights reserved.
引用
收藏
页码:109 / 127
页数:19
相关论文
共 50 条
  • [31] Diverse and consistent multi-view networks for semi-supervised regression
    Nguyen, Cuong
    Raja, Arun
    Zhang, Le
    Xu, Xun
    Unnikrishnan, Balagopal
    Ragab, Mohamed
    Lu, Kangkang
    Foo, Chuan-Sheng
    MACHINE LEARNING, 2023, 112 (07) : 2359 - 2395
  • [32] Laplacian-based Semi-supervised Multi-Label Regression
    Kraus, Vivien
    Benabdeslem, Khalid
    Canitia, Bruno
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [33] SEMI-SUPERVISED MULTI-DOMAIN REGRESSION WITH DISTINCT TRAINING SETS
    Michaeli, Tomer
    Eldar, Yonina C.
    Sapiro, Guillermo
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 2145 - 2148
  • [34] The joint manifold model for semi-supervised multi-valued regression
    Navaratnam, Ramanan
    Fitzgibbon, Andrew W.
    Cipolla, Roberto
    2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, : 1160 - 1167
  • [35] Diverse and consistent multi-view networks for semi-supervised regression
    Cuong Nguyen
    Arun Raja
    Le Zhang
    Xun Xu
    Balagopal Unnikrishnan
    Mohamed Ragab
    Kangkang Lu
    Chuan-Sheng Foo
    Machine Learning, 2023, 112 : 2359 - 2395
  • [36] Scene Recognition via Semi-Supervised Multi-Feature Regression
    Zheng, Caixia
    Chen, Jianyu
    Kong, Jun
    Yi, Yugen
    Lu, Yinghua
    Wang, Jianzhong
    Liu, Chong
    IEEE ACCESS, 2019, 7 : 121612 - 121628
  • [37] A Semi-supervised regressor based on model trees
    Fazakis, Nikos . . . . . . . . . . . . . . . . . .
    Karlos, Stamatis
    Kotsiantis, Sotiris
    Sgarbas, Kyriakos
    10TH HELLENIC CONFERENCE ON ARTIFICIAL INTELLIGENCE (SETN 2018), 2018,
  • [38] Semi-supervised oblique predictive clustering trees
    Stepisnik, Tomaz
    Kocev, Dragi
    PEERJ COMPUTER SCIENCE, 2021,
  • [39] Semi-supervised oblique predictive clustering trees
    Stepišnik T.
    Kocev D.
    PeerJ Computer Science, 2021, 7 : 1 - 20
  • [40] A semi-supervised approach to growing classification trees
    Santhiappan, Sudarsun
    Ravindran, Balaraman
    CODS-COMAD 2021: PROCEEDINGS OF THE 3RD ACM INDIA JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA (8TH ACM IKDD CODS & 26TH COMAD), 2021, : 29 - 37