Augmenting supervised neural classifier training using a corpus of unlabeled data

Cited by: 0
Author
Skabar, A [1]
Affiliation
[1] Int Univ Germany, Sch Informat Technol, D-76646 Bruchsal, Germany
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In recent years, there has been growing interest in applying techniques that incorporate knowledge from unlabeled data into systems performing supervised learning. However, disparate results have been presented in the literature, and there is no general consensus that the use of unlabeled examples should always improve classifier performance. This paper proposes a method for incorporating a corpus of unlabeled examples into the supervised training of a neural network classifier and presents results from applying the technique to several datasets from the UCI repository. While the results do not provide support for the claim that unlabeled data can improve overall classification accuracy, a bias-variance decomposition shows that classifiers trained with unlabeled data display lower bias and higher variance than classifiers trained using labeled data alone.
Pages: 174-185
Number of pages: 12
Related papers
50 in total
  • [21] Semi-supervised Learning from Only Positive and Unlabeled Data Using Entropy
    Wang, Xiaoling
    Xu, Zhen
    Sha, Chaofeng
    Ester, Martin
    Zhou, Aoying
    WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2010, 6184 : 668 - +
  • [22] A new classifier based on information theoretic learning with unlabeled data
    Jeong, KH
    Xu, JW
    Erdogmus, D
    Principe, JC
    NEURAL NETWORKS, 2005, 18 (5-6) : 719 - 726
  • [23] Analysis of unlabeled lung sound samples using semi-supervised convolutional neural networks
    Lang, Rongling
    Fan, Ya
    Liu, Guoliang
    Liu, Guodong
    APPLIED MATHEMATICS AND COMPUTATION, 2021, 411
  • [24] Semi-supervised empirical risk minimization: Using unlabeled data to improve prediction
    Yuval, Oren
    Rosset, Saharon
    ELECTRONIC JOURNAL OF STATISTICS, 2022, 16 (01) : 1434 - 1460
  • [25] Neural Network Classifier-Based OPC With Imbalanced Training Data
    Choi, Suhyeong
    Shim, Seongbo
    Shin, Youngsoo
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2019, 38 (05) : 938 - 948
  • [26] PAKDD’12 best paper: generating balanced classifier-independent training samples from unlabeled data
    Youngja Park
    Zijie Qi
    Suresh N. Chari
    Ian M. Molloy
    Knowledge and Information Systems, 2014, 41 : 871 - 892
  • [27] Classifier-independent visualization of supervised data structures using a graph
    Tenmoto, H
    Mori, Y
    Kudo, M
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, PROCEEDINGS, 2004, 3138 : 1043 - 1051
  • [28] PAKDD'12 best paper: generating balanced classifier-independent training samples from unlabeled data
    Park, Youngja
    Qi, Zijie
    Chari, Suresh N.
    Molloy, Ian M.
    KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 41 (03) : 871 - 892
  • [29] Identifying mislabeled training data with the aid of unlabeled data
    Donghai Guan
    Weiwei Yuan
    Young-Koo Lee
    Sungyoung Lee
    Applied Intelligence, 2011, 35 : 345 - 358
  • [30] Classification accuracy improvement of neural network classifiers by using unlabeled data
    Fardanesh, MT
    Ersoy, OK
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 1998, 36 (03) : 1020 - 1025