Towards Robust SVM Training from Weakly Labeled Large Data Sets

被引:0
|
作者
Kawulok, Michal [1 ]
Nalepa, Jakub [1 ]
机构
[1] Silesian Tech Univ, Inst Informat, Gliwice, Poland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning from large data sets that contain samples of unknown or incorrect labels becomes increasingly important. Such problems are inherent to many big data scenarios, hence there is a need for developing robust generic approaches to learning from difficult data. In this paper, we propose a new memetic algorithm that evolves samples and labels to select a training set for support vector machines from large, weakly-labeled sets. Our extensive experimental study confirmed that the new method presents high robustness against weakly-labeled data and outperforms other state-of-the-art algorithms.
引用
收藏
页码:464 / 468
页数:5
相关论文
共 50 条
  • [21] SVM Classification for Large Data Sets by Considering Models of Classes Distribution
    Cervantes, Jair
    Li, Xiaoou
    Yu, Wen
    MICAI 2007: SIXTH MEXICAN INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2008, : 51 - +
  • [22] GEN, A COMPUTERIZED STATISTICAL PROCEDURE FOR CREATING LARGE DATA SETS FROM SMALL DATA SETS FOR TRAINING DISCRIMINANT FUNCTIONS
    LATHROP, LD
    PENNYPACKER, SP
    PHYTOPATHOLOGY, 1979, 69 (09) : 1036 - 1036
  • [23] Data Programming: Creating Large Training Sets, Quickly
    Ratner, Alexander
    De Sa, Christopher
    Wu, Sen
    Selsam, Daniel
    Re, Christopher
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [24] Towards Robust Human Activity Recognition from RGB Video Stream with Limited Labeled Data
    Sarker, Krishanu
    Masoud, Mohamed
    Belkasim, Saeid
    Ji, Shihao
    2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, : 145 - 151
  • [25] Towards the evolution of training data sets for artificial neural networks
    Mayer, HA
    Schwaiger, R
    PROCEEDINGS OF 1997 IEEE INTERNATIONAL CONFERENCE ON EVOLUTIONARY COMPUTATION (ICEC '97), 1997, : 663 - 666
  • [26] STEME: A Robust, Accurate Motif Finder for Large Data Sets
    Reid, John E.
    Wernisch, Lorenz
    PLOS ONE, 2014, 9 (03):
  • [27] A projection method for robust estimation and clustering in large data sets
    Pena, Daniel
    Prieto, Francisco J.
    DATA ANALYSIS, CLASSIFICATION AND THE FORWARD SEARCH, 2006, : 209 - +
  • [28] Learning From Weakly Supervised Data by The Expectation Loss SVM (e-SVM) algorithm
    Zhu, Jun
    Mao, Junhua
    Yuille, Alan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [29] Sequential learning with LS-SVM for large-scale data sets
    Jung, Tobias
    Polani, Daniel
    ARTIFICIAL NEURAL NETWORKS - ICANN 2006, PT 2, 2006, 4132 : 381 - 390
  • [30] Using Locality-Sensitive Hashing for SVM Classification of Large Data Sets
    Gonzalez-Lima, Maria D.
    Ludena, Carenne C.
    MATHEMATICS, 2022, 10 (11)