Towards Robust SVM Training from Weakly Labeled Large Data Sets

被引:0
|
作者
Kawulok, Michal [1 ]
Nalepa, Jakub [1 ]
机构
[1] Silesian Tech Univ, Inst Informat, Gliwice, Poland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning from large data sets that contain samples of unknown or incorrect labels becomes increasingly important. Such problems are inherent to many big data scenarios, hence there is a need for developing robust generic approaches to learning from difficult data. In this paper, we propose a new memetic algorithm that evolves samples and labels to select a training set for support vector machines from large, weakly-labeled sets. Our extensive experimental study confirmed that the new method presents high robustness against weakly-labeled data and outperforms other state-of-the-art algorithms.
引用
收藏
页码:464 / 468
页数:5
相关论文
共 50 条
  • [41] Enabling Robust Horizon Picking From Small Training Sets
    Mattos, Andrea Britto
    Civitarese, Daniel
    Szwarcman, Daniela
    Oliveira, Matheus
    Zaytsev, Semen
    Semin, Daniil G.
    Oliveira, Dario A. B.
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (06): : 5317 - 5324
  • [42] A Robust Distributed Clustering of Large Data Sets on a Grid of Commodity Machines
    Taamneh, Salah
    Al-Hami, Mo'taz
    Bani-Salameh, Hani
    Abdallah, Alaa E.
    DATA, 2021, 6 (07)
  • [43] ROBUST VIF REGRESSION WITH APPLICATION TO VARIABLE SELECTION IN LARGE DATA SETS
    Dupuis, Debbie J.
    Victoria-Feser, Maria-Pia
    ANNALS OF APPLIED STATISTICS, 2013, 7 (01): : 319 - 341
  • [44] TOWARDS LARGE SCALE ECOACOUSTIC MONITORING WITH SMALL AMOUNTS OF LABELED DATA
    Coban, Enis Berk
    Syed, Ali Raza
    Pir, Dara
    Mandel, Michael, I
    2021 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2021, : 181 - 185
  • [45] EXTRACT: Strong Examples from Weakly-Labeled Sensor Data
    Blalock, Davis W.
    Guttag, John V.
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 799 - 804
  • [46] β-risk: a New Surrogate Risk for Learning from Weakly Labeled Data
    Zantedeschi, Valentina
    Emonet, Remi
    Sebban, Marc
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [47] Semi-Supervised Learning for Named Entity Recognition Using Weakly Labeled Training Data
    Zafarian, Atefeh
    Rokni, Ali
    Khadivi, Shahram
    Ghiasifard, Sonia
    2015 INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2015, : 129 - 135
  • [48] Fast SVM training using data reconstruction for classification of very large datasets
    Liang, Peileng
    Li, Weite
    Hu, Jinglu
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2020, 15 (03) : 372 - 381
  • [49] Developing Anytime SVM Training Algorithms for Large-Scale Data Classification
    Han, Rui
    Ghanem, Moustafa
    Williams, Andreas
    Guo, Yike
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SOFTWARE ENGINEERING (AISE 2014), 2014, : 360 - 366
  • [50] Hybrid Method for Fast SVM Training in Applications Involving Large Volumes of Data
    Wani, M. Arif
    2013 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2013), VOL 2, 2013, : 491 - 494