Towards Robust SVM Training from Weakly Labeled Large Data Sets

Authors
Kawulok, Michal [1]
Nalepa, Jakub [1]
Affiliations
[1] Silesian Tech Univ, Inst Informat, Gliwice, Poland
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Learning from large data sets that contain samples with unknown or incorrect labels is becoming increasingly important. Such problems are inherent to many big data scenarios, so there is a need for robust, generic approaches to learning from difficult data. In this paper, we propose a new memetic algorithm that evolves samples and labels to select a training set for support vector machines from large, weakly labeled sets. Our extensive experimental study confirmed that the new method is highly robust to weakly labeled data and outperforms other state-of-the-art algorithms.
Pages: 464-468
Number of pages: 5