Classifying Remote Sensing Data with Support Vector Machines and Imbalanced Training Data

Cited by: 0
Authors
Waske, Bjorn [1 ]
Benediktsson, Jon Atli [1 ]
Sveinsson, Johannes R. [1 ]
Affiliation
[1] Univ Iceland, Fac Elect & Comp Engn, IS-107 Reykjavik, Iceland
Source
Keywords
land cover classification; multispectral; support vector machines; bagging; imbalanced training data; CLASSIFICATION; MULTISOURCE; SVM;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The classification of remote sensing data with imbalanced training data is addressed. The classification accuracy of a supervised method is affected by several factors, such as the classifier algorithm, the input data, and the available training data. The use of an imbalanced training set, i.e., one in which the number of training samples from one class is much smaller than from the other classes, often results in low classification accuracies for the small classes. In the present study, support vector machines (SVM) are trained with imbalanced training data. To handle the imbalanced training data, the training data are resampled (i.e., bagging) and a multiple classifier system, with SVM as the base classifier, is generated. In addition to the classifier ensemble, a single SVM is applied to the data, using the original balanced and the imbalanced training data sets. The results underline that the SVM classification is affected by imbalanced data sets, resulting in predominantly lower classification accuracies for classes with fewer training data. Moreover, the detailed accuracy assessment demonstrates that the proposed approach significantly improves the class accuracies achieved by a single SVM trained on the whole imbalanced training data set.
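The resampling idea described in the abstract can be illustrated with a minimal sketch. The abstract does not state the exact resampling scheme, so this assumes each bootstrap draws the same number of samples per class (with replacement) and that the ensemble is combined by majority vote. All names (`balanced_bootstrap`, `bagged_predict`, `NearestCentroid`) and the toy data are hypothetical, and the nearest-centroid learner is only a dependency-free stand-in for the SVM base classifier used in the study.

```python
import random
from collections import Counter, defaultdict


def balanced_bootstrap(X, y, rng):
    """Draw a bootstrap sample with equal class sizes (assumed scheme)."""
    by_class = defaultdict(list)
    for xi, yi in zip(X, y):
        by_class[yi].append(xi)
    # Every class contributes as many draws as the smallest class has samples.
    n_per_class = min(len(samples) for samples in by_class.values())
    Xb, yb = [], []
    for label, samples in by_class.items():
        for _ in range(n_per_class):
            Xb.append(rng.choice(samples))  # sampling with replacement
            yb.append(label)
    return Xb, yb


class NearestCentroid:
    """Stand-in base learner (NOT an SVM); anything with fit/predict works."""

    def fit(self, X, y):
        sums, counts = {}, Counter(y)
        for xi, yi in zip(X, y):
            acc = sums.setdefault(yi, [0.0] * len(xi))
            for j, value in enumerate(xi):
                acc[j] += value
        self.centroids_ = {c: [s / counts[c] for s in sums[c]] for c in sums}
        return self

    def predict(self, X):
        def nearest(x):
            return min(
                self.centroids_,
                key=lambda c: sum((a - b) ** 2 for a, b in zip(x, self.centroids_[c])),
            )
        return [nearest(x) for x in X]


def bagged_predict(X_train, y_train, X_test, make_learner, n_estimators=25, seed=0):
    """Train one learner per balanced bootstrap and combine by majority vote."""
    rng = random.Random(seed)
    votes = [Counter() for _ in X_test]
    for _ in range(n_estimators):
        Xb, yb = balanced_bootstrap(X_train, y_train, rng)
        model = make_learner().fit(Xb, yb)
        for counter, label in zip(votes, model.predict(X_test)):
            counter[label] += 1
    return [counter.most_common(1)[0][0] for counter in votes]


# Toy imbalanced data: 200 samples of class 0, only 10 of class 1.
data_rng = random.Random(42)
X_train = [[data_rng.gauss(0, 1), data_rng.gauss(0, 1)] for _ in range(200)]
X_train += [[data_rng.gauss(3, 1), data_rng.gauss(3, 1)] for _ in range(10)]
y_train = [0] * 200 + [1] * 10

preds = bagged_predict(X_train, y_train, [[0.0, 0.0], [3.0, 3.0]], NearestCentroid, seed=1)
print(preds)
```

To approximate the setup in the abstract more closely, the `make_learner` argument could be replaced by an SVM factory, for example `lambda: sklearn.svm.SVC(kernel="rbf")` if scikit-learn is available; the kernel and its parameters are assumptions here, since the abstract does not specify them.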
Pages: 375 - 384
Page count: 10
Related papers
50 in total
  • [1] BALANCED VS IMBALANCED TRAINING DATA: CLASSIFYING RAPIDEYE DATA WITH SUPPORT VECTOR MACHINES
    Ustuner, M.
    Sanli, F. B.
    Abdikan, S.
    [J]. XXIII ISPRS CONGRESS, COMMISSION VII, 2016, 41 (B7) : 379 - 384
  • [2] CLASSIFICATION OF HYPERSPECTRAL REMOTE SENSING IMAGES BY AN ENSEMBLE OF SUPPORT VECTOR MACHINES UNDER IMBALANCED DATA
    Eeti, Laxmi Narayana
    Buddhiraju, Krishna Mohan
    [J]. IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 2659 - 2661
  • [3] Boosting support vector machines for imbalanced data sets
    Wang, Benjamin X.
    Japkowicz, Nathalie
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2010, 25 (01) : 1 - 20
  • [4] Boosting support vector machines for imbalanced data sets
    Wang, Benjamin X.
    Japkowicz, Nathalie
    [J]. FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2008, 4994 : 38 - 47
  • [5] Boosting support vector machines for imbalanced data sets
    Benjamin X. Wang
    Nathalie Japkowicz
    [J]. Knowledge and Information Systems, 2010, 25 : 1 - 20
  • [6] Boosting Support Vector Machines for Imbalanced Microarray Data
    Pratama, Risky Frasetio Wahyu
    Purnami, Santi Wulan
    Rahayu, Santi Puteri
    [J]. INNS CONFERENCE ON BIG DATA AND DEEP LEARNING, 2018, 144 : 174 - 183
  • [7] Locally Linear Support Vector Machines for Imbalanced Data Classification
    Krawczyk, Bartosz
    Cano, Alberto
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT I, 2021, 12712 : 616 - 628
  • [8] Intuitionistic fuzzy twin support vector machines for imbalanced data
    Rezvani, Salim
    Wang, Xizhao
    [J]. NEUROCOMPUTING, 2022, 507 : 16 - 25
  • [9] Krein twin support vector machines for imbalanced data classification
    Jimenez-Castano, C.
    Alvarez-Meza, A.
    Cardenas-Pena, D.
    Orozco-Gutierrez, A.
    Guerrero-Erazo, J.
    [J]. PATTERN RECOGNITION LETTERS, 2024, 182 : 39 - 45
  • [10] Training data selection for support vector machines
    Wang, JG
    Neskovic, P
    Cooper, LN
    [J]. ADVANCES IN NATURAL COMPUTATION, PT 1, PROCEEDINGS, 2005, 3610 : 554 - 564