Neighborhood based sample and feature selection for SVM classification learning

被引:51
|
作者
He, Qiang [1 ]
Xie, Zongxia
Hu, Qinghua [1 ]
Wu, Congxin [1 ]
机构
[1] Harbin Inst Technol, Dept Math, Harbin 150001, Peoples R China
关键词
Support vector machine; Rough set; Neighborhood relation; Sample selection; Feature selection; SUPPORT VECTOR MACHINES; ROUGH SETS; SYSTEMS;
D O I
10.1016/j.neucom.2011.01.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Support vector machines (SVMs) are a class of popular classification algorithms for their high generalization ability. However, it is time-consuming to train SVMs with a large set of learning samples. Improving learning efficiency is one of most important research tasks on SVMs. It is known that although there are many candidate training samples in some learning tasks, only the samples near decision boundary which are called support vectors have impact on the optimal classification hyper-planes. Finding these samples and training SVMs with them will greatly decrease training time and space complexity. Based on the observation, we introduce neighborhood based rough set model to search boundary samples. Using the model, we firstly divide sample spaces into three subsets: positive region, boundary and noise. Furthermore, we partition the input features into four subsets: strongly relevant features, weakly relevant and indispensable features, weakly relevant and superfluous features, and irrelevant features. Then we train SVMs only with the boundary samples in the relevant and indispensable feature subspaces, thus feature and sample selection is simultaneously conducted with the proposed model. A set of experimental results show the model can select very few features and samples for training; in the mean time the classification performances are preserved or even improved. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:1585 / 1594
页数:10
相关论文
共 50 条
  • [41] Integrated framework for profit-based feature selection and SVM classification in credit scoring
    Maldonado, Sebastian
    Bravo, Cristian
    Lopez, Julio
    Perez, Juan
    [J]. DECISION SUPPORT SYSTEMS, 2017, 104 : 113 - 121
  • [42] Feature selection for label distribution learning based on neighborhood fuzzy rough sets
    Deng, Zhixuan
    Li, Tianrui
    Zhang, Pengfei
    Liu, Keyu
    Yuan, Zhong
    Deng, Dayong
    [J]. Applied Soft Computing, 2025, 169
  • [43] Feature Selection of Protein Structural Classification Using SVM Classifier
    Krajewski, Zbigniew
    Tkacz, Ewaryst
    [J]. BIOCYBERNETICS AND BIOMEDICAL ENGINEERING, 2013, 33 (01) : 47 - 61
  • [44] Beam search for feature selection in automatic SVM defect classification
    Gupta, P
    Doermann, D
    DeMenthon, D
    [J]. 16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL II, PROCEEDINGS, 2002, : 212 - 215
  • [45] Classification Algorithm of Parkinson's Disease Based on Convolutional Sparse Transfer Learning and Sample/Feature Parallel Selection
    Zhang, Xiaoheng
    Li, Yongming
    Wang, Pin
    Zeng, Xiaoping
    Yan, Fang
    Zhang, Yanling
    Cheng, Oumei
    [J]. Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2019, 41 (07): : 1641 - 1649
  • [46] AdaBoost for Feature Selection, Classification and Its Relation with SVM*, A Review
    Wang, Ruihu
    [J]. INTERNATIONAL CONFERENCE ON SOLID STATE DEVICES AND MATERIALS SCIENCE, 2012, 25 : 800 - 807
  • [47] Genetic Algorithm Assisted by a SVM for Feature Selection in Gait Classification
    Yeoh, TzeWei
    Zapotecas-Martinez, Saul
    Akimoto, Youhei
    Aguirre, Hernan
    Tanaka, Kiyoshi
    [J]. 2014 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS), 2014, : 191 - 195
  • [48] penalizedSVM: a R-package for feature selection SVM classification
    Becker, Natalia
    Werft, Wiebke
    Toedt, Grischa
    Lichter, Peter
    Benner, Axel
    [J]. BIOINFORMATICS, 2009, 25 (13) : 1711 - 1712
  • [49] Feature selection and classification improvement of Kinnow using SVM classifier
    Singh, Sukhpreet
    Malik, Kamal
    [J]. Measurement: Sensors, 2022, 24
  • [50] Classification Algorithm of Parkinson's Disease Based on Convolutional Sparse Transfer Learning and Sample/Feature Parallel Selection
    Zhang Xiaoheng
    Li Yongming
    Wang Pin
    Zeng Xiaoping
    Yan Fang
    Zhang Yanling
    Cheng Oumei
    [J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2019, 41 (07) : 1641 - 1649