Automatic feature scaling and selection for support vector machine classification with functional data

被引:6
|
作者
Jimenez-Cordero, Asuncion [1 ]
Maldonado, Sebastian [2 ,3 ]
机构
[1] Univ Malaga, Grp OASYS, Ada Byron Res Bldg, Malaga 29010, Spain
[2] Univ Chile, Sch Econ & Business, Dept Management Control & Informat Syst, Santiago, Chile
[3] Inst Sistemas Complejos Ingn ISCI, Santiago, Chile
关键词
Feature selection; Functional data; Support vector machines; Classification; Feature scaling; K-MEANS; CANCER CLASSIFICATION; PRINCIPAL-COMPONENTS; VARIABLE SELECTION; GENE SELECTION; FILTER METHOD; KERNEL; REGRESSION; ALGORITHMS; DESIGN;
D O I
10.1007/s10489-020-01765-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
FunctionalData Analysis (FDA) has become a very important field in recent years due to its wide range of applications. However, there are several real-life applications in which hybrid functional data appear, i.e., data with functional and static covariates. The classification of such hybrid functional data is a challenging problem that can be handled with the Support Vector Machine (SVM). Moreover, the selection of the most informative features may yield to drastic improvements in the classification rates. In this paper, an embedded feature selection approach for SVM classification is proposed, in which the isotropic Gaussian kernel is modified by associating a bandwidth to each feature. The bandwidths are jointly optimized with the SVM parameters, yielding an alternating optimization approach. The effectiveness of our methodology was tested on benchmark data sets. Indeed, the proposed method achieved the best average performance when compared to 17 other feature selection and SVM classification approaches. A comprehensive sensitivity analysis of the parameters related to our proposal was also included, confirming its robustness.
引用
收藏
页码:161 / 184
页数:24
相关论文
共 50 条