Cost-based feature selection for Support Vector Machines: An application in credit scoring

被引:106
|
作者
Maldonado, Sebastian [1 ]
Perez, Juan [1 ]
Bravo, Cristian [2 ]
机构
[1] Univ Los Andes, Fac Ingn & Ciencias Aplicadas, Monsenor Alvaro del Portillo 12455, Santiago, Chile
[2] Univ Southampton, Dept Decis Analyt & Risk, Southampton Business Sch, Univ Rd, Southampton SO17 1BJ, Hants, England
关键词
Analytics; Feature selection; Support Vector Machines; Mixed-integer programming; Credit scoring;
D O I
10.1016/j.ejor.2017.02.037
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
In this work we propose two formulations based on Support Vector Machines for simultaneous classification and feature selection that explicitly incorporate attribute acquisition costs. This is a challenging task for two main reasons: the estimation of the acquisition costs is not straightforward and may depend on multivariate factors, and the inter-dependence between variables must be taken into account for the modelling process since companies usually acquire groups of related variables rather than acquiring them individually. Mixed-integer linear programming models are proposed for constructing classifiers that constrain acquisition costs while classifying adequately. Experimental results using credit scoring datasets demonstrate the effectiveness of our methods in terms of predictive performance at a low cost compared to well-known feature selection approaches. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:656 / 665
页数:10
相关论文
共 50 条
  • [31] Feature Selection and Fast Training of Subspace Based Support Vector Machines
    Kitamura, Takuya
    Takeuchi, Syogo
    Abe, Shigeo
    [J]. 2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [32] Feature selection for hyperspectral data based on recursive support vector machines
    Zhang, Rui
    Ma, Jianwen
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2009, 30 (14) : 3669 - 3677
  • [33] Least squares support vector machines ensemble models for credit scoring
    Zhou, Ligang
    Lai, Kin Keung
    Yu, Lean
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (01) : 127 - 133
  • [34] Support vector machines for credit scoring: Extension to non standard cases
    Schebesch, KB
    Stecking, R
    [J]. INNOVATIONS IN CLASSIFICATION, DATA SCIENCE, AND INFORMATION SYSTEMS, 2005, : 498 - 505
  • [35] Credit Scoring by Fuzzy Support Vector Machines with a Novel Membership Function
    Shi, Jian
    Xu, Benlian
    [J]. JOURNAL OF RISK AND FINANCIAL MANAGEMENT, 2016, 9 (04)
  • [36] Several feature selection algorithms based on the discernibility of a feature subset and support vector machines
    Xie, Juan-Ying
    Xie, Wei-Xin
    [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2014, 37 (08): : 1704 - 1718
  • [37] Linear penalization support vector machines for feature selection
    Miranda, J
    Montoya, R
    Weber, R
    [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2005, 3776 : 188 - 192
  • [38] Comparison of Feature Selection Methods in Support Vector Machines
    Kim, Kwangsu
    Park, Changyi
    [J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2013, 26 (01) : 131 - 139
  • [39] An empirical study of feature selection in support vector machines
    Cao, L. J.
    Zhang Jingqing
    [J]. NEURAL NETWORK WORLD, 2006, 16 (05) : 433 - 453
  • [40] Feature selection for support vector machines with RBF kernel
    Quanzhong Liu
    Chihau Chen
    Yang Zhang
    Zhengguo Hu
    [J]. Artificial Intelligence Review, 2011, 36 : 99 - 115