Data Pre-Processing by Genetic Algorithms for Bankruptcy Prediction

被引:0
|
作者
Tsai, Chih-Fong [1 ]
Chou, Jui-Sheng
机构
[1] Natl Cent Univ, Dept Informat Management, Jhongli, Taiwan
关键词
Bankruptcy prediction; data mining; data pre-processing genetic algorithms; feature selection; data reduction; FEATURE-SELECTION; INSTANCE SELECTION; NEURAL-NETWORKS; REDUCTION; SENSITIVITY; BANKS;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Bankruptcy prediction has been approached by data mining techniques. However, since data pre-processing including feature selection or dimensionality reduction and data reduction is a very important stage for successful data mining, very few consider performing both tasks to examine the impact of data pre-processing on prediction performance. This paper applies genetic algorithms, which have been widely used for the data pre-processing tasks, for feature selection and data reduction over a public bankruptcy prediction dataset. In particular, the experiments based on different priorities of performing feature selection and data reduction are conducted. The results show that performing data reduction only can allow the support vector machine (SVM) classifier to provide the highest rate of prediction accuracy. However, executing both feature selection and data reduction with different priorities performs the same. They not only largely reduce the dataset size, but also keep the similar performance as SVM without data pre-processing.
引用
收藏
页码:1780 / 1783
页数:4
相关论文
共 50 条
  • [1] On Pre-processing Algorithms for Data Stream
    Duda, Piotr
    Jaworski, Maciej
    Pietruczuk, Lena
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, PT II, 2012, 7268 : 56 - 63
  • [2] Semantic Data Pre-Processing for Machine Learning Based Bankruptcy Prediction Computational Model
    Yerashenia, Natalia
    Bolotov, Alexander
    Chan, David
    Pierantoni, Gabriele
    [J]. 2020 IEEE 22ND CONFERENCE ON BUSINESS INFORMATICS (CBI 2020), VOL I - RESEARCH PAPERS, 2020, : 66 - 75
  • [3] Fuzzy-Genetic Algorithm for pre-processing the data at RTU
    Kumar, P
    Chandna, V
    Chandna, V
    Thomas, M
    [J]. 2004 IEEE POWER ENGINEERING SOCIETY GENERAL MEETING, VOLS 1 AND 2, 2004, : 1068 - 1068
  • [4] Fuzzy-genetic algorithm for pre-processing data at the RTU
    Kumar, P
    Chandna, VK
    Thomas, MS
    [J]. IEEE TRANSACTIONS ON POWER SYSTEMS, 2004, 19 (02) : 718 - 723
  • [5] Pre-processing for data clustering
    Frigui, H
    [J]. NAFIPS 2004: ANNUAL MEETING OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY, VOLS 1AND 2: FUZZY SETS IN THE HEART OF THE CANADIAN ROCKIES, 2004, : 967 - 972
  • [6] Pre-processing of the speech data
    不详
    [J]. ROBUST ADAPTATION TO NON-NATIVE ACCENTS IN AUTOMATIC SPEECH RECOGNITION, 2002, 2560 : 15 - 19
  • [7] A discriminative view of MRF pre-processing algorithms
    Wang, Chen
    Herrmann, Charles
    Zabih, Ramin
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5505 - 5514
  • [8] On effectiveness of pre-processing by clustering in prediction of CE technological data with ANNs
    Kasperkiewicz, J
    Alterman, D
    [J]. INTELLIGENT INFORMATION PROCESSING AND WEB MINING, 2003, : 261 - 266
  • [9] Genetic algorithm optimization for pre-processing and variable selection of spectroscopic data
    Jarvis, RM
    Goodacre, R
    [J]. BIOINFORMATICS, 2005, 21 (07) : 860 - 868
  • [10] Comparison of algorithms for pre-processing of SELDI-TOF mass spectrometry data
    Cruz-Marcelo, Alejandro
    Guerra, Rudy
    Vannucci, Marina
    Li, Yiting
    Lau, Ching C.
    Man, Tsz-Kwong
    [J]. BIOINFORMATICS, 2008, 24 (19) : 2129 - 2136