Ordering attributes for missing values prediction and data classification

Cited by: 0
Authors
Hruschka, ER [1]
Ebecken, NFF [1]
Affiliations
[1] Univ Fed Rio de Janeiro, COPPE, Rio De Janeiro, Brazil
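Source
DATA MINING III | 2002 / Vol. 6
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes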
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This work shows the application of the Bayesian K2 learning algorithm as a data classifier and preprocessor, with an attribute order searcher to improve the results. One of the aspects that influences K2 performance is the initial order of the attributes in the data set; however, in most cases the algorithm is applied without giving special attention to this preorder. The present work applies an empirical method to select an appropriate attribute order before applying the learning algorithm (K2). Afterwards, it performs the data preparation and classification tasks. In order to analyze the results, in a first step, the data classification is done without considering the initial order of the attributes. Thereafter, a good variable order is sought and, given this sequence of attributes, the classification is performed again. Once these results are obtained, the same algorithm is used to substitute missing values in the learning dataset in order to verify how the process works in this kind of task. The dataset used came from the standard classification problem databases of the UCI Machine Learning Repository. The results are empirically compared taking into consideration the mean and standard deviation.
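To illustrate why the attribute order matters for K2, the following is a minimal sketch of the standard greedy K2 search with the Cooper-Herskovits metric, written for this record and not taken from the paper. The function names `k2_log_score` and `k2`, the `max_parents` limit, and the tuple-based data layout are all assumptions made for illustration. The abstract does not describe the empirical order-selection method itself, so that step is not shown.

```python
from collections import Counter
from math import lgamma


def k2_log_score(data, child, parents, arity):
    """Log of the Cooper-Herskovits (K2) metric for one node and a parent set."""
    r = arity[child]
    n_ij = Counter()   # rows per parent configuration
    n_ijk = Counter()  # rows per (parent configuration, child value)
    for row in data:
        config = tuple(row[p] for p in parents)
        n_ij[config] += 1
        n_ijk[(config, row[child])] += 1
    score = 0.0
    for n in n_ij.values():
        score += lgamma(r) - lgamma(n + r)  # log((r-1)! / (n+r-1)!)
    for n in n_ijk.values():
        score += lgamma(n + 1)              # log(n!)
    return score


def k2(data, order, arity, max_parents=2):
    """Greedy K2: each node may only take parents that precede it in `order`."""
    parents = {v: [] for v in order}
    for pos, child in enumerate(order):
        best = k2_log_score(data, child, parents[child], arity)
        candidates = set(order[:pos])
        while candidates and len(parents[child]) < max_parents:
            top_score, top = max(
                (k2_log_score(data, child, parents[child] + [c], arity), c)
                for c in sorted(candidates)
            )
            if top_score <= best:
                break  # no candidate improves the score
            parents[child].append(top)
            candidates.remove(top)
            best = top_score
    return parents


# Toy data (hypothetical): attribute 1 copies attribute 0, attribute 2 is independent.
data = [(0, 0, 0), (0, 0, 1), (1, 1, 0), (1, 1, 1)] * 5
arity = [2, 2, 2]
print(k2(data, [0, 1, 2], arity))
print(k2(data, [1, 0, 2], arity))
```

In this toy case, swapping the first two attributes in the order reverses the direction of the learned arc between them, which is the kind of order sensitivity the abstract refers to.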
Pages: 593-601
Page count: 9
Related papers
50 in total
  • [1] Rough Set Analysis of Classification Data with Missing Values
    Szelag, Marcin
    Blaszczynski, Jerzy
    Slowinski, Roman
    ROUGH SETS, 2017, 10313 : 552 - 565
  • [2] Visualization of the critical patterns of missing values in classification data
    Wang, Hai
    Wang, Shouhong
    ADVANCES IN VISUAL INFORMATION SYSTEMS, 2007, 4781 : 267 - +
  • [3] Fast Imbalanced Classification of Healthcare Data with Missing Values
    Razzaghi, Talayeh
    Roderick, Oleg
    Safro, Ilya
    Marko, Nick
    2015 18TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2015, : 774 - 781
  • [4] Data decomposition and decision rule joining for classification of data with missing values
    Latkowski, R
    Mikolajczyk, M
    ROUGH SETS AND CURRENT TRENDS IN COMPUTING, 2004, 3066 : 254 - 263
  • [5] Data decomposition and decision rule joining for classification of data with missing values
    Latkowski, R
    Mikolajczyk, M
    TRANSACTIONS ON ROUGH SETS I, 2004, 3100 : 299 - 320
  • [6] Test-cost sensitive classification on data with missing values
    Yang, Q
    Ling, C
    Chai, XY
    Pan, R
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2006, 18 (05) : 626 - 638
  • [7] Classification of missing values in spatial data using spin models
    Zukovic, Milan
    Hristopulos, Dionissios T.
    PHYSICAL REVIEW E, 2009, 80 (01)
  • [8] Impact of imputation of missing values on classification error for discrete data
    Farhangfar, Alireza
    Kurgan, Lukasz
    Dy, Jennifer
    PATTERN RECOGNITION, 2008, 41 (12) : 3692 - 3705
  • [9] Simple data imputation for missing feature values in binary classification
    Chatterjee, Avishek
    Woodruff, Henry
    Vallieres, Martin
    Seuntjens, Jan
    MEDICAL PHYSICS, 2019, 46 (11) : 5378 - 5378
  • [10] Toward the Imputation and Prediction of Condition Monitoring Data with Missing Values
    Zhang, Di
    Li, Canbing
    Zhu, Jizhong
    2023 IEEE/IAS INDUSTRIAL AND COMMERCIAL POWER SYSTEM ASIA, I&CPS ASIA, 2023, : 996 - 1002