Ordering attributes for missing values prediction and data classification

Authors
Hruschka, ER [1]
Ebecken, NFF [1]
Affiliations
[1] Univ Fed Rio de Janeiro, COPPE, Rio De Janeiro, Brazil
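Source
DATA MINING III | 2002 / Vol. 6
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes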
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This work applies the Bayesian K2 learning algorithm as a data classifier and preprocessor, combined with an attribute-order search to improve the results. One factor that influences K2's performance is the initial order of the attributes in the data set; in most cases, however, the algorithm is applied without special attention to this ordering. The present work uses an empirical method to select an appropriate attribute order before applying the learning algorithm (K2), and then carries out the data preparation and classification tasks. To analyze the results, classification is first performed without considering the initial order of the attributes. A good variable order is then sought and, given that attribute sequence, classification is performed again. Once these results are obtained, the same algorithm is used to substitute missing values in the learning dataset in order to verify how the process works in this kind of task. The dataset used comes from the standard classification problem databases of the UCI Machine Learning Repository. The results are compared empirically, taking the mean and standard deviation into consideration.
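To illustrate why the attribute order matters, the following is a minimal sketch of the textbook K2 greedy search (Cooper-Herskovits metric in log form), not the authors' implementation. The function names `k2_log_score` and `k2`, the `max_parents` parameter, and the row-of-integers data layout are assumptions made for illustration only; the order-search and missing-value steps described in the abstract are not reproduced here.

```python
from collections import Counter
from math import lgamma


def k2_log_score(data, child, parents, arity):
    """Log of the Cooper-Herskovits (K2) metric for `child` given `parents`.

    `data` is a list of rows of discrete values; `arity[i]` is the number
    of distinct values attribute i can take (both layouts are assumptions).
    """
    joint = Counter()     # N_ijk: (parent configuration, child value) counts
    marginal = Counter()  # N_ij: parent configuration counts
    for row in data:
        config = tuple(row[p] for p in parents)
        joint[(config, row[child])] += 1
        marginal[config] += 1
    r = arity[child]
    # Unobserved parent configurations contribute zero, so they are skipped.
    score = sum(lgamma(r) - lgamma(n_ij + r) for n_ij in marginal.values())
    score += sum(lgamma(n_ijk + 1) for n_ijk in joint.values())
    return score


def k2(data, order, arity, max_parents=2):
    """Greedy K2 search: each node may only take parents that precede it in `order`."""
    parents = {node: [] for node in order}
    for pos, node in enumerate(order):
        best = k2_log_score(data, node, parents[node], arity)
        candidates = set(order[:pos])
        while candidates and len(parents[node]) < max_parents:
            scored = [
                (k2_log_score(data, node, parents[node] + [c], arity), c)
                for c in sorted(candidates)
            ]
            top_score, top = max(scored)
            if top_score <= best:
                break  # no candidate improves the score; stop adding parents
            best = top_score
            parents[node].append(top)
            candidates.remove(top)
    return parents
```

Because a node can only choose parents among its predecessors, running `k2` with two different orderings of the same attributes can return different structures, which is the sensitivity the abstract refers to.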
Pages: 593-601
Number of pages: 9
Related papers
50 in total
  • [31] Imputing Missing Values for Mixed Numeric and Categorical Attributes Based on Incomplete Data Hierarchical Clustering
    Feng, Xiaodong
    Wu, Sen
    Liu, Yanchi
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, 2011, 7091 : 414 - 424
  • [32] On-Line Classification of Data Streams with Missing Values Based on Reinforcement Learning
    Millan-Giraldo, Monica
    Javier Traver, Vicente
    Salvador Sanchez, J.
    PATTERN RECOGNITION AND IMAGE ANALYSIS: 5TH IBERIAN CONFERENCE, IBPRIA 2011, 2011, 6669 : 355 - 362
  • [33] Test-Cost Sensitive Classification on Data with Missing Values in the Limited Time
    Wan, Chang
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT I, 2010, 6276 : 501 - 510
  • [34] Multilevel Weighted Support Vector Machine for Classification on Healthcare Data with Missing Values
    Razzaghi, Talayeh
    Roderick, Oleg
    Safro, Ilya
    Marko, Nicholas
    PLOS ONE, 2016, 11 (05):
  • [35] MISSING VALUES IN MULTIVARIATE DATA
    KUZMA, JW
    BIOMETRICS, 1965, 21 (01) : 254 - &
  • [36] A new analytical framework for missing data imputation and classification with uncertainty: Missing data imputation and heart failure readmission prediction
    Hu, Zhiyong
    Du, Dongping
    PLOS ONE, 2020, 15 (09):
  • [37] RIEMANNIAN CLASSIFICATION OF EEG SIGNALS WITH MISSING VALUES
    Hippert-Ferrer, A.
    Mian, A.
    Bouchard, F.
    Pascal, F.
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 842 - 846
  • [38] Exploiting nearest neighbor data and fuzzy membership function to address missing values in classification
    Muludi, Kurnia
    Setianingsih, Revita
    Sholehurrohman, Ridho
    Junaidi, Akmal
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [39] Exploiting nearest neighbor data and fuzzy membership function to address missing values in classification
    Muludi K.
    Setianingsih R.
    Sholehurrohman R.
    Junaidi A.
    PeerJ Computer Science, 2024, 10
  • [40] K-ranked covariance based missing values estimation for microarray data classification
    Sehgal, MSB
    Gondal, I
    Dooley, L
    HIS'04: FOURTH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS, PROCEEDINGS, 2005, : 274 - 279