Mining Class Association Rules on Dataset with Missing Data

Cited by: 0
Authors
Hoang-Lam Nguyen [1, 2]
Nguyen, Loan T. T. [1, 2]
Kozierkiewicz, Adrianna [3]
Affiliations
[1] Int Univ, Sch Comp Sci & Engn, Ho Chi Minh City, Vietnam
[2] Vietnam Natl Univ, Ho Chi Minh City, Vietnam
[3] Wroclaw Univ Sci & Technol, Fac Comp Sci & Management, Wroclaw, Poland
Keywords
Missing value; Class association rules; Incomplete instance; Imputation method; SOLVING CONFLICTS; IMPUTATION
DOI
10.1007/978-3-030-73280-6_9
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Many real-world datasets contain missing values, which affect the efficiency of many classification algorithms. Missing values are often unavoidable and arise for many reasons, such as network problems or faulty physical devices. Some classification algorithms cannot work properly with incomplete datasets, so it is crucial to handle missing values. Imputation methods have been proven effective in handling missing data and thus significantly improve classification accuracy. There are two types of imputation methods, single and multiple, and both have their pros and cons: single imputation can lead to low accuracy, while multiple imputation is time-consuming. One high-accuracy algorithm proposed in this paper is called Classification based on Association Rules (CARs). Classification based on CARs has been proven to yield higher accuracy than other approaches. However, there has been no investigation of how to mine CARs from incomplete datasets. The goal of this work is to develop an effective imputation method for mining CARs on incomplete datasets. To show the impact of each imputation method, two cases of imputation are applied and compared in experiments.
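To make the idea in the abstract concrete, the following is a minimal sketch of how single imputation can change the statistics of a class association rule. It is not the method of the paper: mode imputation is assumed here as a stand-in for single imputation, and the toy data, the `None` missing-value marker, and the function names `impute_mode` and `rule_stats` are all hypothetical.

```python
from collections import Counter

MISSING = None  # assumed marker for a missing attribute value


def impute_mode(rows):
    """Single imputation: fill each missing value with its column's most frequent observed value."""
    n_cols = len(rows[0])
    modes = []
    for c in range(n_cols):
        observed = [r[c] for r in rows if r[c] is not MISSING]
        modes.append(Counter(observed).most_common(1)[0][0])
    return [
        tuple(modes[c] if r[c] is MISSING else r[c] for c in range(n_cols))
        for r in rows
    ]


def rule_stats(rows, labels, antecedent, cls):
    """Return (support, confidence) of the rule `antecedent -> cls`.

    antecedent maps a column index to the required value.
    """
    matched = [
        lab
        for r, lab in zip(rows, labels)
        if all(r[c] == v for c, v in antecedent.items())
    ]
    hits = sum(1 for lab in matched if lab == cls)
    support = hits / len(rows)
    confidence = hits / len(matched) if matched else 0.0
    return support, confidence


# Toy incomplete dataset: (outlook, humidity) with a class label per row.
rows = [
    ("sunny", "high"),
    ("sunny", None),
    ("rainy", "high"),
    (None, "low"),
]
labels = ["no", "no", "yes", "yes"]

filled = impute_mode(rows)
before = rule_stats(rows, labels, {0: "sunny"}, "no")
after = rule_stats(filled, labels, {0: "sunny"}, "no")
print("before imputation:", before)
print("after imputation: ", after)
```

In this toy case the rule `outlook = sunny -> no` keeps a support of 0.5, but its confidence drops from 1.0 to 2/3 once the missing outlook value is filled with the mode, which illustrates why the choice of imputation method matters for the rules that are mined.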
Pages: 104-116
Number of pages: 13
Related papers
50 in total
  • [21] The association rule algorithm with missing data in data mining
    Gerardo, BD
    Lee, J
    Lee, J
    Park, M
    Lee, M
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2004, PT 1, 2004, 3043 : 97 - 105
  • [22] Efficient strategies for parallel mining class association rules
    Dang Nguyen
    Bay Vo
    Bac Le
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (10) : 4716 - 4729
  • [23] Efficient mining of class association rules with the itemset constraint
    Dang Nguyen
    Nguyen, Loan T. T.
    Vo, Bay
    Pedrycz, Witold
    [J]. KNOWLEDGE-BASED SYSTEMS, 2016, 103 : 73 - 88
  • [24] Mining class association rules with Artificial Immune System
    Do, TD
    Hui, SC
    Fong, ACM
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 4, PROCEEDINGS, 2005, 3684 : 94 - 100
  • [25] Mining Normal and Abnormal Class-Association Rules
    Viet Phan-Luong
    [J]. 2013 IEEE 27TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (AINA), 2013, : 968 - 975
  • [26] Mining Class Association Rules for Word Sense Disambiguation
    Kobylinski, Lukasz
    [J]. SECURITY AND INTELLIGENT INFORMATION SYSTEMS, 2012, 7053 : 307 - 317
  • [27] Incremental Mining Class Association Rules Using Diffsets
    Nguyen, Loan T. T.
    Ngoc Thanh Nguyen
    [J]. ADVANCED COMPUTATIONAL METHODS FOR KNOWLEDGE ENGINEERING, 2015, 358 : 197 - 208
  • [28] An effective approach to mining exception class association rules
    Yu, F
    Jin, W
    [J]. WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2000, 1846 : 145 - 150
  • [29] Removal of duplicate rules for Association Rule Mining from multilevel dataset
    Chandanan, A. K.
    Shukla, M. K.
    [J]. INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING TECHNOLOGIES AND APPLICATIONS (ICACTA), 2015, 45 : 143 - 149
  • [30] Mining Multilevel Association Rules on RFID data
    Kim, Younghee
    Kim, Ungmo
    [J]. 2009 FIRST ASIAN CONFERENCE ON INTELLIGENT INFORMATION AND DATABASE SYSTEMS, 2009, : 46 - 50