Mining Class Association Rules on Dataset with Missing Data

被引:0
|
作者
Hoang-Lam Nguyen [1 ,2 ]
Nguyen, Loan T. T. [1 ,2 ]
Kozierkiewicz, Adrianna [3 ]
机构
[1] Int Univ, Sch Comp Sci & Engn, Ho Chi Minh City, Vietnam
[2] Vietnam Natl Univ, Ho Chi Minh City, Vietnam
[3] Wroclaw Univ Sci & Technol, Fac Comp Sci & Management, Wroclaw, Poland
关键词
Missing value; Class association rules; Incomplete instance; Imputation method; SOLVING CONFLICTS; IMPUTATION;
D O I
10.1007/978-3-030-73280-6_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many real-world datasets contain missing values, affecting the efficiency of many classification algorithms. However, this is an unavoidable error due to many reasons such as network problems, physical devices, etc. Some classification algorithms cannot work properly with incomplete dataset. Therefore, it is crucial to handle missing values. Imputation methods have been proven to be effective in handling missing data, thus, significantly improve classification accuracy. There are two types of imputation methods. Both have their pros and cons. Single imputation can lead to low accuracy while multiple imputation is time-consuming. One high-accuracy algorithm proposed in this paper is called Classification based on Association Rules (CARs). Classification based on CARs has been proven to yield higher accuracy compared to others. However, there is no investigation on how to mine CARs with incomplete datasets. The goal of this work is to develop an effective imputationmethod formining CARs on incomplete datasets. To show the impact of each imputation method, two cases of imputation will be applied and compared in experiments.
引用
收藏
页码:104 / 116
页数:13
相关论文
共 50 条
  • [31] Role of sampling in data mining for association rules
    Jeragh, M
    Mehrotra, KG
    [J]. IC-AI'2001: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS I-III, 2001, : 483 - 489
  • [32] Mining association rules from quantitative data
    Hong, Tzung-Pei
    Kuo, Chan-Sheng
    Chi, Sheng-Chai
    [J]. Intelligent Data Analysis, 1999, 3 (05): : 363 - 376
  • [33] Evaluation of sampling for data mining of association rules
    Zaki, MJ
    Parthasarathy, S
    Li, W
    Ogihara, M
    [J]. SEVENTH INTERNATIONAL WORKSHOP ON RESEARCH ISSUES IN DATA ENGINEERING, PROCEEDINGS: HIGH PERFORMANCE DATABASE MANAGEMENT FOR LARGE-SCALE APPLICATIONS, 1997, : 42 - 50
  • [34] Scalable parallel data mining for association rules
    Han, EH
    Karypis, G
    Kumar, V
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2000, 12 (03) : 337 - 352
  • [35] Mining association rules in big data with NGEP
    Yunliang Chen
    Fangyuan Li
    Junqing Fan
    [J]. Cluster Computing, 2015, 18 : 577 - 585
  • [36] Association rules for web data mining in WHOWEDA
    Madria, SK
    Raymond, C
    Bhowmick, S
    Mohania, M
    [J]. 2000 KYOTO INTERNATIONAL CONFERENCE ON DIGITAL LIBRARIES: RESEARCH AND PRACTICE, PROCEEDINGS, 2000, : 227 - 233
  • [37] Mining Association Rules in Seasonal Transaction Data
    Ayu, Sabrina Kusuma
    Surjandari, Isti
    Zulkarnain
    [J]. 2018 5TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE 2018), 2018, : 321 - 325
  • [38] MINING ASSOCIATION RULES FOR TRAJECTORIES OF SPATIOTEMPORAL DATA
    Hong, Hao
    [J]. 2011 INTERNATIONAL CONFERENCE ON INSTRUMENTATION, MEASUREMENT, CIRCUITS AND SYSTEMS (ICIMCS 2011), VOL 3: COMPUTER-AIDED DESIGN, MANUFACTURING AND MANAGEMENT, 2011, : 247 - 253
  • [39] Mining association rules in big data with NGEP
    Chen, Yunliang
    Li, Fangyuan
    Fan, Junqing
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (02): : 577 - 585
  • [40] Improvements in data mining association rules algorithm
    Li, Dai
    [J]. International Journal of Database Theory and Application, 2015, 8 (02): : 1 - 10