A multi-objective model for discovering high-quality knowledge based on data quality and prior knowledge

被引:16
|
作者
Liu, Qi [1 ,2 ]
Feng, Gengzhong [1 ,2 ]
Wang, Nengmin [1 ,2 ]
Tayi, Giri Kumar [3 ]
机构
[1] Xi An Jiao Tong Univ, Sch Management, 28 Xianning Rd, Xian 710049, Shaanxi, Peoples R China
[2] Xi An Jiao Tong Univ, Key Lab, Minist Educ Proc Control & Efficiency Engn, 28 Xianning Rd, Xian 710049, Shaanxi, Peoples R China
[3] SUNY Albany, Sch Business, Albany, NY 12222 USA
关键词
Data mining; Data quality; KDD; Decision making; Multi-objective algorithm; GENETIC ALGORITHM; ASSOCIATION RULES; OPTIMIZATION; INFORMATION; PROJECTION; EVOLUTIONARY; SELECTION; SUPPORT; KDD;
D O I
10.1007/s10796-016-9690-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Discovering knowledge from data means finding useful patterns in data, this process has increased the opportunity and challenge for businesses in the big data era. Meanwhile, improving the quality of the discovered knowledge is important for making correct decisions in an unpredictable environment. Various models have been developed in the past; however, few used both data quality and prior knowledge to control the quality of the discovery processes and results. In this paper, a multi-objective model of knowledge discovery in databases is developed, which aids the discovery process by utilizing prior process knowledge and different measures of data quality. To illustrate the model, association rule mining is considered and formulated as a multi-objective problem that takes into account data quality measures and prior process knowledge instead of a single objective problem. Measures such as confidence, support, comprehensibility and interestingness are used. A Pareto-based integrated multi-objective Artificial Bee Colony (IMOABC) algorithm is developed to solve the problem. Using well-known and publicly available databases, experiments are carried out to compare the performance of IMOABC with NSGA-II, MOPSO and Apriori algorithms, respectively. The computational results show that IMOABC outperforms NSGA-II, MOPSO and Apriori on different measures and it could be easily customized or tailored to be in line with user requirements and still generates high-quality association rules.
引用
收藏
页码:401 / 416
页数:16
相关论文
共 50 条
  • [41] A quality management model based on databases and knowledge
    Srdoč, Alira
    Bratko, Ivan
    Sluga, Alojzij
    [J]. Strojarstvo, 2011, 53 (02): : 137 - 145
  • [42] Discovering network community based on multi-objective optimization
    [J]. Huang, F.-L. (faliang.huang@gmail.com), 1600, Chinese Academy of Sciences (24):
  • [43] Interactive knowledge discovery and knowledge visualization for decision support in multi-objective optimization
    Smedberg, Henrik
    Bandaru, Sunith
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2023, 306 (03) : 1311 - 1329
  • [44] When Stakes are High and Guards are Low: High-quality Connections in Knowledge Creation
    Aarrestad, Martine
    Brondbo, Marthe Turnes
    Carlsen, Arne
    [J]. KNOWLEDGE AND PROCESS MANAGEMENT, 2015, 22 (02) : 88 - 98
  • [45] Discovering predictive quality of knowledge artifacts in organisational repositories
    Handzic, Meliha
    Li, Winnie
    [J]. Role of Quality in Knowledge Management, 2003, : 133 - 144
  • [46] Multi-objective learning of white box models with low quality data
    Villar, Jose R.
    Berzosa, Alba
    de la Cal, Enrique
    Sedano, Javier
    Garcia-Tamargo, Marco
    [J]. NEUROCOMPUTING, 2012, 75 (01) : 219 - 225
  • [47] High-quality science requires high-quality open data infrastructure
    Susanna-Assunta Sansone
    Patricia Cruse
    Mark Thorley
    [J]. Scientific Data, 5
  • [48] Adaptive Knowledge Distillation for High-Quality Unsupervised MRI Reconstruction With Model-Driven Priors
    Wu, Zhengliang
    Li, Xuesong
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (06) : 3571 - 3582
  • [49] Domain-Knowledge Enhanced GANs for High-Quality Trajectory Generation
    Jia, Jia
    Li, Linghui
    Qiu, Pengfei
    Cai, Binsi
    Kang, Xu
    Li, Ximing
    Li, Xiaoyong
    [J]. ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT IX, ICIC 2024, 2024, 14870 : 386 - 396
  • [50] Successful high-quality knowledge translation research: three case studies
    Majumdar, Sumit R.
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 2011, 64 (01) : 21 - 24