Apriori-Based Rule Generation in Incomplete Information Databases and Non-Deterministic Information Systems

被引:24
|
作者
Sakai, Hiroshi [1 ]
Wu, Mao [2 ]
Nakata, Michinori [3 ]
机构
[1] Kyushu Inst Technol, Fac Engn, Dept Basic Sci, Kitakyushu, Fukuoka 8048550, Japan
[2] Kyushu Inst Technol, Dept Integrated Syst Engn, Kitakyushu, Fukuoka 8048550, Japan
[3] Josai Int Univ, Fac Management & Informat Sci, Togane, Chiba 283, Japan
基金
日本学术振兴会;
关键词
Association and decision rules; Incomplete and non-deterministic information; Set and interval valued data sets; Rough set approximations; Apriori algorithm extensions and implementation; Soundness and completeness; POSSIBLE EQUIVALENCE-RELATIONS; ROUGH SET APPROACH; CONTINUOUS ATTRIBUTES; DECISION RULES; TABLES; GRANULATION; FOUNDATIONS; EXTRACTION; ALGORITHM;
D O I
10.3233/FI-2014-995
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper discusses issues related to incomplete information databases and considers a logical framework for rule generation. In our approach, a rule is an implication satisfying specified constraints. The term incomplete information databases covers many types of inexact data, such as non-deterministic information, data with missing values, incomplete information or interval valued data. In the paper, we start by defining certain and possible rules based on non-deterministic information. We use their mathematical properties to solve computational problems related to rule generation. Then, we reconsider the NIS-Apriori algorithm which generates a given implication if and only if it is either a certain rule or a possible rule satisfying the constraints. In this sense, NIS-Apriori is logically sound and complete. In this paper, we pay a special attention to soundness and completeness of the considered algorithmic framework, which is not necessarily obvious when switching from exact to inexact data sets. Moreover, we analyze different types of non-deterministic information corresponding to different types of the underlying attributes, i.e., value sets for qualitative attributes and intervals for quantitative attributes, and we discuss various approaches to construction of descriptors related to particular attributes within the rules' premises. An improved implementation of NIS-Apriori and some demonstrations of an experimental application of our approach to data sets taken from the UCI machine learning repository are also presented. Last but not least, we show simplified proofs of some of our theoretical results.
引用
收藏
页码:343 / 376
页数:34
相关论文
共 50 条
  • [1] On Apriori-based Rule Generation in SQL - A Case of the Deterministic Information System -
    Liu, Chenxi
    Sakai, Hiroshi
    Zhu, Xiaoxin
    Nakata, Michinori
    [J]. 2016 JOINT 8TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS (SCIS) AND 17TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (ISIS), 2016, : 178 - 182
  • [2] A framework of rough sets based rule generation in non-deterministic information systems
    Sakai, H
    [J]. FOUNDATIONS OF INTELLIGENT SYSTEMS, 2003, 2871 : 143 - 151
  • [3] On possible rules and apriori algorithm in non-deterministic information systems
    Sakai, Hiroshi
    Nakata, Michinori
    [J]. ROUGH SETS AND CURRENT TRENDS IN COMPUTING, PROCEEDINGS, 2006, 4259 : 264 - 273
  • [4] Possible rules and apriori algorithm in non-deterministic information systems: Part 2
    Sakai, Hiroshi
    Ishibashi, Ryuji
    Koba, Kazuhiro
    Nakata, Michinori
    [J]. ROUGH SETS, FUZZY SETS, DATA MINING AND GRANULAR COMPUTING, PROCEEDINGS, 2007, 4482 : 280 - +
  • [5] Rule Generation in Lipski's Incomplete Information Databases
    Sakai, Hiroshi
    Nakata, Michinori
    Slezak, Dominik
    [J]. ROUGH SETS AND CURRENT TRENDS IN COMPUTING, PROCEEDINGS, 2010, 6086 : 376 - +
  • [6] A Modal Logic for Non-deterministic Information Systems
    Khan, Md. Aquil
    [J]. LOGIC AND ITS APPLICATIONS, ICLA 2015, 2015, 8923 : 119 - 131
  • [7] Possible equivalence relations and their application to hypothesis generation in non-deterministic information systems
    Sakai, H
    [J]. TRANSACTIONS ON ROUGH SETS II: ROUGH SETS AND FUZZY SETS, 2004, 3135 : 82 - 106
  • [8] Rough Set-Based Information Dilution by Non-deterministic Information
    Sakai, Hiroshi
    Wu, Mao
    Yamaguchi, Naoto
    Nakata, Michinori
    [J]. ROUGH SETS, FUZZY SETS, DATA MINING, AND GRANULAR COMPUTING, 2013, 8170 : 55 - 66
  • [9] A Prototype System for Rule Generation in Lipski's Incomplete Information Databases
    Sakai, Hiroshi
    Nakata, Michinori
    Slezak, Dominik
    [J]. ROUGH SETS, FUZZY SETS, DATA MINING AND GRANULAR COMPUTING, RSFDGRC 2011, 2011, 6743 : 175 - 182
  • [10] A Proposal of Machine Learning by Rule Generation from Tables with Non-deterministic Information and Its Prototype System
    Sakai, Hiroshi
    Nakata, Michinori
    Watada, Junzo
    [J]. ROUGH SETS, 2017, 10313 : 535 - 551