Impurity measures in databases

被引:0
|
作者
Dan A. Simovici
Dana Cristofor
Laurentiu Cristofor
机构
[1] University of Massachusetts at Boston,
[2] Department of Computer Science,undefined
[3] Boston,undefined
[4] MA 02125,undefined
[5] USA (e-mail: {dsim,undefined
[6] dana,undefined
[7] laur}@cs.umb.edu) ,undefined
来源
Acta Informatica | 2002年 / 38卷
关键词
Impurity Measure; Relational Database; Functional Dependency; Relative Impurity; Mine Dataset;
D O I
暂无
中图分类号
学科分类号
摘要
We introduce purity dependencies as generalizations of functional dependencies in relational databases starting from the notion of impurity measure. The impurity measure of a subset of a set relative to a partition of that set and the relative impurity of two partitions allow us to define the relative impurity of two attribute sets of a table of a relational database and to introduce purity dependencies. We discuss properties of these dependencies that generalize similar properties of functional dependencies and we highlight their relevance for approximate classifications. Finally, an algorithm that mines datasets for these dependencies is presented.
引用
收藏
页码:307 / 324
页数:17
相关论文
共 50 条
  • [31] Approximated measures in construction of decision trees from large databases
    Nguyen, HS
    Nguyen, SH
    DESIGN AND APPLICATION OF HYBRID INTELLIGENT SYSTEMS, 2003, 104 : 595 - 604
  • [32] Effectiveness of retrieval in similarity searches of chemical databases: A review of performance measures
    Edgar, SJ
    Holliday, JD
    Willett, P
    JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2000, 18 (4-5): : 343 - 357
  • [33] A Study of Interestingness Measures for Knowledge Discovery in Databases-A Genetic Approach
    Garima, Goyal
    Vashishtha, Jyoti
    COMPUTATIONAL INTELLIGENCE IN DATA MINING, VOL 2, 2015, 32 : 69 - 79
  • [34] Reengineering Probabilistic Relational Databases with Fuzzy Probability Measures into XML Model
    Ma, Zongmin
    Li, Chengwei
    Yan, Li
    JOURNAL OF DATABASE MANAGEMENT, 2017, 28 (03) : 26 - 47
  • [35] A note on approximation measures for multi-valued dependencies in relational databases
    Giannella, C
    Robertson, E
    INFORMATION PROCESSING LETTERS, 2003, 85 (03) : 153 - 158
  • [36] When to publish measures of disproportionality derived from spontaneous reporting databases?
    de Boer, Anthonius
    BRITISH JOURNAL OF CLINICAL PHARMACOLOGY, 2011, 72 (06) : 909 - 911
  • [37] The influence of global constraints on similarity measures for time-series databases
    Kurbalija, Vladimir
    Radovanovic, Milos
    Geler, Zoltan
    Ivanovic, Mirjana
    KNOWLEDGE-BASED SYSTEMS, 2014, 56 : 49 - 67
  • [38] Effectiveness of retrieval in similarity searches of chemical databases: A review of performance measures
    Edgar, Sarah J.
    Holliday, John D.
    Willett, Peter
    2000, Elsevier Science Ltd, Exeter, United Kingdom (18) : 4 - 5
  • [39] Association mining in large databases: A re-examination of its measures
    Wu, Tianyi
    Chen, Yuguo
    Han, Jiawei
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2007, PROCEEDINGS, 2007, 4702 : 621 - +
  • [40] FUZZY RELATIONAL DATABASES: REPRESENTATIONAL ISSUES AND REDUCTION USING SIMILARITY MEASURES.
    Prade, Henri
    Testemale, Claudette
    Journal of the American Society for Information Science, 1987, 38 (02): : 118 - 126