Impurity measures in databases

被引:0
|
作者
Dan A. Simovici
Dana Cristofor
Laurentiu Cristofor
机构
[1] University of Massachusetts at Boston,
[2] Department of Computer Science,undefined
[3] Boston,undefined
[4] MA 02125,undefined
[5] USA (e-mail: {dsim,undefined
[6] dana,undefined
[7] laur}@cs.umb.edu) ,undefined
来源
Acta Informatica | 2002年 / 38卷
关键词
Impurity Measure; Relational Database; Functional Dependency; Relative Impurity; Mine Dataset;
D O I
暂无
中图分类号
学科分类号
摘要
We introduce purity dependencies as generalizations of functional dependencies in relational databases starting from the notion of impurity measure. The impurity measure of a subset of a set relative to a partition of that set and the relative impurity of two partitions allow us to define the relative impurity of two attribute sets of a table of a relational database and to introduce purity dependencies. We discuss properties of these dependencies that generalize similar properties of functional dependencies and we highlight their relevance for approximate classifications. Finally, an algorithm that mines datasets for these dependencies is presented.
引用
收藏
页码:307 / 324
页数:17
相关论文
共 50 条
  • [21] Global databases will yield reliable measures of global biodiversity
    Alroy, J
    PALEOBIOLOGY, 2003, 29 (01) : 26 - 29
  • [22] Predictive performance of comorbidity measures in administrative databases for diabetes cohorts
    Lix, Lisa M.
    Quail, Jacqueline
    Fadahunsi, Opeyemi
    Teare, Gary F.
    BMC HEALTH SERVICES RESEARCH, 2013, 13
  • [23] On similarity measures for cluster analysis in clinical laboratory examination databases
    Hirano, S
    Sun, XG
    Tsumoto, S
    26TH ANNUAL INTERNATIONAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE, PROCEEDINGS, 2002, : 1170 - 1175
  • [24] Dimensional inconsistency measures and postulates in spatio-temporal databases
    Grant J.
    Martinez M.V.
    Molinaro C.
    Parisi F.
    1600, AI Access Foundation (71): : 733 - 780
  • [25] STRATEGIES AND CONCERNS IN APPLYING DIVERSITY OR SIMILARITY MEASURES TO LARGE DATABASES
    MARTIN, YC
    BROWN, RD
    BURES, MG
    PAVLIK, PA
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1995, 210 : 126 - MEDI
  • [26] Formulation of Composite Discrete Measures for Estimating Uncertainties in Probabilistic Databases
    Bagchi, Susmit
    BEYOND DATABASES, ARCHITECTURES AND STRUCTURES: FACING THE CHALLENGES OF DATA PROLIFERATION AND GROWING VARIETY, 2018, 928 : 143 - 156
  • [27] A comparative analysis of two distance measures in color image databases
    Qian, G
    Sural, S
    Pramanik, S
    2002 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL I, PROCEEDINGS, 2002, : 401 - 404
  • [28] Predictive performance of comorbidity measures in administrative databases for diabetes cohorts
    Lisa M Lix
    Jacqueline Quail
    Opeyemi Fadahunsi
    Gary F Teare
    BMC Health Services Research, 13
  • [29] Different Similarity Measures to Identify Duplicate Records in Relational Databases
    Hadzic, Dulaga
    Sarajlic, Nermin
    Malkic, Jasmin
    2016 24TH TELECOMMUNICATIONS FORUM (TELFOR), 2016, : 790 - 793
  • [30] Evaluation of rule interestingness measures in medical knowledge discovery in databases
    Ohsaki, Miho
    Hidenao, Abe
    Tsumoto, Shusaku
    Yokoi, Hideto
    Yamaguchi, Takahira
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2007, 41 (03) : 177 - 196