Discovering Relaxed Functional Dependencies Based on Multi-Attribute Dominance

Cited by: 21
Authors
Caruccio, Loredana [1]
Deufemia, Vincenzo [1]
Naumann, Felix [2]
Polese, Giuseppe [1]
Affiliations
[1] Univ Salerno, Dept Comp Sci, I-84084 Fisciano, SA, Italy
[2] Univ Potsdam, Hasso Plattner Inst, D-14482 Potsdam, Germany
Keywords
Complexity theory; Approximation algorithms; Big Data; Distributed databases; Semantics; Lakes; Functional dependencies; data profiling; data cleansing; EFFICIENT DISCOVERY;
DOI
10.1109/TKDE.2020.2967722
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
With the advent of big data and data lakes, data are often integrated from multiple sources. Such integrated data are often of poor quality, due to inconsistencies, errors, and so forth. One way to check the quality of data is to infer functional dependencies (FDs). However, in many modern applications it might be necessary to extract properties and relationships that are not captured through FDs, due to the necessity to admit exceptions, or to consider similarity rather than equality of data values. Relaxed FDs (RFDs) have been introduced to meet these needs, but their discovery from data adds further complexity to an already complex problem, also due to the necessity of specifying similarity and validity thresholds. We propose Domino, a new discovery algorithm for RFDs that exploits the concept of dominance in order to derive similarity thresholds of attribute values while inferring RFDs. An experimental evaluation on real datasets demonstrates the discovery performance and the effectiveness of the proposed algorithm.
Pages: 3212-3228
Number of pages: 17
Related papers
50 in total
  • [21] A multi-attribute evaluating approach based on analysis and learning of attribute coordinate
    Feng, JL
    Lu, NZ
    APPLIED COMPUTATIONAL INTELLIGENCE, 2004, : 148 - 151
  • [22] Study of Multi-attribute Comprehensive Evaluation method based on Attribute Theory
    Xu Guanglin
    Liu Nianzu
    2014 INTERNATIONAL CONFERENCE ON MANAGEMENT OF E-COMMERCE AND E-GOVERNMENT (ICMECG), 2014, : 6 - 10
  • [23] An improved social attribute inference scheme based on multi-attribute correlation
    Yang, Yitong
    Lin, Qixiao
    Mao, Jian
    Liu, Lipei
    2021 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, INTERNET OF PEOPLE, AND SMART CITY INNOVATIONS (SMARTWORLD/SCALCOM/UIC/ATC/IOP/SCI 2021), 2021, : 370 - 377
  • [24] Rough approximation of a preference relation by a Multi-Attribute Stochastic Dominance for a reduced number of attributes
    Zaras, K
    RESEARCH AND PRACTICE IN MULTIPLE CRITERIA DECISION MAKING, 2000, 487 : 218 - 227
  • [25] A dominance relation-based decision making approach for multi-attribute decision making problems with incomplete information
    Li, Jin-Peng
    Yue, Chao-Yuan
    Li, Wu
    Kongzhi yu Juece/Control and Decision, 2013, 28 (02): : 229 - 234
  • [26] Multi-attribute auctions with different types of attributes: Enacting properties in multi-attribute auctions
    Pla, Albert
    Lopez, Beatriz
    Murillo, Javier
    Maudet, Nicolas
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (10) : 4829 - 4843
  • [27] MULTI-ATTRIBUTE DECISION MAKING BASED ON LABEL SEMANTICS
    Lawry, Jonathan
    He, Hongmei
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2008, 16 : 69 - 86
  • [28] Multi-attribute structural optimization based on conjoint analysis
    Amarchinta, Hemanth K.
    Grandhi, Ramana V.
    AIAA JOURNAL, 2008, 46 (04) : 884 - 893
  • [29] Multi-attribute Regret-Based Dynamic Pricing
    Jumadinova, Janyl
    Dasgupta, Prithviraj
    AGENT-MEDIATED ELECTRONIC COMMERCE AND TRADING AGENT DESIGN AND ANALYSIS, 2010, 44 : 73 - 87
  • [30] Ciphertext Query Method Based on Multi-Attribute Keywords
    Li, Haoyu
    Zhang, Longjun
    Li, Qingpeng
    Gao, Zhiqiang
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, INFORMATION MANAGEMENT AND NETWORK SECURITY, 2016, 47 : 341 - 344