Discovering Relaxed Functional Dependencies Based on Multi-Attribute Dominance

被引:21
|
作者
Caruccio, Loredana [1 ]
Deufemia, Vincenzo [1 ]
Naumann, Felix [2 ]
Polese, Giuseppe [1 ]
机构
[1] Univ Salerno, Dept Comp Sci, I-84084 Fisciano, SA, Italy
[2] Univ Potsdam, Hasso Plattner Inst, D-14482 Potsdam, Germany
关键词
Complexity theory; Approximation algorithms; Big Data; Distributed databases; Semantics; Lakes; Functional dependencies; data profiling; data cleansing; EFFICIENT DISCOVERY;
D O I
10.1109/TKDE.2020.2967722
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the advent of big data and data lakes, data are often integrated from multiple sources. Such integrated data are often of poor quality, due to inconsistencies, errors, and so forth. One way to check the quality of data is to infer functional dependencies (fds). However, in many modern applications it might be necessary to extract properties and relationships that are not captured through fds, due to the necessity to admit exceptions, or to consider similarity rather than equality of data values. Relaxed fds (rfds) have been introduced to meet these needs, but their discovery from data adds further complexity to an already complex problem, also due to the necessity of specifying similarity and validity thresholds. We propose Domino, a new discovery algorithm for rfds that exploits the concept of dominance in order to derive similarity thresholds of attribute values while inferring rfds. An experimental evaluation on real datasets demonstrates the discovery performance and the effectiveness of the proposed algorithm.
引用
收藏
页码:3212 / 3228
页数:17
相关论文
共 50 条
  • [31] Pricing the cloud based on multi-attribute auction mechanism
    Alomari, Zakaria
    AL-Oudat, Mohammad
    Alkhushayni, Suboh
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (01): : 629 - 654
  • [32] An explainable multi-attribute decision model based on argumentation
    Zhong, Qiaoting
    Fan, Xiuyi
    Luo, Xudong
    Toni, Francesca
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 117 : 42 - 61
  • [33] A Multi-attribute Based Methodology for Vehicle Detection & Identification
    Elangovan, Vinayak
    Alsaidi, Bashir
    Shirkhodaie, Amir
    SIGNAL PROCESSING, SENSOR FUSION, AND TARGET RECOGNITION XXII, 2013, 8745
  • [34] Pricing the cloud based on multi-attribute auction mechanism
    Zakaria Alomari
    Mohammad AL-Oudat
    Suboh Alkhushayni
    Cluster Computing, 2024, 27 : 629 - 654
  • [35] A Framework for Component Selection Based on Multi-attribute Evaluations
    Gupta, Stueti
    RESEARCH INTO DESIGN FOR COMMUNITIES, VOL 2: PROCEEDINGS OF ICORD 2017, 2017, 66 : 509 - 516
  • [36] Image Ranking and Retrieval based on Multi-Attribute Queries
    Siddiquie, Behjat
    Feris, Rogerio S.
    Davis, Larry S.
    2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, : 801 - 808
  • [37] Discovering Reliable Approximate Functional Dependencies
    Mandros, Panagiotis
    Boley, Mario
    Vreeken, Jilles
    KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 355 - 363
  • [38] Coevolutionary computation based iterative multi-attribute auctions
    Nie, Lanshun
    Xu, Xiaofei
    Zhan, Dechen
    ENTERPRISE INTEROPERABILITY III: NEW CHALLENGES AND INDUSTRIAL APPROACHES, 2008, : 461 - 469
  • [39] Multi-attribute Synthetic Evaluation Based on Cloud Model
    Song, Jianjun
    Du, Xiaoping
    Wang, Qiao
    Wang, Chao
    Li, Xin
    Jiang, Yu
    2011 INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND NEURAL COMPUTING (FSNC 2011), VOL I, 2011, : 196 - 198
  • [40] Negotiation model based on GAI multi-attribute dependence
    School of Information Engineering, Zhengzhou University, Zhengzhou 450052, China
    不详
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2008, 21 (05): : 569 - 576