Explaining Missing Data in Graphs: A Constraint-based Approach

被引:1
|
作者
Song, Qi [1 ]
Lin, Peng [2 ]
Ma, Hanchao [3 ]
Wu, Yinghui [3 ,4 ]
机构
[1] Amazon Com, Bellevue, WA 98004 USA
[2] Washington State Univ, Pullman, WA 99164 USA
[3] Case Western Reserve Univ, Cleveland, OH 44106 USA
[4] Pacific Northwest Natl Lab, Richland, WA 99352 USA
关键词
Graphs; Data Constraints; Data Provenance;
D O I
10.1109/ICDE51399.2021.00131
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper introduces a constraint-based approach to clarify missing values in graphs. Our method capitalizes on a set Sigma of graph data constraints. An explanation is a sequence of operational enforcement of Sigma towards the recovery of interested yet missing data (e.g., attribute values, edges). We show that constraint-based approach helps us to understand not only why a value is missing, but also how to recover the missing value. We study Sigma-explanation problem, which is to compute the optimal explanations with guarantees on the informativeness and conciseness. We show the problem is in Delta(P)(2) for established graph data constraints such as graph keys and graph association rules. We develop an efficient bidirectional algorithm to compute optimal explanations, without enforcing Sigma on the entire graph. We also show our algorithm can be easily extended to support graph refinement within limited time, and to explain missing answers. Using real-world graphs, we experimentally verify the effectiveness and efficiency of our algorithms.
引用
收藏
页码:1476 / 1487
页数:12
相关论文
共 50 条
  • [21] Generating Test Data for Killing SQL Mutants: A Constraint-based Approach
    Shah, Shetal
    Sudarshan, S.
    Kajbaje, Suhas
    Patidar, Sandeep
    Gupta, Bhanu Pratap
    Vira, Devang
    IEEE 27TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2011), 2011, : 1175 - 1186
  • [22] A Constraint-Based Approach to Automatic Data Partitioning for Distributed Memory Execution
    Lee, Wonchan
    Papadakis, Manolis
    Slaughter, Elliott
    Aiken, Alex
    PROCEEDINGS OF SC19: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2019,
  • [23] A fuzzy constraint-based approach to data reconciliation in material flow analysis
    Dubois, Didier
    Fargier, Helene
    Ababou, Meissa
    Guyonnet, Dominique
    INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2014, 43 (08) : 787 - 809
  • [24] GRIP: Constraint-based Explanation of Missing Answers for Graph Queries
    Song, Qi
    Ma, Hanchao
    Lin, Peng
    Wu, Yinghui
    SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021, : 2779 - 2783
  • [25] A constraint-based approach to linguistic interfaces INTRODUCTION
    Bilbiie, Gabriela
    LINGUISTICAE INVESTIGATIONES, 2020, 43 (01): : 1 - 22
  • [26] Scheduling of maintenance work: A constraint-based approach
    Palma, J.
    de Leon Hijes, F. C. Gomez
    Campos Martinez, M.
    Guillen Carceles, L.
    EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (04) : 2963 - 2973
  • [27] Resyllabification in Standard Arabic: A Constraint-Based Approach
    Btoosh, Mousa A.
    SKASE JOURNAL OF THEORETICAL LINGUISTICS, 2019, 16 (02): : 2 - 24
  • [28] A Constraint-based Approach for Generating Transformation Patterns
    Cherif, Asma
    Imine, Abdessamad
    ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2015, (201): : 48 - 62
  • [29] A constraint-based approach to understanding the composition of skill
    Lewis, RL
    Vera, A
    Howes, A
    PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE ON COGNITIVE MODELING, 2004, : 148 - 153
  • [30] A constraint-based approach to table structure derivation
    Hurst, M
    SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 911 - 915