Explaining Missing Data in Graphs: A Constraint-based Approach

Cited by: 1
Authors
Song, Qi [1]
Lin, Peng [2]
Ma, Hanchao [3]
Wu, Yinghui [3, 4]
Affiliations
[1] Amazon Com, Bellevue, WA 98004 USA
[2] Washington State Univ, Pullman, WA 99164 USA
[3] Case Western Reserve Univ, Cleveland, OH 44106 USA
[4] Pacific Northwest Natl Lab, Richland, WA 99352 USA
Keywords
Graphs; Data Constraints; Data Provenance;
DOI
10.1109/ICDE51399.2021.00131
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Subject classification code
0812
Abstract
This paper introduces a constraint-based approach to explaining missing values in graphs. Our method capitalizes on a set $\Sigma$ of graph data constraints. An explanation is a sequence of operational enforcements of $\Sigma$ that leads to the recovery of missing data of interest (e.g., attribute values, edges). We show that a constraint-based approach helps us understand not only why a value is missing, but also how to recover it. We study the $\Sigma$-explanation problem, which is to compute optimal explanations with guarantees on informativeness and conciseness. We show that the problem is in $\Delta_2^P$ for established graph data constraints such as graph keys and graph association rules. We develop an efficient bidirectional algorithm that computes optimal explanations without enforcing $\Sigma$ on the entire graph. We also show that our algorithm can be easily extended to support graph refinement within limited time and to explain missing answers. Using real-world graphs, we experimentally verify the effectiveness and efficiency of our algorithms.
Pages: 1476 - 1487
Number of pages: 12
Related papers
50 in total
  • [1] Constraint-based approach to semistructured data
    Hacid, MS
    Toumani, F
    Elmagarmid, AK
    FUNDAMENTA INFORMATICAE, 2001, 47 (1-2) : 53 - 73
  • [2] A Constraint-Based Approach to Solving Games on Infinite Graphs
    Beyene, Tewodros A.
    Chaudhuri, Swarat
    Popeea, Corneliu
    Rybalchenko, Andrey
    ACM SIGPLAN NOTICES, 2014, 49 (01) : 221 - 233
  • [3] A constraint-based approach to guarded algebraic data types
    Simonet, Vincent
    Pottier, Francois
    ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 2007, 29 (01):
  • [4] A Constraint-Based Approach to Context
    van Wissen, Arlette
    Kamphorst, Bart
    van Eijk, Rob
    MODELING AND USING CONTEXT, CONTEXT 2013, 2013, 8175 : 171 - 184
  • [5] Constraint-based Pattern Mining in Dynamic Graphs
    Robardet, Celine
    2009 9TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2009: 950 - 955
  • [6] Constraint-Based Data Transformation for Integration: An Information System Approach*
    Shahriar, Sumon
    Liu, Jixue
    International Journal of Database Theory and Application, 2010, 3 (01): 53 - 61
  • [7] Data locality and parallelism optimization using a constraint-based approach
    Ozturk, Ozcan
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2011, 71 (02) : 280 - 287
  • [8] Handling hybrid and missing data in constraint-based causal discovery to study the etiology of ADHD
    Sokolova E.
    von Rhein D.
    Naaijen J.
    Groot P.
    Claassen T.
    Buitelaar J.
    Heskes T.
    International Journal of Data Science and Analytics, 2017, 3 (2) : 105 - 119
  • [9] An Interactive Approach to Constraint-Based Visualizations
    Lucas, Wendy
    HUMAN INTERFACE AND THE MANAGEMENT OF INFORMATION: INFORMATION AND KNOWLEDGE DESIGN AND EVALUATION, PT I, 2014, 8521 : 54 - 63
  • [10] FasTLInC: a constraint-based tracing approach
    Gates, AQ
    Mondragon, O
    JOURNAL OF SYSTEMS AND SOFTWARE, 2002, 63 (03) : 241 - 258