Answering queries over incomplete data stream histories

被引:4
|
作者
Gray, Alasdair J. G. [1 ]
Nutt, Werner [2 ]
Williams, M. Howard [1 ]
机构
[1] Heriot Watt Univ, Sch Math & Comp Sci, Edinburgh, Midlothian, Scotland
[2] Free Univ Bozen Bolzano, Fac Comp Sci, Bolzano, Italy
基金
英国工程与自然科学研究理事会;
关键词
Data handling; Networking;
D O I
10.1108/17440080710829216
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Purpose - Distributed data streams are an important topic of current research. In such a setting, data values will be missed, e.g. due to network errors. This paper aims to allow this incompleteness to be detected and overcome with either the user not being affected or the effects of the incompleteness being reported to the user. Design/methodology/approach - A model for representing the incomplete information has been developed that captures the information that is known about the missing data. Techniques for query answering involving certain and possible answer sets have been extended so that queries over incomplete data stream histories can be answered. Findings - It is possible to detect when a distributed data stream is missing one or more values. When such data values are missing there will be some information that is known about the data and this is stored in an appropriate format. Even when the available data are incomplete, it is possible in some circumstances to answer a query completely. When this is not possible, additional meta-data can be returned to inform the user of the effects of the incompleteness. Research limitations/implications - The techniques and models proposed in this paper have only been partially implemented. Practical implications - The proposed system is general and can be applied wherever there is a need to query the history of distributed data streams. The work in this paper enables the system to answer queries when there are missing values in the data. Originality/value - This paper presents a general model of how to detect, represent, and answer historical queries over incomplete distributed data streams.
引用
收藏
页码:41 / 60
页数:20
相关论文
共 50 条
  • [31] Efficient processing of multiple continuous skyline queries over a data stream
    Lee, Yu Won
    Lee, Ki Yong
    Kim, Myoung Ho
    INFORMATION SCIENCES, 2013, 221 : 316 - 337
  • [32] Answering Natural Language Queries over Linked Data Graphs: A Distributional Semantics Approach
    Freitas, Andre
    de Faria, Fabricio F.
    O'Riain, Sean
    Curry, Edward
    SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, 2013, : 1107 - 1108
  • [33] Answering Cross-Source Keyword Queries over Deep Web Data Sources
    Wang, Fan
    Agrawal, Gagan
    CONTEMPORARY COMPUTING, 2011, 168 : 475 - 490
  • [34] Diversified Top-k Answering of Cypher Queries over Large Data Graphs
    Mahfoud, Houari
    2023 20TH ACS/IEEE INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, AICCSA, 2023,
  • [35] Stream Frequency over Interval Queries
    Ben Basat, Ran
    Friedman, Roy
    Shahout, Rana
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2018, 12 (04): : 433 - 445
  • [36] Data exchange: On the complexity of answering queries with inequalities
    Madry, A
    INFORMATION PROCESSING LETTERS, 2005, 94 (06) : 253 - 257
  • [37] Answering XML queries by means of data summaries
    Baralis, Elena
    Garza, Paolo
    Quintarelli, Elisa
    Tanca, Letizia
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2007, 25 (03)
  • [39] Answering Complex Structured Queries over the Deep Web
    Wang, Fan
    Agrawal, Gagan
    PROCEEDINGS OF THE 15TH INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM (IDEAS '11), 2011, : 115 - 123
  • [40] FUKG: answering flexible queries over knowledge graphs
    Yague, Jose Felix
    Huitzil, Ignacio
    Bobed, Carlos
    Bobillo, Fernando
    ELECTRONIC LIBRARY, 2024, 42 (03): : 368 - 392