Evaluating aggregate operations over imprecise data

被引:42
|
作者
Chen, ALP [1 ]
Chiu, JS [1 ]
Tseng, FSC [1 ]
机构
[1] YUAN ZE INST TECHNOL,DEPT INFORMAT MANAGEMENT,CHUNGLI 32026,TAIWAN
关键词
relational databases; null values; partial values; scalar aggregates; aggregate functions; graph theory;
D O I
10.1109/69.494166
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imprecise data in databases were originally denoted as null values, which represent the meaning of ''values unknown at present.'' More generally, a partial value corresponds to a finite set of possible values for an attribute in which exactly one of the values is the ''true'' value. In this paper, we define a set of extended aggregate operations, namely sum, average, count, maximum, and minimum, which can be applied to an attribute containing partial values. Two types of aggregate operators are considered: scalar aggregates and aggregate functions. We study the properties of the aggregate operations and develop efficient algorithms for count, maximum and minimum. However, for sum and average, we point out that in general it takes exponential time complexity to do the computations.
引用
收藏
页码:273 / 284
页数:12
相关论文
共 50 条
  • [21] Fuzzy nonlinear programming approach for evaluating and ranking process yields with imprecise data
    Wu, Chien-Wei
    Liao, Mou-Yuan
    [J]. FUZZY SETS AND SYSTEMS, 2014, 246 : 142 - 155
  • [22] Efficient aggregate computation over data streams
    Nagaraj, Kanthi
    Naidu, K. V. M.
    Rastogi, Rajeev
    Satkin, Scott
    [J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 1382 - +
  • [23] Multi-dimensional Probabilistic Regression over Imprecise Data Streams
    Gao, Ran
    Xie, Xike
    Zou, Kai
    Pedersen, Torben Bach
    [J]. PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 3317 - 3326
  • [24] Evaluating Genomic Big Data Operations on SciDB and Spark
    Cattani, Simone
    Ceri, Stefano
    Kaitoua, Abdulrahman
    Pinoli, Pietro
    [J]. WEB ENGINEERING (ICWE 2017), 2017, 10360 : 482 - 493
  • [25] Divide-and-Aggregate Learning for Evaluating Performance on Unlabeled Data
    Miao, Shuyu
    Liu, Jian
    Zheng, Lin
    Jin, Hong
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 19, 2024, : 21395 - 21402
  • [26] Evaluating the sensitivity of a trophic mass-balance model (Ecopath) to imprecise data inputs
    Essington, Timothy E.
    [J]. CANADIAN JOURNAL OF FISHERIES AND AQUATIC SCIENCES, 2007, 64 (04) : 628 - 637
  • [27] Distribution of Operations and Data Arrays over Processors
    N. A. Likhoded
    [J]. Programming and Computer Software, 2003, 29 : 173 - 179
  • [28] Distribution of operations and data arrays over processors
    Likhoded, NA
    [J]. PROGRAMMING AND COMPUTER SOFTWARE, 2003, 29 (03) : 173 - 179
  • [29] Processing Probabilistic range query over imprecise data based on quality of result
    Zhang, W
    Li, JZ
    [J]. ADVANCED WEB AND NETWORK TECHNOLOGIES, AND APPLICATIONS, PROCEEDINGS, 2006, 3842 : 441 - 449
  • [30] Natural language aggregate query over RDF data
    Hu, Xin
    Dang, Depeng
    Yao, Yingting
    Ye, Luting
    [J]. INFORMATION SCIENCES, 2018, 454 : 363 - 381