Empirical studies to assess the understandability of data warehouse schemas using structural metrics

Cited by: 24
Authors
Serrano, Manuel Angel [1]
Calero, Coral [1]
Sahraoui, Houari A. [1, 2]
Piattini, Mario [1]
Affiliations
[1] Univ Castilla La Mancha, Dept Informat Technol & Syst, Alarcos Res Grp, E-13071 Ciudad Real, Spain
[2] Univ Montreal, Dept Informat & Rech Operat, Montreal, PQ H3C 3J7, Canada
Keywords
data warehouse; quality; metrics; empirical studies
DOI
10.1007/s11219-007-9030-7
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Data warehouses are powerful tools for making better and faster decisions in organizations where information is an asset of primary importance. Due to the complexity of data warehouses, metrics and procedures are required to continuously assure their quality. This article describes an empirical study and a replication aimed at investigating the use of structural metrics as indicators of the understandability and, by extension, the cognitive complexity of data warehouse schemas. More specifically, a four-step analysis is conducted: (1) check whether, individually and collectively, the considered metrics can be correlated with schema understandability using classical statistical techniques, (2) evaluate whether understandability can be predicted by case similarity using the case-based reasoning technique, (3) determine, for each level of understandability, the subsets of metrics that are important by means of a classification technique, and (4) assess, by means of a probabilistic technique, the degree of participation of each metric in the understandability prediction. The results obtained show that although a linear model is a good approximation of the relation between structure and understandability, the associated coefficients are not significant enough. Additionally, classification analyses reveal, respectively, that prediction can be achieved by considering structure similarity, that extracted classification rules can be used to estimate the magnitude of understandability, and that some metrics, such as the number of fact tables, have more impact than others.
Pages: 79-106
Number of pages: 28
Related papers
50 in total
  • [21] Using transmissibility data to assess structural damage
    Mares, C
    Ruotolo, R
    Surace, C
    DAMAS 99: DAMAGE ASSESSMENT OF STRUCTURES, 1999, 167-1 : 236 - 245
  • [22] PREDICTING LANDSCAPE STRUCTURAL METRICS USING ASTER SATELLITE DATA
    Ozdemir, Ibrahim
    Mert, Ahmet
    Senturk, Ozdemir
    JOURNAL OF ENVIRONMENTAL ENGINEERING AND LANDSCAPE MANAGEMENT, 2012, 20 (02) : 168 - 176
  • [23] Interpreting the results of studies using latent variable models to assess data quality: an empirical example using confirmatory factor analysis
    Adam C. Carle
    Quality & Quantity, 2010, 44 : 483 - 497
  • [24] Interpreting the results of studies using latent variable models to assess data quality: an empirical example using confirmatory factor analysis
    Carle, Adam C.
    QUALITY & QUANTITY, 2010, 44 (03) : 483 - 497
  • [25] Using a Clinical Data Warehouse to Calculate and Present Key Metrics for the Radiology Department: Implementation and Performance Evaluation
    Liman, Leon
    May, Bernd
    Fette, Georg
    Krebs, Jonathan
    Puppe, Frank
    JMIR MEDICAL INFORMATICS, 2023, 11
  • [26] EVALUATION OF ALTERNATE TUMOR METRICS AND CUT-POINTS FOR RESPONSE CATEGORIZATION USING THE RECIST 1.1 DATA WAREHOUSE
    Mandrekar, S. J.
    An, M.
    Meyers, J.
    Grothey, A.
    Bogaerts, J.
    Sargent, D. J.
    ANNALS OF ONCOLOGY, 2012, 23 : 535 - 535
  • [27] Evaluation of Alternate Categorical Tumor Metrics and Cut Points for Response Categorization Using the RECIST 1.1 Data Warehouse
    Mandrekar, Sumithra J.
    An, Ming-Wen
    Meyers, Jeffrey
    Grothey, Axel
    Bogaerts, Jan
    Sargent, Daniel J.
    JOURNAL OF CLINICAL ONCOLOGY, 2014, 32 (08) : 841 - +
  • [28] Investigation of metrics to assess vascular flow modifications for diverter device designs using hydrodynamics and angiographic studies
    Ionita, Ciprian N.
    Bednarek, Daniel R.
    Rudin, Stephen
    MEDICAL IMAGING 2012: BIOMEDICAL APPLICATIONS IN MOLECULAR, STRUCTURAL, AND FUNCTIONAL IMAGING, 2012, 8317
  • [29] Replicating medication trend studies using ad hoc information extraction in a clinical data warehouse
    Georg Dietrich
    Jonathan Krebs
    Leon Liman
    Georg Fette
    Maximilian Ertl
    Mathias Kaspar
    Stefan Störk
    Frank Puppe
    BMC Medical Informatics and Decision Making, 19
  • [30] Replicating medication trend studies using ad hoc information extraction in a clinical data warehouse
    Dietrich, Georg
    Krebs, Jonathan
    Liman, Leon
    Fette, Georg
    Ertl, Maximilian
    Kaspar, Mathias
    Stoerk, Stefan
    Puppe, Frank
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (1)