Empirical validation of structural metrics for predicting understandability of conceptual schemas for data warehouse

被引:6
|
作者
Kumar, Manoj [1 ]
Gosain, Anjana [2 ]
Singh, Yogesh [3 ]
机构
[1] Ambedkar Inst Adv Commun Technol & Res, Dept Comp Sci & Engn, Delhi, India
[2] Guru Gobind Singh Indraprastha Univ, Univ Sch Informat Technol, Delhi, India
[3] Maharaja Sayajirao Univ Baroda, Vadodara, Gujarat, India
关键词
Data warehouse quality; Multidimensional conceptual model; Metrics; Logistic regression analysis; Naive Bayes Classifier; Decision Trees;
D O I
10.1007/s13198-013-0159-4
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Data warehouse (DW) quality depends on its data models (conceptual, logical and physical model). Multidimensional (MD) modeling has been widely recognized as the backbone of data modeling for DW. Recently, some of the authors have proposed a set of structural metrics to assess quality of MD conceptual models. They have found the significant relationship between metrics and understandability of DW conceptual schemas using various correlation analysis techniques such as Spearman's, Pearson etc. However, advanced statistical and machine learning methods have not been used to predict effect of each metric on understandability. In this paper, our focus is on predicting the effect of structural metrics on understandability of conceptual schemas using (i) statistical method (logistic regression analysis) that include univariate and multivariate analysis, (ii) machine learning methods (Decision Trees, Naive Bayesian Classifier) and (iii) compare the performance of these statistical and machine learning methods. The results obtained show that some of the metrics individually have a significant effect on the understandability of MD conceptual schema. Further, few of the metrics have a significant combined effect on understandability of conceptual schema. The results also show that the performance of Naive Bayesian Classifier prediction method is better than logistic regression analysis and Decision Trees methods.
引用
收藏
页码:291 / 306
页数:16
相关论文
共 37 条
  • [1] Empirical studies to assess the understandability of data warehouse schemas using structural metrics
    Serrano, Manuel Angel
    Calero, Coral
    Sahraoui, Houari A.
    Piattini, Mario
    [J]. SOFTWARE QUALITY JOURNAL, 2008, 16 (01) : 79 - 106
  • [2] Empirical studies to assess the understandability of data warehouse schemas using structural metrics
    Manuel Angel Serrano
    Coral Calero
    Houari A. Sahraoui
    Mario Piattini
    [J]. Software Quality Journal, 2008, 16 : 79 - 106
  • [3] Metrics for data warehouse conceptual models understandability
    Serrano, Manuel
    Trujillo, Juan
    Calero, Coral
    Piattini, Mario
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2007, 49 (08) : 851 - 870
  • [4] Empirical study to predict the understandability of requirements schemas of data warehouse using requirements metrics
    Singh, Tanu
    Kumar, Manoj
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT ENGINEERING INFORMATICS, 2021, 9 (04) : 329 - 354
  • [5] Comparing the understandability of alternative data warehouse schemas: An empirical study
    Schuff, David
    Corral, Karen
    Turetken, Ozgur
    [J]. DECISION SUPPORT SYSTEMS, 2011, 52 (01) : 9 - 20
  • [6] Investigating structural metrics for understandability prediction of data warehouse multidimensional schemas using machine learning techniques
    Gosain A.
    Singh J.
    [J]. Innovations in Systems and Software Engineering, 2018, 14 (1) : 59 - 80
  • [7] An Experiment towards Metrics Validation for Data Warehouse Conceptual Models
    Dahiya, Naveen
    Sangwan, Neeti
    Bhatnagar, Vishal
    Singh, Manjeet
    [J]. 2014 5TH INTERNATIONAL CONFERENCE CONFLUENCE THE NEXT GENERATION INFORMATION TECHNOLOGY SUMMIT (CONFLUENCE), 2014, : 116 - 123
  • [8] Investigating Requirements Completeness Metrics for Requirements Schemas Using Requirements Engineering Approach of Data Warehouse: A Formal and Empirical Validation
    Tanu Singh
    Manoj Kumar
    [J]. Arabian Journal for Science and Engineering, 2022, 47 : 9527 - 9546
  • [9] Investigating Requirements Completeness Metrics for Requirements Schemas Using Requirements Engineering Approach of Data Warehouse: A Formal and Empirical Validation
    Singh, Tanu
    Kumar, Manoj
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (08) : 9527 - 9546
  • [10] Empirical validation of metrics for conceptual models of data warehouses
    Serrano, M
    Calero, C
    Trujillo, J
    Luján-Mora, S
    Piattini, M
    [J]. ADVANCED INFORMATION SYSTEMS ENGINEERING, PROCEEDINGS, 2004, 3084 : 506 - 520