Empirical analysis of metrics for object oriented multidimensional model of data warehouse using unsupervised machine learning techniques

被引:4
|
作者
Sabharwal S. [1 ]
Nagpal S. [1 ]
Aggarwal G. [2 ]
机构
[1] Computer Science and Engineering Department, NSIT, Dwarka, New Delhi
[2] Information Technology Department, NSIT, Dwarka, New Delhi
关键词
Data warehouse quality; Hierarchical clustering; K-means clustering; Metrics; Understandability;
D O I
10.1007/s13198-016-0508-1
中图分类号
学科分类号
摘要
Data Warehouse provides the foundation for businesses to take informed decisions for day to day operations and making future strategy. Since the role is so pivotal to the growth and success of the business, its quality is very critical. Conceptual models of data warehouses give us a great insight into the quality of the developed system during the early stages of the design process. Researchers have proposed a number of metrics to evaluate the quality of these object oriented multidimensional models. Further, for these metrics to be used in practice, empirical evaluation is crucial. There are a number of propositions in literature that work towards empirical validation of metrics. But most of them are either restricted to statistical techniques or supervised machine learning techniques. In order to empirically validate the metrics, we need to get user responses for a number of schemas and take down observations to quantify model quality aspects like understandability, efficiency etc. This can result in personal biases, errors and random outliers which impacts the evaluation model. In this paper, we have made a first attempt to assess the relationship between the object oriented multidimensional data warehouse structural metrics and understandability of its models by using unsupervised machine learning techniques with the aid of a data warehouse quality expert. The results indicate that the proposed metrics have a strong relationship with understandability and inturn quality of the data warehouse conceptual models and the unsupervised techniques are able to identify this relationship with high degree of accuracy. © 2016, The Society for Reliability Engineering, Quality and Operations Management (SREQOM), India and The Division of Operation and Maintenance, Lulea University of Technology, Sweden.
引用
收藏
页码:703 / 715
页数:12
相关论文
共 50 条
  • [1] Empirical validation of metrics for object oriented multidimensional model for data warehouse
    Gosain A.
    Mann S.
    [J]. International Journal of System Assurance Engineering and Management, 2014, 5 (3) : 262 - 275
  • [2] Empirical Investigation of Metrics for Multidimensional Model of Data Warehouse Using Support Vector Machine
    Sabharwal, Sangeeta
    Nagpal, Sushama
    Aggarwal, Gargi
    [J]. 2015 4TH INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (ICRITO) (TRENDS AND FUTURE DIRECTIONS), 2015,
  • [3] Theoretical Validation of Object-Oriented Metrics for Data Warehouse Multidimensional Model
    Gosain, Anjana
    Gupta, Rakhi
    [J]. PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON FRONTIERS IN INTELLIGENT COMPUTING: THEORY AND APPLICATIONS, FICTA 2016, VOL 1, 2017, 515 : 681 - 691
  • [4] AN OBJECT ORIENTED MULTIDIMENSIONAL MODEL FOR DATA WAREHOUSE
    Gosain, Anjana
    Mann, Suman
    [J]. FOURTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2011): COMPUTER VISION AND IMAGE ANALYSIS: PATTERN RECOGNITION AND BASIC TECHNOLOGIES, 2012, 8350
  • [5] Investigating structural metrics for understandability prediction of data warehouse multidimensional schemas using machine learning techniques
    Gosain A.
    Singh J.
    [J]. Innovations in Systems and Software Engineering, 2018, 14 (1) : 59 - 80
  • [6] Object oriented multidimensional model for a data warehouse with operators
    Gosain Dr., Anjana
    Mann, Suman
    [J]. International Journal of Database Theory and Application, 2010, 3 (04): : 35 - 40
  • [7] Empirical Validation of Object Oriented Data Warehouse Design Quality Metrics
    Gupta, Jaya
    Gosain, Anjana
    Nagpal, Sushama
    [J]. ADVANCES IN COMPUTING AND INFORMATION TECHNOLOGY, 2011, 198 : 320 - +
  • [8] Empirical Analysis of Object Oriented Metrics using Dimensionality Reduction Techniques
    Sharma, Rashmi
    Sabharwal, Sangeeta
    Nagpal, Sushma
    [J]. 2014 RECENT ADVANCES AND INNOVATIONS IN ENGINEERING (ICRAIE), 2014,
  • [9] Multidimensional Modeling for Data Warehouse Using Object Oriented Approach
    Gosain, Anjana
    Khatri, Sunil Kumar
    Manna, Suman
    [J]. 2014 3RD INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (ICRITO) (TRENDS AND FUTURE DIRECTIONS), 2014,
  • [10] Theoretical and Empirical Validation of Coupling Metrics for Object-Oriented Data Warehouse Design
    Aggarwal, Gargi
    Sabharwal, Sangeeta
    Nagpal, Sushama
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2018, 43 (02) : 675 - 691