Applying Mechanisms of Data Profiling for Assuring Data Quality in the software: a first approach

Cited by: 0
Authors
Guerra-Garcia, Cesar [1]
Perez-Gonzalez, Hector G. [1]
Martinez-Perez, Francisco [1]
Juarez-Ramirez, Reyes [2]
Jimenez, Samantha [2]
Affiliations
[1] Autonomous Univ San Luis Potosi, San Luis Potosi, Mexico
[2] Autonomous Univ Baja California, Mexicali, Baja California, Mexico
Keywords
process improvement; data quality dimensions; INFORMATION;
DOI
10.1109/CONISOFT58849.2023.00023
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification code
081202; 0835;
Abstract
For any organization, it is necessary to satisfy its business objectives in addition to using data to implement organizational processes; for that reason, it is indispensable to know how well these data satisfy the preset quality requirements. These requirements can be expressed by means of data quality dimensions. In some scenarios, data quality assessment models and methodologies require mechanisms to control and monitor the level of data quality. Thus, proposing a methodology that combines a qualitative diagnosis of the data quality dimensions with data profiling techniques to measure some of these dimensions will have a significant impact on processes for the appropriate use of data. The main contribution of this paper is a methodology that assesses data quality by diagnosing its dimensions through a survey and data profiling techniques.
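To make the idea of measuring a data quality dimension through data profiling concrete, here is a minimal sketch. It is not the authors' methodology: the choice of dimensions (completeness and uniqueness), the function name `profile_column`, and the sample values are all assumptions made for illustration only.

```python
from collections import Counter


def profile_column(values):
    """Profile one column (a list of values) for two illustrative dimensions.

    Assumption: None and the empty string count as missing values.
    """
    total = len(values)
    present = [v for v in values if v is not None and v != ""]
    distinct = Counter(present)

    # Completeness: share of entries that are not missing.
    completeness = len(present) / total if total else 0.0
    # Uniqueness: share of distinct values among the non-missing entries.
    uniqueness = len(distinct) / len(present) if present else 0.0

    return {"completeness": completeness, "uniqueness": uniqueness}


# Hypothetical sample column with one duplicate and two missing entries.
emails = ["a@x.org", "b@x.org", None, "a@x.org", ""]
print(profile_column(emails))
```

Scores such as these could then be compared against preset quality requirements, which is the kind of check the abstract describes in general terms.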
Pages: 108-115
Number of pages: 8