A statistical and signal processing based system for data quality management

被引:0
|
作者
Dantas, ACH [1 ]
de Seixas, JM [1 ]
Diniz, FB [1 ]
Ferreira, TN [1 ]
机构
[1] UFRJ, COPPE, BR-21945970 Rio De Janeiro, Brazil
关键词
data quality; quality control system; statistical tests; signal processing; techniques; multidimensional data; financial time series;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Research in data quality is getting more important as databases in research centers and companies get larger. Therefore, developing new mechanisms to discover knowledge in large data bases is as urgent as finding ways to measure and assure the quality of the data. This work describes the development of a Quality Control System, intended to operate on dynamic and large data bases. Techniques used vary from standard statistical tests to signal processing methodology, such as filtering, wavelet transformation and neural processing. Data used for this work consist of asocial database and of time series for five years of stock data for companies present in the SP500 index. Generalization tests show that feed forward neural networks represent a suitable tool for tracking pre-processed (filtered) financial series, and can be used to define a corridor inside which one may consider new data as acceptable. For this data, we were also able to develop a model for the distribution of the differences between consecutive days, which can be combined to neural processing for data acceptation. Tests performed on the social data allowed us to identify probabilistic density functions for a set of variables, making it possible to create a objective test of data quality assessment.
引用
收藏
页码:209 / 218
页数:10
相关论文
共 50 条
  • [21] Join Query Processing in Data Quality Management
    Yue, Mingliang
    Gao, Hong
    Shi, Shengfei
    Wang, Hongzhi
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2016, 2016, 9645 : 329 - 342
  • [22] Net analyte signal based statistical quality control
    Skibsted, ETS
    Boelens, HFM
    Westerhuis, JA
    Smilde, AK
    Broad, NW
    Rees, DR
    Witte, DT
    ANALYTICAL CHEMISTRY, 2005, 77 (22) : 7103 - 7114
  • [23] Six Sigma Quality Management System and Design of Risk-based Statistical Quality Control
    Westgard, James O.
    Westgard, Sten A.
    CLINICS IN LABORATORY MEDICINE, 2017, 37 (01) : 85 - +
  • [24] Product quality data management system based on ORACLE database
    Fang, Xi-feng
    Zhao, Liang-cai
    Huadong Chuanbo Gongye Xueyuan Xuebao/Journal of East China Shipbuilding Institute, 2000, 14 (02): : 57 - 60
  • [25] Model-based statistical signal processing using electromagnetic induction data for landmine detection and classification
    Collins, L
    Gao, P
    Tantum, S
    2001 IEEE WORKSHOP ON STATISTICAL SIGNAL PROCESSING PROCEEDINGS, 2001, : 162 - 165
  • [26] EM Image Fusion Algorithm Based on Statistical Signal Processing
    Jin Xue-bo
    Zhang Qiao-ling
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, : 3548 - 3551
  • [27] Research on Energy Management System based Data-Processing Middleware
    Zhu, Honglei
    Xue, Xiao
    Song, Junke
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE OF MANAGEMENT SCIENCE AND INFORMATION SYSTEM, VOLS 1-4, 2009, : 220 - 222
  • [28] Statistical relationship-based data quality checking mechanism of information system
    Hu, JQ
    Hu, HP
    Lu, ZD
    PROCEEDINGS OF '97 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE & ENGINEERING, 1997, : 105 - 110
  • [29] MICROSIRIS - STATISTICAL AND DATA MANAGEMENT SOFTWARE SYSTEM
    SONQUIST, JA
    AMERICAN STATISTICIAN, 1990, 44 (03): : 236 - 239
  • [30] Data Acquisition and Signal Processing System for CW Radar
    Kumawat, Harish Chandra
    Raj, Arockia Bazil
    2019 5TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, CONTROL AND AUTOMATION (ICCUBEA), 2019,