Data Quality in Secondary Data Analysis: A Case Study of Ecological Data using a Semiotic-based Approach

被引:1
|
作者
Kwiatkowska, Mila [1 ]
Pouw, Frank [2 ]
机构
[1] Thompson Rivers Univ, Dept Comp Sci, 805 TRU Way, Kamloops, BC, Canada
[2] Thompson Rivers Univ, Dept Environm Sci, 805 TRU Way, Kamloops, BC, Canada
关键词
Data Quality; Secondary Data Analysis; Ecological Data; Semiotics;
D O I
10.5220/0007978403770384
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Data quality problems are widespread in secondary data when they are used for data warehousing and data mining. This paper advocates a broad semiotic approach to data quality. The main premises of this expanded semiotic framework are (1) data represent some reality, (2) data are created and interpreted by humans in a communication process, (3) data are used for specific purposes by humans, and (4) data cannot be created, interpreted and used without knowledge. Thus, the semiotic-based approach to data quality in secondary data analysis has four aspects: (1) representational, (3) communicational, (3) pragmatic, and (4) knowledge-based. To illustrate these four characteristics, we present a case study of ecological data analysis used in the creation of an ornithological data warehouse. We discuss the temporal data (ecological notion of time), spatial ecological data (communication processes and protocols used for data collection), and bioacoustic data processing (domain knowledge needed for the specification of data provenance).
引用
收藏
页码:377 / 384
页数:8
相关论文
共 50 条
  • [1] A Semiotic Approach to Data Quality
    Krogstie, John
    [J]. ENTERPRISE, BUSINESS-PROCESS AND INFORMATION SYSTEMS MODELING, BPMDS 2013, 2013, 147 : 395 - 410
  • [2] Colombian Case Study for the Analysis of Open Data Government: a Data Quality Approach
    Osorio Sanabria, Mariutsi Alexandra
    Amaya Fernandez, Ferney Orlando
    Gonzalez Zabala, Mayda Patricia
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON THEORY AND PRACTICE OF ELECTRONIC GOVERNANCE (ICEGOV2018), 2018, : 389 - 394
  • [3] Capturing Enterprise Data Integration Challenges Using a Semiotic Data Quality Framework
    Krogstie, John
    [J]. BUSINESS & INFORMATION SYSTEMS ENGINEERING, 2015, 57 (01) : 27 - 36
  • [4] Capturing Enterprise Data Integration Challenges Using a Semiotic Data Quality Framework
    John Krogstie
    [J]. Business & Information Systems Engineering, 2015, 57 : 27 - 36
  • [5] Quality Data for Data Mining and Data Mining for Quality Data: A Constraint Based Approach in XML
    Shahriar, Md. Sumon
    Anam, Sarawat
    [J]. 2008 SECOND INTERNATIONAL CONFERENCE ON FUTURE GENERATION COMMUNICATION AND NETWORKING SYMPOSIA, VOLS 1-5, PROCEEDINGS, 2008, : 142 - +
  • [6] Determinants of Data Quality Dimensions for Assessing Highway Infrastructure Data Using Semiotic Framework
    Krishna, Chenchu Murali
    Ruikar, Kirti
    Jha, Kumar Neeraj
    [J]. BUILDINGS, 2023, 13 (04)
  • [7] ELDER MISTREATMENT AND RELATIONSHIP QUALITY: SECONDARY DATA ANALYSIS USING DATA FROM THE NSHAP
    Xue, Wei-Lin
    Hass, Zach
    Liu, Pi Ju
    [J]. INNOVATION IN AGING, 2022, 6 : 660 - 660
  • [8] SECONDARY ANALYSIS OF SURVEY DATA WITH ECOLOGICAL VARIABLES
    VALKONEN, T
    [J]. SOCIAL SCIENCE INFORMATION SUR LES SCIENCES SOCIALES, 1969, 8 (06): : 33 - 36
  • [9] Assessing data quality in Open Data: A case study
    John Ferney, Mahecha Moyano
    Nicolas Estefan, Lopez Beltran
    John Alexander, Velandia Vega
    [J]. 2017 CONGRESO INTERNACIONAL DE INNOVACION Y TENDENCIAS EN INGENIERIA (CONIITI), 2017,
  • [10] Data quality: a case study
    Herzog, Thomas N.
    Scheuren, Fritz J.
    Winkler, William E.
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2011, 3 (01): : 12 - 21