Data Quality in Secondary Data Analysis: A Case Study of Ecological Data using a Semiotic-based Approach

被引:1
|
作者
Kwiatkowska, Mila [1 ]
Pouw, Frank [2 ]
机构
[1] Thompson Rivers Univ, Dept Comp Sci, 805 TRU Way, Kamloops, BC, Canada
[2] Thompson Rivers Univ, Dept Environm Sci, 805 TRU Way, Kamloops, BC, Canada
关键词
Data Quality; Secondary Data Analysis; Ecological Data; Semiotics;
D O I
10.5220/0007978403770384
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Data quality problems are widespread in secondary data when they are used for data warehousing and data mining. This paper advocates a broad semiotic approach to data quality. The main premises of this expanded semiotic framework are (1) data represent some reality, (2) data are created and interpreted by humans in a communication process, (3) data are used for specific purposes by humans, and (4) data cannot be created, interpreted and used without knowledge. Thus, the semiotic-based approach to data quality in secondary data analysis has four aspects: (1) representational, (3) communicational, (3) pragmatic, and (4) knowledge-based. To illustrate these four characteristics, we present a case study of ecological data analysis used in the creation of an ornithological data warehouse. We discuss the temporal data (ecological notion of time), spatial ecological data (communication processes and protocols used for data collection), and bioacoustic data processing (domain knowledge needed for the specification of data provenance).
引用
收藏
页码:377 / 384
页数:8
相关论文
共 50 条
  • [21] A MODEL APPROACH TO THE INCREASE OF DATA QUALITY OF CLINICAL ROUTINE DATA IN THE CONTEXT OF SECONDARY USE
    Holzer, K.
    Dorda, W.
    Duftschmid, G.
    Nachbagauer, A.
    Strasser, N.
    Wrba, T.
    Gall, W.
    [J]. EHEALTH2012 - HEALTH INFORMATICS MEETS EHEALTH - VON DER WISSENSCHAFT ZUR ANWENDUNG UND ZURUCK: MOBILE HEALTH & CARE - GESUNDHEITSVORSORGE IMMER UND UBERALL, 2012, : 205 - 210
  • [22] Evaluation of Data Quality of Multisite Electronic Health Record Data for Secondary Analysis
    Nobles, Alicia L.
    Vilankar, Ketki
    Wu, Hao
    Barnes, Laura E.
    [J]. PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 2612 - 2620
  • [23] A Semiotic Approach to Data in Medical Decision Making
    Kwiatkowska, Mila
    McMillan, Linda
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE 2010), 2010,
  • [24] A Data Privacy Preservation Approach and a Case Study in Data Analytics
    Salhi, Abdellah
    [J]. 4TH INNOVATION AND ANALYTICS CONFERENCE & EXHIBITION (IACE 2019), 2019, 2138
  • [25] Affordances of narrative and numerical data: A social-semiotic approach to data use
    Fjortoft, Henning
    Lai, Mei Kuin
    [J]. STUDIES IN EDUCATIONAL EVALUATION, 2021, 69
  • [27] Secondary Data Analysis: Using existing data to answer new questions
    Kelly, Michelle M.
    Martin-Peters, Tasha
    Farber, Jessica Strohm
    [J]. JOURNAL OF PEDIATRIC HEALTH CARE, 2024, 38 (04) : 616 - 619
  • [28] Data quality analysis using data-mining methods
    Windheuser, U
    [J]. OPERATIONS RESEARCH PROCEEDINGS 1999, 2000, : 304 - 310
  • [29] Application of data mining to the analysis of meteorological data for air quality prediction: A case study in Shenyang
    Zhao, Chang
    Song, Guojun
    [J]. 2ND INTERNATIONAL CONFERENCE ON MATERIALS SCIENCE, ENERGY TECHNOLOGY AND ENVIRONMENTAL ENGINEERING (MSETEE 2017), 2017, 81
  • [30] Secondary analysis of case-control data
    Jiang, YN
    Scott, AJ
    Wild, CJ
    [J]. STATISTICS IN MEDICINE, 2006, 25 (08) : 1323 - 1339