Developing a Workflow to Identify Inconsistencies in Volunteered Geographic Information: A Phenological Case Study

被引:14
|
作者
Mehdipoor, Hamed [1 ]
Zurita-Milla, Raul [1 ]
Rosemartin, Alyssa [2 ,3 ]
Gerst, Katharine L. [2 ,3 ]
Weltzin, Jake F. [3 ,4 ]
机构
[1] Univ Twente, Fac GeoInformat Sci & Earth Observat ITC, NL-7500 AE Enschede, Netherlands
[2] Univ Arizona, Coll Agr & Life Sci, Sch Nat Resources & Environm, Tucson, AZ USA
[3] USA Natl Phenol Network, Natl Coordinating Off, Tucson, AZ USA
[4] US Geol Survey, Tucson, AZ USA
来源
PLOS ONE | 2015年 / 10卷 / 10期
关键词
MODEL; QUALITY; NETWORK; SCIENCE; ONSET; PLANT; TOOL;
D O I
10.1371/journal.pone.0140811
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Recent improvements in online information communication and mobile location-aware technologies have led to the production of large volumes of volunteered geographic information. Widespread, large-scale efforts by volunteers to collect data can inform and drive scientific advances in diverse fields, including ecology and climatology. Traditional workflows to check the quality of such volunteered information can be costly and time consuming as they heavily rely on human interventions. However, identifying factors that can influence data quality, such as inconsistency, is crucial when these data are used in modeling and decision- making frameworks. Recently developed workflows use simple statistical approaches that assume that the majority of the information is consistent. However, this assumption is not generalizable, and ignores underlying geographic and environmental contextual variability that may explain apparent inconsistencies. Here we describe an automated workflow to check inconsistency based on the availability of contextual environmental information for sampling locations. The workflow consists of three steps: (1) dimensionality reduction to facilitate further analysis and interpretation of results, (2) model-based clustering to group observations according to their contextual conditions, and (3) identification of inconsistent observations within each cluster. The workflow was applied to volunteered observations of flowering in common and cloned lilac plants (Syringa vulgaris and Syringa x chinensis) in the United States for the period 1980 to 2013. About 97% of the observations for both common and cloned lilacs were flagged as consistent, indicating that volunteers provided reliable information for this case study. Relative to the original dataset, the exclusion of inconsistent observations changed the apparent rate of change in lilac bloom dates by two days per decade, indicating the importance of inconsistency checking as a key step in data quality assessment for volunteered geographic information. Initiatives that leverage volunteered geographic information can adapt this workflow to improve the quality of their datasets and the robustness of their scientific analyses.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Improvement in environmental accessibility via volunteered geographic information: a case study
    Limin Zeng
    Romina Kühn
    Gerhard Weber
    [J]. Universal Access in the Information Society, 2017, 16 : 939 - 949
  • [2] Improvement in environmental accessibility via volunteered geographic information: a case study
    Zeng, Limin
    Kuehn, Romina
    Weber, Gerhard
    [J]. UNIVERSAL ACCESS IN THE INFORMATION SOCIETY, 2017, 16 (04) : 939 - 949
  • [3] Anomaly detection for volunteered geographic information: a case study of Safecast data
    Xin, Yanan
    [J]. INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2022, 36 (07) : 1423 - 1442
  • [4] The impact of society on volunteered geographic information: The case of OpenStreetMap
    Mashhadi, Afra
    Quattrone, Giovanni
    Capra, Licia
    [J]. Lecture Notes in Geoinformation and Cartography, 2015, 0 (9783319142791): : 125 - 141
  • [5] Volunteered Geographic Information and Crowdsourcing Disaster Relief: A Case Study of the Haitian Earthquake
    Zook, Matthew
    Graham, Mark
    Shelton, Taylor
    Gorman, Sean
    [J]. WORLD MEDICAL & HEALTH POLICY, 2010, 2 (02): : 7 - 33
  • [6] Combining conventional and volunteered geographic information to identify and model forest recreational resources
    Upton, Vincent
    Ryan, Mary
    O'Donoghue, Cathal
    Dhubhain, Aine Ni
    [J]. APPLIED GEOGRAPHY, 2015, 60 : 69 - 76
  • [7] VOLUNTEERED GEOGRAPHIC INFORMATION (VGI) AND ENVIRONMENTAL MONITORING, A NEW APPROACH IN DEVELOPING COUNTRIES
    Manouchehri, Mohsen
    Moghaddam, Majid Kiavarz
    [J]. ISPRS INTERNATIONAL JOINT CONFERENCES OF THE 2ND GEOSPATIAL INFORMATION RESEARCH (GI RESEARCH 2017); THE 4TH SENSORS AND MODELS IN PHOTOGRAMMETRY AND REMOTE SENSING (SMPR 2017); THE 6TH EARTH OBSERVATION OF ENVIRONMENTAL CHANGES (EOEC 2017), 2017, 42-4 (W4): : 401 - 405
  • [8] The study of disaster situation awareness based on volunteered geographic information
    Zhao, Qiansheng
    Chen, Zi
    Li, Shengming
    Luo, Nianxue
    [J]. INTERNATIONAL CONFERENCE ON INTELLIGENT EARTH OBSERVING AND APPLICATIONS 2015, 2015, 9808
  • [9] A Design Study to Identify Inconsistencies in Kinship Information: the Case of the 1000 Genomes Project
    Aupetit, Michael
    Ullah, Ehsan
    Rawi, Reda
    Bensmail, Halima
    [J]. 2016 IEEE PACIFIC VISUALIZATION SYMPOSIUM (PACIFICVIS), 2016, : 254 - 258
  • [10] Analyzing the Contributor Activity of a Volunteered Geographic Information Project - The Case of OpenStreetMap
    Neis, Pascal
    Zipf, Alexander
    [J]. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2012, 1 (02) : 146 - 165